De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

Publication: Contribution to journal, Research article, Contributed, Peer-reviewed

Contributors

  • Brian J. Haas, Broad Institute of Harvard University and MIT (Author)
  • Alexie Papanicolaou, Commonwealth Scientific & Industrial Research Organisation (CSIRO) (Author)
  • Moran Yassour, Broad Institute of Harvard University and MIT (Author)
  • Manfred Grabherr (Author)
  • Philip D. Blood (Author)
  • Joshua Bowden (Author)
  • Matthew Brian Couger (Author)
  • David Eccles (Author)
  • Bo Li (Author)
  • Matthias Lieber, Zentrum für Informationsdienste und Hochleistungsrechnen (ZIH) (Author)
  • Matthew D. MacManes (Author)
  • Michael Ott, Commonwealth Scientific & Industrial Research Organisation (CSIRO) (Author)
  • Joshua Orvis (Author)
  • Nathalie Pochet, Broad Institute of Harvard University and MIT (Author)
  • Francesco Strozzi (Author)
  • Nathan Weeks (Author)
  • Rick Westerman (Author)
  • Thomas William, GWT-TUD GmbH (Author)
  • Colin N. Dewey (Author)
  • Robert Henschel, Indiana University Bloomington (Author)
  • Richard D. LeDuc, Indiana University Bloomington (Author)
  • Nir Friedman (Author)
  • Aviv Regev, Broad Institute of Harvard University and MIT (Author)

Abstract

De novo assembly of RNA-seq data enables researchers to study transcriptomes without the need for a genome sequence; this approach can be usefully applied, for instance, in research on 'non-model organisms' of ecological and evolutionary importance, cancer samples or the microbiome. In this protocol we describe the use of the Trinity platform for de novo transcriptome assembly from RNA-seq data in non-model organisms. We also present Trinity-supported companion utilities for downstream applications, including RSEM for transcript abundance estimation, R/Bioconductor packages for identifying differentially expressed transcripts across samples and approaches to identify protein-coding genes. In the procedure, we provide a workflow for genome-independent transcriptome analysis leveraging the Trinity platform. The software, documentation and demonstrations are freely available from http://trinityrnaseq.sourceforge.net. The run time of this protocol is highly dependent on the size and complexity of data to be analyzed. The example data set analyzed in the procedure detailed herein can be processed in less than 5 h.

Details

Original language: English
Pages (from - to): 1494-1512
Number of pages: 19
Journal: Nature Protocols
Issue number: 8
Publication status: Published - 2013
Peer-review status: Yes

External IDs

Scopus 84880266648
ORCID /0000-0003-3137-0648/work/142238863
