Skip to main navigation Skip to search Skip to main content

Full-length transcriptome assembly from RNA-Seq data without a reference genome

Manfred G. Grabherr, Brian J. Haas, Moran Yassour, Joshua Z. Levin, Dawn A. Thompson, Ido Amit, Xian Adiconis, Lin Fan, Raktima Raychowdhury, Qiandong Zeng, Zehua Chen, Evan Mauceli, Nir Hacohen, Andreas Gnirke, Nicholas Rhind, Federica Di Palma, Bruce W. Birren, Chad Nusbaum, Kerstin Lindblad-Toh, Nir FriedmanAviv Regev

Research output: Contribution to journalArticlepeer-review

Abstract

Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

Original languageEnglish
Pages (from-to)644-652
Number of pages9
JournalNature biotechnology
Volume29
Issue number7
DOIs
StatePublished - Jul 2011

All Science Journal Classification (ASJC) codes

  • Biotechnology
  • Bioengineering
  • Applied Microbiology and Biotechnology
  • Molecular Medicine
  • Biomedical Engineering

Fingerprint

Dive into the research topics of 'Full-length transcriptome assembly from RNA-Seq data without a reference genome'. Together they form a unique fingerprint.

Cite this