Key Points
-
Powered by improving next-generation sequencing capabilities, RNA-seq studies continue to provide knowledge about the quantitative and qualitative aspects of transcriptomes in both prokaryotes and eukaryotes.
-
Advances have been made in areas including the cataloguing of sense and antisense transcripts, alternative splicing events, fused transcripts and transcription initiation sites in physiologically normal and disease settings.
-
RNA-seq studies traditionally rely on sequencing of cDNA libraries that are generated from RNAs through various reverse transcription and sample preparation strategies. Biases and artefacts introduced by these experimental steps may hinder some applications.
-
Recent technological developments, including direct RNA sequencing, may alleviate some of the limitations of current RNA-seq approaches and enable new paths of research into transcriptomics.
-
Although several technological barriers still remain, major advances towards reliable analyses of RNAs from limited cell quantities have recently been achieved, paving the way towards transcriptome profiling at the single-cell level.
Abstract
In the few years since its initial application, massively parallel cDNA sequencing, or RNA-seq, has allowed many advances in the characterization and quantification of transcriptomes. Recently, several developments in RNA-seq methods have provided an even more complete characterization of RNA transcripts. These developments include improvements in transcription start site mapping, strand-specific measurements, gene fusion detection, small RNA characterization and detection of alternative splicing events. Ongoing developments promise further advances in the application of RNA-seq, particularly direct RNA sequencing and approaches that allow RNA quantification from very small amounts of cellular materials.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
209,00 ⬠per year
only 17,42 ⬠per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout





Similar content being viewed by others
References
Birney, E. et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799â816 (2007).
Berretta, J. & Morillon, A. Pervasive transcription constitutes a new level of eukaryotic genome regulation. EMBO Rep. 10, 973â982 (2009).
Kapranov, P., Willingham, A. T. & Gingeras, T. R. Genome-wide transcription and the implications for genomic organization. Nature Rev. Genet. 8, 413â423 (2007).
Metzker, M. L. Sequencing technologies â the next generation. Nature Rev. Genet. 11, 31â46 (2010). This Review provides a comprehensive overview of currently available and in-development NGS technologies.
Wang, Z., Gerstein, M. & Snyder, M. RNA-seq: a revolutionary tool for transcriptomics. Nature Rev. Genet. 10, 57â63 (2009).
van Vliet, A. H. Next generation sequencing of microbial transcriptomes: challenges and opportunities. FEMS Microbiol. Lett. 302, 1â7 (2010).
Ozsolak, F. et al. Direct RNA sequencing. Nature 461, 814â818 (2009). The first technology for high-throughput direct sequencing of RNA molecules without prior reverse transcription.
Carninci, P. et al. Genome-wide analysis of mammalian promoter architecture and evolution. Nature Genet. 38, 626â635 (2006).
Shiraki, T. et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl Acad. Sci. USA 100, 15776â15781 (2003).
Valen, E. et al. Genome-wide detection and analysis of hippocampus core promoters using DeepCAGE. Genome Res. 19, 255â265 (2009).
Ni, T. et al. A paired-end sequencing strategy to map the complex landscape of transcription initiation. Nature Methods 7, 521â527 (2010).
Plessy, C. et al. Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan. Nature Methods 7, 528â534 (2010).
Marson, A. et al. Connecting microRNA genes to the core transcriptional regulatory circuitry of embryonic stem cells. Cell 134, 521â533 (2008).
Ozsolak, F. et al. Chromatin structure analyses identify miRNA promoters. Genes Dev. 22, 3172â3183 (2008).
Affymetrix/Cold Spring Harbor Laboratory ENCODE Transcriptome Project. Post-transcriptional processing generates a diversity of 5â²-modified long and short RNAs. Nature 457, 1028â1032 (2009). This paper raises the possibility of 5â²-cap addition during post-transcriptional processing steps.
Faghihi, M. A. & Wahlestedt, C. Regulatory roles of natural antisense transcripts. Nature Rev. Mol. Cell Biol. 10, 637â643 (2009). An excellent review of the literature on sense and antisense transcription.
Gubler, U. Second-strand cDNA synthesis: mRNA fragments as primers. Meth. Enzymol. 152, 330â335 (1987).
Perocchi, F., Xu, Z., Clauder-Munster, S. & Steinmetz, L. M. Antisense artifacts in transcriptome microarray experiments are resolved by actinomycin D. Nucleic Acids Res. 35, e128 (2007).
Spiegelman, S. et al. DNA-directed DNA polymerase activity in oncogenic RNA viruses. Nature 227, 1029â1031 (1970).
Wu, J. Q. et al. Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome. Genome Biol. 9, R3 (2008).
Cloonan, N. et al. Stem cell transcriptome profiling via massive-scale mRNA sequencing. Nature Methods 5, 613â619 (2008).
Core, L. J., Waterfall, J. J. & Lis, J. T. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science 322, 1845â1848 (2008).
Mamanova, L. et al. FRT-seq: amplification-free, strand-specific transcriptome sequencing. Nature Methods 7, 130â132 (2010).
Lipson, D. et al. Quantification of the yeast transcriptome by single-molecule sequencing. Nature Biotechnol. 27, 652â658 (2009).
Ozsolak, F. et al. Digital transcriptome profiling from attomole-level RNA samples. Genome Res. 20, 519â525 (2010).
Ozsolak, F. et al. Amplification-free digital gene expression profiling from minute cell quantities. Nature Methods 7, 619â621 (2010).
He, Y., Vogelstein, B., Velculescu, V. E., Papadopoulos, N. & Kinzler, K. W. The antisense transcriptomes of human cells. Science 322, 1855â1857 (2008).
Parkhomchuk, D. et al. Transcriptome analysis by strand-specific sequencing of complementary DNA. Nucleic Acids Res. 37, e123 (2009).
Ingolia, N. T., Ghaemmaghami, S., Newman, J. R. & Weissman, J. S. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324, 218â223 (2009).
Levin, J. Z. et al. Comprehensive comparative analysis of strand-specific RNA sequencing methods. Nature Methods 7, 709â715 (2010).
Faulhammer, D., Lipton, R. J. & Landweber, L. F. Fidelity of enzymatic ligation for DNA computing. J. Comput. Biol. 7, 839â848 (2000).
Housby, J. N. & Southern, E. M. Fidelity of DNA ligation: a novel experimental approach based on the polymerisation of libraries of oligonucleotides. Nucleic Acids Res. 26, 4259â4266 (1998).
Dohm, J. C., Lottaz, C., Borodina, T. & Himmelbauer, H. Substantial biases in ultra-short read data sets from high-throughput DNA sequencing. Nucleic Acids Res. 36, e105 (2008).
Goren, A. et al. Chromatin profiling by directly sequencing small quantities of immunoprecipitated DNA. Nature Methods 7, 47â49 (2010).
Kozarewa, I. et al. Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes. Nature Methods 6, 291â295 (2009).
Nilsen, T. W. & Graveley, B. R. Expansion of the eukaryotic proteome by alternative splicing. Nature 463, 457â463 (2010).
Wang, G. S. & Cooper, T. A. Splicing in disease: disruption of the splicing code and the decoding machinery. Nature Rev. Genet. 8, 749â761 (2007).
Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-seq. Nature Methods 5, 621â628 (2008).
Sultan, M. et al. A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome. Science 321, 956â960 (2008).
Tang, F. et al. mRNA-seq whole-transcriptome analysis of a single cell. Nature Methods 6, 377â382 (2009).
Wang, E. T. et al. Alternative isoform regulation in human tissue transcriptomes. Nature 456, 470â476 (2008).
Carninci, P. Is sequencing enlightenment ending the dark age of the transcriptome? Nature Methods 6, 711â713 (2009).
Jiang, H. & Wong, W. H. Statistical inferences for isoform expression in RNA-seq. Bioinformatics 25, 1026â1032 (2009).
Richard, H. et al. Prediction of alternative isoforms from exon expression levels in RNA-seq experiments. Nucleic Acids Res. 38, e112 (2010).
Ameur, A., Wetterbom, A., Feuk, L. & Gyllensten, U. Global and unbiased detection of splice junctions from RNA-seq data. Genome Biol. 11, R34 (2010).
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-seq. Bioinformatics 25, 1105â1111 (2009).
Eid, J. et al. Real-time DNA sequencing from single polymerase molecules. Science 323, 133â138 (2009).
Olasagasti, F. et al. Replication of individual DNA molecules under electronic control using a protein nanopore. Nature Nanotech. 5, 798â806 (2010).
Mitelman, F., Johansson, B. & Mertens, F. The impact of translocations and gene fusions on cancer causation. Nature Rev. Cancer 7, 233â245 (2007).
Korbel, J. O. et al. Paired-end mapping reveals extensive structural variation in the human genome. Science 318, 420â426 (2007).
Maher, C. A. et al. Transcriptome sequencing to detect gene fusions in cancer. Nature 458, 97â101 (2009).
Zhao, Q. et al. Transcriptome-guided characterization of genomic rearrangements in a breast cancer cell line. Proc. Natl Acad. Sci. USA 106, 1886â1891 (2009).
Li, H., Wang, J., Mor, G. & Sklar, J. A neoplastic gene fusion mimics trans-splicing of RNAs in normal human cells. Science 321, 1357â1361 (2008).
Maher, C. A. et al. Chimeric transcript discovery by paired-end transcriptome sequencing. Proc. Natl Acad. Sci. USA 106, 12353â12358 (2009).
Berger, M. F. et al. Integrative analysis of the melanoma transcriptome. Genome Res. 20, 413â427 (2010).
Palanisamy, N. et al. Rearrangements of the RAF kinase pathway in prostate cancer, gastric cancer and melanoma. Nature Med. 16, 793â798 (2010).
McManus, C. J., Duff, M. O., Eipper-Mains, J. & Graveley, B. R. Global analysis of trans-splicing in Drosophila. Proc. Natl Acad. Sci. USA 107, 12975â12979 (2010).
Garber, K. Fixing the front end. Nature Biotech. 26, 1101â1104 (2008).
Chepelev, I., Wei, G., Tang, Q. & Zhao, K. Detection of single nucleotide variations in expressed exons of the human genome using RNA-seq. Nucleic Acids Res. 37, e106 (2009).
Cirulli, E. T. et al. Screening the human exome: a comparison of whole genome and whole transcriptome sequencing. Genome Biol. 11 (2010).
Shah, S. P. et al. Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution. Nature 461, 809â813 (2009).
Levin, J. Z. et al. Targeted next-generation sequencing of a cancer transcriptome enhances detection of sequence variants and novel fusion transcripts. Genome Biol. 10, R115 (2009).
Li, J. B. et al. Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing. Science 324, 1210â1213 (2009).
Zhang, K. et al. Digital RNA allelotyping reveals tissue-specific and allele-specific gene expression in human. Nature Methods 6, 613â618 (2009).
Ghildiyal, M. & Zamore, P. D. Small silencing RNAs: an expanding universe. Nature Rev. Genet. 10, 94â108 (2009).
Rajagopalan, R., Vaucheret, H., Trejo, J. & Bartel, D. P. A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 20, 3407â3425 (2006).
Ruby, J. G. et al. Large-scale sequencing reveals 21U-RNAs and additional microRNAs and endogenous siRNAs in C. elegans. Cell 127, 1193â1207 (2006).
Seila, A. C. et al. Divergent transcription from active promoters. Science 322, 1849â1851 (2008).
Taft, R. J. et al. Tiny RNAs associated with transcription start sites in animals. Nature Genet. 41, 572â578 (2009).
Berezikov, E. et al. Diversity of microRNAs in human and chimpanzee brain. Nature Genet. 38, 1375â1377 (2006).
Kapranov, P. et al. New class of gene-termini-associated human RNAs suggests a novel RNA copying mechanism. Nature 466, 642â646 (2010).
Lau, N. C., Lim, L. P., Weinstein, E. G. & Bartel, D. P. An abundant class of tiny RNAs with probable regulatory roles in Caenorhabditis elegans. Science 294, 858â862 (2001).
Kawaji, H. & Hayashizaki, Y. Exploration of small RNAs. PLoS Genet. 4, e22 (2008).
Linsen, S. E. et al. Limitations and possibilities of small RNA digital gene expression profiling. Nature Methods 6, 474â476 (2009). The authors describe the difficulties associated with the analysis and quantification of short RNA species using current NGS platforms.
Cocquet, J., Chong, A., Zhang, G. & Veitia, R. A. Reverse transcriptase template switching and false alternative transcripts. Genomics 88, 127â131 (2006).
Mader, R. M. et al. Reverse transcriptase template switching during reverse transcriptase-polymerase chain reaction: artificial generation of deletions in ribonucleotide reductase mRNA. J. Lab. Clin. Med. 137, 422â428 (2001).
Roy, S. W. & Irimia, M. When good transcripts go bad: artifactual RT-PCR 'splicing' and genome analysis. Bioessays 30, 601â605 (2008).
Chen, D. & Patton, J. T. Reverse transcriptase adds nontemplated nucleotides to cDNAs during 5â²-RACE and primer extension. Biotechniques 30, 574â582 (2001).
Roberts, J. D. et al. Fidelity of two retroviral reverse transcriptases during DNA-dependent DNA synthesis in vitro. Mol. Cell. Biol. 9, 469â476 (1989).
Armour, C. D. et al. Digital transcriptome profiling using selective hexamer priming for cDNA synthesis. Nature Methods 6, 647â649 (2009).
Hansen, K. D., Brenner, S. E. & Dudoit, S. Biases in Illumina transcriptome sequencing caused by random hexamer priming. Nucleic Acids Res. 38, e131 (2010).
Rosenkranz, R., Borodina, T., Lehrach, H. & Himmelbauer, H. Characterizing the mouse ES cell transcriptome with Illumina sequencing. Genomics 92, 187â194 (2008).
Oshlack, A. & Wakefield, M. J. Transcript length bias in RNA-seq data confounds systems biology. Biol. Direct 4, 14 (2009).
Young, M. D., Wakefield, M. J., Smyth, G. K. & Oshlack, A. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol. 11, R14 (2010).
Li, B., Ruotti, V., Stewart, R. M., Thomson, J. A. & Dewey, C. N. RNA-seq gene expression estimation with read mapping uncertainty. Bioinformatics 26, 493â500 (2010).
Guttman, M. et al. Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nature Biotech. 28, 503â510 (2010).
Trapnell, C. et al. Transcript assembly and quantification by RNA-seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature Biotech. 28, 511â515 (2010).
Shcherbik, N., Wang, M., Lapik, Y. R., Srivastava, L. & Pestov, D. G. Polyadenylation and degradation of incomplete RNA polymerase I transcripts in mammalian cells. EMBO Rep. 11, 106â111 (2010).
Makeyev, E. V. & Bamford, D. H. Replicase activity of purified recombinant protein P2 of double-stranded RNA bacteriophage phi6. EMBO J. 19, 124â133 (2000).
Gurumurthy, S. et al. The Lkb1 metabolic sensor maintains haematopoietic stem cell survival. Nature 468, 659â63 (2010).
Ozsolak, F. et al. Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation. Cell 143, 1018â1029 (2010).
Simone, N. L., Bonner, R. F., Gillespie, J. W., Emmert-Buck, M. R. & Liotta, L. A. Laser-capture microdissection: opening the microscopic frontier to molecular analysis. Trends Genet. 14, 272â276 (1998).
Marcy, Y. et al. Dissecting biological âdark matterâ with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouth. Proc. Natl Acad. Sci. USA 104, 11889â11894 (2007).
Pfeifer, G. P., Steigerwald, S. D., Mueller, P. R., Wold, B. & Riggs, A. D. Genomic sequencing and methylation analysis by ligation mediated PCR. Science 246, 810â813 (1989).
Dean, F. B. et al. Comprehensive human genome amplification using multiple displacement amplification. Proc. Natl Acad. Sci. USA 99, 5261â5266 (2002).
Dafforn, A. et al. Linear mRNA amplification from as little as 5 ng total RNA for global gene expression analysis. Biotechniques 37, 854â857 (2004).
Eberwine, J. et al. Analysis of gene expression in single live neurons. Proc. Natl Acad. Sci. USA 89, 3010â3014 (1992).
Nygaard, V. & Hovig, E. Options available for profiling small samples: a review of sample amplification technology when combined with microarray profiling. Nucleic Acids Res. 34, 996â1014 (2006). This review provides a good overview of the current low-quantity RNA applications and the complications associated with them.
Geiss, G. K. et al. Direct multiplexed measurement of gene expression with color-coded probe pairs. Nature Biotech. 26, 317â325 (2008).
Amit, I. et al. Unbiased reconstruction of a mammalian transcriptional network mediating pathogen responses. Science 326, 257â263 (2009).
Byrne, J. A., Nguyen, H. N. & Reijo Pera, R. A. Enhanced generation of induced pluripotent stem cells from a subpopulation of human fibroblasts. PLoS ONE 4, e7118 (2009).
Helzer, K. T. et al. Circulating tumor cells are transcriptionally similar to the primary tumor in a murine prostate model. Cancer Res. 69, 7860â7866 (2009).
Lo, Y. M. et al. Plasma placental RNA allelic ratio permits noninvasive prenatal chromosomal aneuploidy detection. Nature Med. 13, 218â223 (2007). This paper describes the quantification of extracellular circulating RNA in mother's plasma during pregnancy to detect fetal aneuploidy.
Bowers, J. et al. Virtual terminator nucleotides for next-generation DNA sequencing. Nature Methods 6, 593â595 (2009).
Acknowledgements
We apologize to authors whose work could not be cited owing to space constraints. We are grateful to the US National Human Genome Research Institute for their support (grants R01 HG005230 and R44 HG005279).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
Faith Ozsolak and Patrice M. Milos are employees of Helicos BioSciences Corporation.
Related links
Glossary
- Next generation DNA sequencing
-
(Often abbreviated to NGS.) Non-Sanger-based high-throughput DNA sequencing technologies. Compared to Sanger sequencing, NGS platforms sequence as many as billions of DNA strands in parallel, yielding substantially more throughput and minimizing the need for the fragment-cloning methods that are often used in Sanger sequencing of genomes.
- Semisuppressive PCR
-
A PCR strategy that aims to reduce primer dimer accumulation by preferentially amplifying longer DNA fragments.
- Spike pool
-
Internal controls added to RNA samples, consisting of RNA elements of known sequence and composition.
- Paired-end reads
-
A strategy involving sequencing of two different regions that are located apart from each other on the same DNA fragment. This strategy provides elevated physical coverage and alleviates several limitations of NGS platforms that arise because of their relatively short read length.
- Laser capture microdissection
-
(Often abbreviated to LCM.) A method allowing cells of interest that are chosen by the operator using a microscope to be specifically captured from heterogeneous tissue samples. The isolated cells can be used for various analyses including of protein and nucleic acid.
- Quantitative real-time polymerase chain reaction
-
A PCR application that enables the measurement of nucleic acid quantities in samples. Nucleic acid of interest is amplified with PCR. The level of the amplified product accumulation during PCR cycles are measured in real time. This data is used to infer starting nucleic acid quantities.
- Circulating extracellular nucleic acid
-
Extracellular DNA or RNA molecules in plasma and serum.
Rights and permissions
About this article
Cite this article
Ozsolak, F., Milos, P. RNA sequencing: advances, challenges and opportunities. Nat Rev Genet 12, 87â98 (2011). https://doi.org/10.1038/nrg2934
Published:
Issue Date:
DOI: https://doi.org/10.1038/nrg2934