Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation
- PMID: 20436464
- PMCID: PMC3146043
- DOI: 10.1038/nbt.1621
Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation
Abstract
High-throughput mRNA sequencing (RNA-Seq) promises simultaneous transcript discovery and abundance estimation. However, this would require algorithms that are not restricted by prior gene annotations and that account for alternative transcription and splicing. Here we introduce such algorithms in an open-source software program called Cufflinks. To test Cufflinks, we sequenced and analyzed >430 million paired 75-bp RNA-Seq reads from a mouse myoblast cell line over a differentiation time series. We detected 13,692 known transcripts and 3,724 previously unannotated ones, 62% of which are supported by independent expression data or by homologous genes in other species. Over the time series, 330 genes showed complete switches in the dominant transcription start site (TSS) or splice isoform, and we observed more subtle shifts in 1,304 other genes. These results suggest that Cufflinks can illuminate the substantial regulatory flexibility and complexity in even this well-studied model of muscle development and that it can improve transcriptome-based genome annotation.
Figures








Comment in
-
Advancing RNA-Seq analysis.Nat Biotechnol. 2010 May;28(5):421-3. doi: 10.1038/nbt0510-421. Nat Biotechnol. 2010. PMID: 20458303 No abstract available.
Similar articles
-
Next-generation sequencing facilitates quantitative analysis of wild-type and Nrl(-/-) retinal transcriptomes.Mol Vis. 2011;17:3034-54. Epub 2011 Nov 23. Mol Vis. 2011. PMID: 22162623 Free PMC article.
-
CIDANE: comprehensive isoform discovery and abundance estimation.Genome Biol. 2016 Jan 30;17:16. doi: 10.1186/s13059-015-0865-0. Genome Biol. 2016. PMID: 26831908 Free PMC article.
-
Sparse linear modeling of next-generation mRNA sequencing (RNA-Seq) data for isoform discovery and abundance estimation.Proc Natl Acad Sci U S A. 2011 Dec 13;108(50):19867-72. doi: 10.1073/pnas.1113972108. Epub 2011 Dec 1. Proc Natl Acad Sci U S A. 2011. PMID: 22135461 Free PMC article.
-
Comparative evaluation of full-length isoform quantification from RNA-Seq.BMC Bioinformatics. 2021 May 25;22(1):266. doi: 10.1186/s12859-021-04198-1. BMC Bioinformatics. 2021. PMID: 34034652 Free PMC article. Review.
-
RNA-seq: from technology to biology.Cell Mol Life Sci. 2010 Feb;67(4):569-79. doi: 10.1007/s00018-009-0180-6. Epub 2009 Oct 27. Cell Mol Life Sci. 2010. PMID: 19859660 Free PMC article. Review.
Cited by
-
Transcriptome Analysis Reveals Immune and Antioxidant Defense Mechanisms in the Eriocheir japonica sinensis after Exposure to Ammonia.Animals (Basel). 2024 Oct 16;14(20):2981. doi: 10.3390/ani14202981. Animals (Basel). 2024. PMID: 39457912 Free PMC article.
-
A user-driven machine learning approach for RNA-based sample discrimination and hierarchical classification.STAR Protoc. 2023 Oct 27;4(4):102661. doi: 10.1016/j.xpro.2023.102661. Online ahead of print. STAR Protoc. 2023. PMID: 39491552 Free PMC article.
-
ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility.Mol Cell. 2013 Jan 10;49(1):18-29. doi: 10.1016/j.molcel.2012.10.015. Epub 2012 Nov 21. Mol Cell. 2013. PMID: 23177736 Free PMC article.
-
Identification of the zebrafish maternal and paternal transcriptomes.Development. 2013 Jul;140(13):2703-10. doi: 10.1242/dev.095091. Epub 2013 May 29. Development. 2013. PMID: 23720042 Free PMC article.
-
Using RNentropy to Detect Significant Variation in Gene Expression Across Multiple RNA-Seq or Single-Cell RNA-Seq Samples.Methods Mol Biol. 2021;2284:77-96. doi: 10.1007/978-1-0716-1307-8_6. Methods Mol Biol. 2021. PMID: 33835439
References
Publication types
MeSH terms
Substances
Associated data
- Actions
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases