Most "dark matter" transcripts are associated with known genes
- PMID: 20502517
- PMCID: PMC2872640
- DOI: 10.1371/journal.pbio.1000371
Most "dark matter" transcripts are associated with known genes
Abstract
A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single- and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions ("seqfrags") outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter- and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures








Comment in
-
Dark matter transcripts: sound and fury, signifying nothing?PLoS Biol. 2010 May 18;8(5):e1000370. doi: 10.1371/journal.pbio.1000370. PLoS Biol. 2010. PMID: 20502697 Free PMC article. No abstract available.
-
Transcriptomics: Throwing light on dark matter.Nat Rev Genet. 2010 Jul;11(7):455. doi: 10.1038/nrg2819. Nat Rev Genet. 2010. PMID: 20559328 No abstract available.
-
How to find TUFs.Nat Methods. 2010 Aug;7(8):582. doi: 10.1038/nmeth0810-582. Nat Methods. 2010. PMID: 20704018 No abstract available.
Similar articles
-
A systematic search for new mammalian noncoding RNAs indicates little conserved intergenic transcription.BMC Genomics. 2005 Aug 5;6:104. doi: 10.1186/1471-2164-6-104. BMC Genomics. 2005. PMID: 16083503 Free PMC article.
-
A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome.Science. 2008 Aug 15;321(5891):956-60. doi: 10.1126/science.1160342. Epub 2008 Jul 3. Science. 2008. PMID: 18599741
-
Global identification of human transcribed sequences with genome tiling arrays.Science. 2004 Dec 24;306(5705):2242-6. doi: 10.1126/science.1103388. Epub 2004 Nov 11. Science. 2004. PMID: 15539566
-
Transcribed dark matter: meaning or myth?Hum Mol Genet. 2010 Oct 15;19(R2):R162-8. doi: 10.1093/hmg/ddq362. Epub 2010 Aug 25. Hum Mol Genet. 2010. PMID: 20798109 Free PMC article. Review.
-
Dark matter in the genome: evidence of widespread transcription detected by microarray tiling experiments.Trends Genet. 2005 Feb;21(2):93-102. doi: 10.1016/j.tig.2004.12.009. Trends Genet. 2005. PMID: 15661355 Review.
Cited by
-
VlincRNAs controlled by retroviral elements are a hallmark of pluripotency and cancer.Genome Biol. 2013 Jul 22;14(7):R73. doi: 10.1186/gb-2013-14-7-r73. Genome Biol. 2013. PMID: 23876380 Free PMC article.
-
Characteristics and significance of intergenic polyadenylated RNA transcription in Arabidopsis.Plant Physiol. 2013 Jan;161(1):210-24. doi: 10.1104/pp.112.205245. Epub 2012 Nov 6. Plant Physiol. 2013. PMID: 23132786 Free PMC article.
-
PVT1 Expression Is a Predictor for Poor Survival of Prostate Cancer Patients.Technol Cancer Res Treat. 2021 Jan-Dec;20:1533033820971610. doi: 10.1177/1533033820971610. Technol Cancer Res Treat. 2021. PMID: 33752525 Free PMC article.
-
"Hypothesis for the modern RNA world": a pervasive non-coding RNA-based genetic regulation is a prerequisite for the emergence of multicellular complexity.Orig Life Evol Biosph. 2011 Dec;41(6):587-607. doi: 10.1007/s11084-011-9262-1. Epub 2012 Feb 10. Orig Life Evol Biosph. 2011. PMID: 22322874
-
Long Non-Coding RNAs in the Pathogenesis of Diabetic Kidney Disease.Front Cell Dev Biol. 2022 Apr 20;10:845371. doi: 10.3389/fcell.2022.845371. eCollection 2022. Front Cell Dev Biol. 2022. PMID: 35517509 Free PMC article. Review.
References
-
- Kapranov P, Cawley S. E, Drenkow J, Bekiranov S, Strausberg R. L, et al. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002;296:916–919. - PubMed
-
- Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, et al. Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science. 2005;308:1149–1154. - PubMed
-
- Kapranov P, Cheng J, Dike S, Nix D. A, Duttagupta R, et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007;316:1484–1488. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases