Statistical analysis of the 5' untranslated region of human mRNA using "Oligo-Capped" cDNA libraries
- PMID: 10756096
- DOI: 10.1006/geno.2000.6076
Statistical analysis of the 5' untranslated region of human mRNA using "Oligo-Capped" cDNA libraries
Abstract
We constructed 34 types of human "full-length enriched" and "5'-end enriched" cDNA libraries based on the "Oligo-Capping" method. We randomly picked and sequenced 10,000 clones from these libraries. BLAST analysis showed that about 50% of the cDNAs were identical to known genes. Among them, we selected 954 species of cDNA that should represent the entire sequence from the mRNA start sites. Compared with previously reported sequences, they were on average 45 bp longer in the 5'-end. Using these cDNA data, we statistically analyzed the sequence features of the 5'UTR. The average length of the 5'UTR was 125 bp, and there was little correlation with the corresponding mRNA length (correlation coefficient = 0.26). Of the 954 species of 5'UTR, 459 contained no in-frame terminator codon, which is against the common belief. Two hundred seventy-eight species contained at least one ATG codon upstream of the initiator ATG codon. We identified 569 upstream ATGs, in total, 63% of which adequately satisfied Kozak's criteria. These findings are contrary to the typical translation initiation model, which states that translation is initiated from the "first" ATG codon.
Copyright 2000 Academic Press.
Similar articles
-
Full-length-enriched cDNA libraries from Echinococcus granulosus contain separate populations of oligo-capped and trans-spliced transcripts and a high level of predicted signal peptide sequences.Mol Biochem Parasitol. 2002 Jul;122(2):171-80. doi: 10.1016/s0166-6851(02)00098-1. Mol Biochem Parasitol. 2002. PMID: 12106871
-
The hepatitis C virus 5' untranslated region gene amplified by rapid amplification of cDNA ends and its secondary structure.Hepatobiliary Pancreat Dis Int. 2002 Aug;1(3):368-72. Hepatobiliary Pancreat Dis Int. 2002. PMID: 14607708
-
Characterization of 954 bovine full-CDS cDNA sequences.BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166. BMC Genomics. 2005. PMID: 16305752 Free PMC article.
-
Do the 5'untranslated domains of human cDNAs challenge the rules for initiation of translation (or is it vice versa)?Genomics. 2000 Dec 15;70(3):396-406. doi: 10.1006/geno.2000.6412. Genomics. 2000. PMID: 11161792 Review.
-
Interpreting cDNA sequences: some insights from studies on translation.Mamm Genome. 1996 Aug;7(8):563-74. doi: 10.1007/s003359900171. Mamm Genome. 1996. PMID: 8679005 Review.
Cited by
-
Evidence for conservation and selection of upstream open reading frames suggests probable encoding of bioactive peptides.BMC Genomics. 2006 Jan 26;7:16. doi: 10.1186/1471-2164-7-16. BMC Genomics. 2006. PMID: 16438715 Free PMC article.
-
Generation and analysis of large-scale expressed sequence tags (ESTs) from a full-length enriched cDNA library of porcine backfat tissue.BMC Genomics. 2006 Feb 27;7:36. doi: 10.1186/1471-2164-7-36. BMC Genomics. 2006. PMID: 16504160 Free PMC article.
-
Prediction of unidentified human genes on the basis of sequence similarity to novel cDNAs from cynomolgus monkey brain.Genome Biol. 2002;3(1):RESEARCH0006. doi: 10.1186/gb-2001-3-1-research0006. Epub 2001 Dec 19. Genome Biol. 2002. PMID: 11806829 Free PMC article.
-
Functional polymorphism in H2BFWT-5'UTR is associated with susceptibility to male infertility.J Cell Mol Med. 2009 Aug;13(8B):1942-1951. doi: 10.1111/j.1582-4934.2009.00830.x. Epub 2009 Jul 6. J Cell Mol Med. 2009. PMID: 19583817 Free PMC article.
-
Alternative splicing within the elk-1 5' untranslated region serves to modulate initiation events downstream of the highly conserved upstream open reading frame 2.Mol Cell Biol. 2012 May;32(9):1745-56. doi: 10.1128/MCB.06751-11. Epub 2012 Feb 21. Mol Cell Biol. 2012. PMID: 22354998 Free PMC article.
Publication types
MeSH terms
Substances
Associated data
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
- Actions
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials