A novel bioinformatics pipeline for identification and characterization of fusion transcripts in breast cancer and normal cell lines
- PMID: 21622959
- PMCID: PMC3159479
- DOI: 10.1093/nar/gkr362
A novel bioinformatics pipeline for identification and characterization of fusion transcripts in breast cancer and normal cell lines
Abstract
SnowShoes-FTD, developed for fusion transcript detection in paired-end mRNA-Seq data, employs multiple steps of false positive filtering to nominate fusion transcripts with near 100% confidence. Unique features include: (i) identification of multiple fusion isoforms from two gene partners; (ii) prediction of genomic rearrangements; (iii) identification of exon fusion boundaries; (iv) generation of a 5'-3' fusion spanning sequence for PCR validation; and (v) prediction of the protein sequences, including frame shift and amino acid insertions. We applied SnowShoes-FTD to identify 50 fusion candidates in 22 breast cancer and 9 non-transformed cell lines. Five additional fusion candidates with two isoforms were confirmed. In all, 30 of 55 fusion candidates had in-frame protein products. No fusion transcripts were detected in non-transformed cells. Consideration of the possible functions of a subset of predicted fusion proteins suggests several potentially important functions in transformation, including a possible new mechanism for overexpression of ERBB2 in a HER-positive cell line. The source code of SnowShoes-FTD is provided in two formats: one configured to run on the Sun Grid Engine for parallelization, and the other formatted to run on a single LINUX node. Executables in PERL are available for download from our web site: http://mayoresearch.mayo.edu/mayo/research/biostat/stand-alone-packages.cfm.
Figures




Similar articles
-
Detection of redundant fusion transcripts as biomarkers or disease-specific therapeutic targets in breast cancer.Cancer Res. 2012 Apr 15;72(8):1921-8. doi: 10.1158/0008-5472.CAN-11-3142. Epub 2012 Apr 10. Cancer Res. 2012. PMID: 22496456
-
The eSNV-detect: a computational system to identify expressed single nucleotide variants from transcriptome sequencing data.Nucleic Acids Res. 2014 Dec 16;42(22):e172. doi: 10.1093/nar/gku1005. Epub 2014 Oct 28. Nucleic Acids Res. 2014. PMID: 25352556 Free PMC article.
-
Identification of differentially expressed genes and typical fusion genes associated with three subtypes of breast cancer.Breast Cancer. 2019 May;26(3):305-316. doi: 10.1007/s12282-018-0924-y. Epub 2018 Nov 16. Breast Cancer. 2019. PMID: 30446971
-
SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data.Genome Biol. 2013 Feb 14;14(2):R12. doi: 10.1186/gb-2013-14-2-r12. Genome Biol. 2013. PMID: 23409703 Free PMC article.
-
FusionQ: a novel approach for gene fusion detection and quantification from paired-end RNA-Seq.BMC Bioinformatics. 2013 Jun 15;14:193. doi: 10.1186/1471-2105-14-193. BMC Bioinformatics. 2013. PMID: 23768108 Free PMC article.
Cited by
-
Reanalysis of RNA-sequencing data reveals several additional fusion genes with multiple isoforms.PLoS One. 2012;7(10):e48745. doi: 10.1371/journal.pone.0048745. Epub 2012 Oct 31. PLoS One. 2012. PMID: 23119097 Free PMC article.
-
Novel CSF1-S100A10 fusion gene and CSF1 transcript identified by RNA sequencing in tenosynovial giant cell tumors.Int J Oncol. 2014 May;44(5):1425-32. doi: 10.3892/ijo.2014.2326. Epub 2014 Mar 5. Int J Oncol. 2014. PMID: 24604026 Free PMC article.
-
Recurrent PAX3-MAML3 fusion in biphenotypic sinonasal sarcoma.Nat Genet. 2014 Jul;46(7):666-8. doi: 10.1038/ng.2989. Epub 2014 May 25. Nat Genet. 2014. PMID: 24859338 Free PMC article.
-
Fusion transcriptome profiling provides insights into alveolar rhabdomyosarcoma.Proc Natl Acad Sci U S A. 2016 Nov 15;113(46):13126-13131. doi: 10.1073/pnas.1612734113. Epub 2016 Oct 31. Proc Natl Acad Sci U S A. 2016. PMID: 27799565 Free PMC article.
-
Folate receptor-α (FOLR1) expression and function in triple negative tumors.PLoS One. 2015 Mar 27;10(3):e0122209. doi: 10.1371/journal.pone.0122209. eCollection 2015. PLoS One. 2015. PMID: 25816016 Free PMC article.
References
-
- Tomlins SA, Rhodes DR, Perner S, Dhanasekaran SM, Mehra R, Sun XW, Varambally S, Cao X, Tchinda J, Kuefer R, et al. Recurrent fusion of TMPRSS2 and ETS transcription factor genes in prostate cancer. Science. 2005;310:644–648. - PubMed
-
- Soda M, Choi YL, Enomoto M, Takada S, Yamashita Y, Ishikawa S, Fujiwara S, Watanabe H, Kurashina K, Hatanaka H, et al. Identification of the transforming EML4-ALK fusion gene in non-small-cell lung cancer. Nature. 2007;448:561–566. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Research Materials
Miscellaneous