Calibrating a coalescent simulation of human genome sequence variation
- PMID: 16251467
- PMCID: PMC1310645
- DOI: 10.1101/gr.3709305
Calibrating a coalescent simulation of human genome sequence variation
Abstract
Population genetic models play an important role in human genetic research, connecting empirical observations about sequence variation with hypotheses about underlying historical and biological causes. More specifically, models are used to compare empirical measures of sequence variation, linkage disequilibrium (LD), and selection to expectations under a "null" distribution. In the absence of detailed information about human demographic history, and about variation in mutation and recombination rates, simulations have of necessity used arbitrary models, usually simple ones. With the advent of large empirical data sets, it is now possible to calibrate population genetic models with genome-wide data, permitting for the first time the generation of data that are consistent with empirical data across a wide range of characteristics. We present here the first such calibrated model and show that, while still arbitrary, it successfully generates simulated data (for three populations) that closely resemble empirical data in allele frequency, linkage disequilibrium, and population differentiation. No assertion is made about the accuracy of the proposed historical and recombination model, but its ability to generate realistic data meets a long-standing need among geneticists. We anticipate that this model, for which software is publicly available, and others like it will have numerous applications in empirical studies of human genetics.
Figures





Similar articles
-
Accounting for long-range correlations in genome-wide simulations of large cohorts.PLoS Genet. 2020 May 5;16(5):e1008619. doi: 10.1371/journal.pgen.1008619. eCollection 2020 May. PLoS Genet. 2020. PMID: 32369493 Free PMC article.
-
Temporal challenges in detecting balancing selection from population genomic data.G3 (Bethesda). 2024 Jun 5;14(6):jkae069. doi: 10.1093/g3journal/jkae069. G3 (Bethesda). 2024. PMID: 38551137 Free PMC article.
-
Critical assessment of coalescent simulators in modeling recombination hotspots in genomic sequences.BMC Bioinformatics. 2014 Jan 3;15:3. doi: 10.1186/1471-2105-15-3. BMC Bioinformatics. 2014. PMID: 24387001 Free PMC article.
-
Linkage disequilibrium in humans: models and data.Am J Hum Genet. 2001 Jul;69(1):1-14. doi: 10.1086/321275. Epub 2001 Jun 14. Am J Hum Genet. 2001. PMID: 11410837 Free PMC article. Review.
-
Ancestral Population Genomics.Methods Mol Biol. 2019;1910:555-589. doi: 10.1007/978-1-4939-9074-0_18. Methods Mol Biol. 2019. PMID: 31278677 Review.
Cited by
-
Haplotype kernel association test as a powerful method to identify chromosomal regions harboring uncommon causal variants.Genet Epidemiol. 2013 Sep;37(6):560-70. doi: 10.1002/gepi.21740. Epub 2013 Jun 5. Genet Epidemiol. 2013. PMID: 23740760 Free PMC article.
-
Integrating the signatures of demic expansion and archaic introgression in studies of human population genomics.Curr Opin Genet Dev. 2016 Dec;41:140-149. doi: 10.1016/j.gde.2016.09.007. Epub 2016 Oct 13. Curr Opin Genet Dev. 2016. PMID: 27743539 Free PMC article. Review.
-
Genotype calling from next-generation sequencing data using haplotype information of reads.Bioinformatics. 2012 Apr 1;28(7):938-46. doi: 10.1093/bioinformatics/bts047. Epub 2012 Jan 27. Bioinformatics. 2012. PMID: 22285565 Free PMC article.
-
Positive selection on the osteoarthritis-risk and decreased-height associated variants at the GDF5 gene in East Asians.PLoS One. 2012;7(8):e42553. doi: 10.1371/journal.pone.0042553. Epub 2012 Aug 14. PLoS One. 2012. PMID: 22905146 Free PMC article.
-
Detecting the Genomic Signature of Divergent Selection in Presence of Gene Flow.Curr Genomics. 2015 Jun;16(3):194-202. doi: 10.2174/1389202916666150313230943. Curr Genomics. 2015. PMID: 26069459 Free PMC article.
References
-
- Ardlie, K.G., Kruglyak, L., and Seielstad, M. 2002. Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 3: 299–309. - PubMed
-
- Collins, F.S., Brooks, L.D., and Chakravarti, A. 1998. A DNA polymorphism discovery resource for research on human genetic variation. Genome Res. 8: 1229–1231. - PubMed
-
- Crawford, D.C., Bhangale, T., Li, N., Hellenthal, G., Rieder, M.J., Nickerson, D.A., and Stephens, M. 2004. Evidence for substantial fine-scale variation in recombination rates across the human genome. Nat. Genet. 36: 700–706. - PubMed
Web site references
-
- http://www.broad.mit.edu/∼sfs/cosi; authors' Web site.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials