Model-based genotype and ancestry estimation for potential hybrids with mixed-ploidy.


Shastry V(1), Adams PE(2), Lindtke D(3), Mandeville EG(4), Parchman TL(5), Gompert Z(6), Buerkle CA(1).
Author information:
(1)Department of Botany, University of Wyoming, Laramie, WY, USA.
(2)Department of Biological Sciences, University of Alabama, Tuscaloosa, AL, USA.
(3)Institute of Plant Sciences, University of Bern, Bern, Switzerland.
(4)Department of Integrative Biology, University of Guelph, Guelph, ON, Canada.
(5)Department of Biology, University of Nevada-Reno, Reno, NV, USA.
(6)Department of Biology, Utah State University, Logan, UT, USA.


Non-random mating among individuals can lead to spatial clustering of genetically similar individuals and population stratification. This deviation from panmixia is commonly observed in natural populations. Consequently, individuals can have parentage in single populations or involving hybridization between differentiated populations. Accounting for this mixture and structure is important when mapping the genetics of traits and learning about the formative evolutionary processes that shape genetic variation among individuals and populations. Stratified genetic relatedness among individuals is commonly quantified using estimates of ancestry that are derived from a statistical model. Development of these models for polyploid and mixed-ploidy individuals and populations has lagged behind those for diploids. Here, we extend and test a hierarchical Bayesian model, called entropy, which can use low-depth sequence data to estimate genotype and ancestry parameters in autopolyploid and mixed-ploidy individuals (including sex chromosomes and autosomes within individuals). Our analysis of simulated data illustrated the trade-off between sequencing depth and genome coverage and found lower error associated with low-depth sequencing across a larger fraction of the genome than with high-depth sequencing across a smaller fraction of the genome. The model has high accuracy and sensitivity as verified with simulated data and through analysis of admixture among populations of diploid and tetraploid Arabidopsis arenosa.