Abstract
Hybridization provides a window into the speciation process and reshuffles parental alleles to produce novel recombinant genotypes. The presence or absence of specific hybrid classes across a hybrid zone can provide support for various modes of reproductive isolation. Early generation hybrid classes can be distinguished by their combination of hybrid index and interclass heterozygosity, which can be estimated with molecular data. Hybrid index and interclass heterozygosity are routinely calculated for studies of hybrid zones, but available resources for next-generation sequencing datasets are computationally demanding and tools for visualizing those metrics as a triangle plot are lacking. Here, we provide a resource for identifying ancestry- informative markers (AIMs) from SNP datasets, calculating hybrid index and interclass heterozygosity, and visualizing the relationship as a triangle plot. Our methods are implemented in the R package triangulaR. We validate our methods by simulating genetic data for a hybrid zone between parental groups at low, medium, and high levels of divergence. We find that accurate and precise estimates of hybrid index and interclass heterozygosity can be obtained with sample sizes as low as five individuals per parental group. We explore various allele frequency difference thresholds for AIM identification, and how this threshold influences the accuracy and precision of hybrid index and interclass heterozygosity estimates. We contextualize interpretation of triangle plots by describing the theoretical expectations for covariance of hybrid index and interclass heterozygosity under Hardy-Weinberg Equilibrium and provide recommendations for best practices for identifying AIMs and building triangle plots.
Competing Interest Statement
The authors have declared no competing interest.