PT - JOURNAL ARTICLE AU - Flegontova, Olga AU - Işıldak, Ulaş AU - Yüncü, Eren AU - Williams, Matthew P. AU - Huber, Christian D. AU - Kočí, Jan AU - Vyazov, Leonid A. AU - Changmai, Piya AU - Flegontov, Pavel TI - Performance of <em>qpAdm</em>-based screens for genetic admixture on admixture-graph-shaped histories and stepping-stone landscapes AID - 10.1101/2023.04.25.538339 DP - 2025 Jan 01 TA - bioRxiv PG - 2023.04.25.538339 4099 - http://biorxiv.org/content/early/2025/02/03/2023.04.25.538339.short 4100 - http://biorxiv.org/content/early/2025/02/03/2023.04.25.538339.full AB - qpAdm is a statistical tool that is often used for testing large sets of alternative admixture models for a target population. Despite its popularity, qpAdm remains untested on two-dimensional stepping-stone landscapes and in situations with low pre-study odds (low ratio of true to false models). We tested high-throughput qpAdm protocols with typical properties such as number of source combinations per target, model complexity, model feasibility criteria, etc. Those protocols were applied to admixture-graph-shaped and stepping-stone simulated histories sampled randomly or systematically. We demonstrate that false discovery rates of high-throughput qpAdm protocols exceed 50% for many parameter combinations since: 1) pre-study odds are low and fall rapidly with increasing model complexity; 2) complex migration networks violate the assumptions of the method, hence there is poor correlation between qpAdm p-values and model optimality, contributing to low but non-zero false positive rate and low power; 3) although admixture fraction estimates between 0 and 1 are largely restricted to symmetric configurations of sources around a target, a small fraction of asymmetric highly non-optimal models have estimates in the same interval, contributing to the false positive rate. We also re-interpret large sets of qpAdm models from two studies in terms of source-target distance and symmetry and suggest improvements to qpAdm protocols: 1) temporal stratification of targets and proxy sources in the case of admixture-graph-shaped histories; 2) focused exploration of few models for increasing pre-study odds; dense landscape sampling for increasing power and stringent conditions on estimated admixture fractions for decreasing the false positive rate.Article Summary Proliferation in the archaeogenetic literature of protocols for detection of admixed groups based a so-called qpAdm algorithm became disconnected from performance testing: the only extensive study of qpAdm on simulated data showed that it performs well under an unrealistically simple demographic scenario. We found that false discoveries of gene flows by qpAdm on a collection of random admixture-graph-shaped histories and on complex stepping-stone landscapes are very common and provide guidelines for design of qpAdm protocols in archaeogenetic studies.Competing Interest StatementThe authors have declared no competing interest.