Abstract
There is recognition among human complex-trait geneticists that not only are many common assumptions made for the sake of statistical tractability (e.g., random mating, independence of parent/offspring environments) unlikely to apply in many contexts, but that methods reliant on such assumptions can yield misleading results, even in large samples. Investigations of the consequences of violating these assumptions so far have focused on individual perturbations operating in isolation. Here, we analyze widely used estimators of genetic architectural parameters, including LD-score regression and both population-based and within-family GWAS, across a broad array of perturbations to classical assumptions, such as multivariate assortative mating and vertical transmission (parental effects on offspring phenotypes not mediated by genetic inheritance). We find that widely-used statistical approaches are unreliable across a broad range of perturbations, and that structural sources of confounding often operate synergistically to distort conclusions. For example, mild multivariate assortative mating and vertical transmission together can dramatically inflate heritability estimates and GWAS false positive rates. Further, GWAS will become progressively more polluted by off-target associations as sample sizes increase. Given these challenges, we introduce xftsim, a forward time simulation library capable of modeling a wide range of genetic architectures, mating regimes, and transmission dynamics, to facilitate the systematic comparison of existing approaches and the development of robust methods. Together, our findings illustrate the importance of comprehensive sensitivity analysis and present a valuable tool for future research.
Competing Interest Statement
The authors have declared no competing interest.