PT - JOURNAL ARTICLE AU - David Gerard TI - Scalable Bias-corrected Linkage Disequilibrium Estimation Under Genotype Uncertainty AID - 10.1101/2021.02.08.430270 DP - 2021 Jan 01 TA - bioRxiv PG - 2021.02.08.430270 4099 - http://biorxiv.org/content/early/2021/02/22/2021.02.08.430270.short 4100 - http://biorxiv.org/content/early/2021/02/22/2021.02.08.430270.full AB - Linkage disequilibrium (LD) estimates are often calculated genome-wide for use in many tasks, such as SNP pruning and LD decay estimation. However, in the presence of genotype uncertainty, naive approaches to calculating LD have extreme attenuation biases, incorrectly suggesting that SNPs are less dependent than in reality. These biases are particularly strong in polyploid organisms, which often exhibit greater levels of genotype uncertainty than diploids. A principled approach using maximum likelihood estimation with genotype likelihoods can reduce this bias, but is prohibitively slow for genome-wide applications. Here, we present scalable moment-based adjustments to LD estimates based on the marginal posterior distributions of the genotypes. We demonstrate, on both simulated and real data, that these moment-based estimators are as accurate as maximum likelihood estimators, and are almost as fast as naive approaches based only on posterior mean genotypes. This opens up bias-corrected LD estimation to genome-wide applications. Additionally, we provide standard errors for these moment-based estimators. All methods are implemented in the ldsep package on the Comprehensive R Archive Network https://cran.r-project.org/package=ldsep.Competing Interest StatementThe authors have declared no competing interest.