Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies
====================================================================================================================

* Wei Zhou
* Jonas B. Nielsen
* Lars G. Fritsche
* Rounak Dey
* Maiken B. Elvestad
* Brooke N. Wolford
* Jonathon LeFaive
* Peter VandeHaar
* Aliya Gifford
* Lisa A. Bastarache
* Wei-Qi Wei
* Joshua C. Denny
* Maoxuan Lin
* Kristian Hveem
* Hyun Min Kang
* Goncalo R. Abecasis
* Cristen J. Willer
* Seunggeun Lee

## Abstract

In genome-wide association studies (GWAS) for thousands of phenotypes in large biobanks, most binary traits have substantially fewer cases than controls. Both of the widely used approaches, the linear mixed model and the recently proposed logistic mixed model, perform poorly in the analysis of phenotypes with unbalanced case-control ratios, producing large type I error rates. Here we propose a scalable and accurate generalized mixed model association test that uses the saddlepoint approximation (SPA) to calibrate the distribution of score test statistics. This method, SAIGE, provides accurate p-values even when case-control ratios are extremely unbalanced. It utilizes state-of-the-art optimization strategies to reduce the computational time and memory cost of the generalized mixed model. The computation cost depends linearly on sample size, and hence the method is applicable to GWAS for thousands of phenotypes in large biobanks. Through the analysis of UK Biobank data of 408,961 white British European-ancestry samples, we show that SAIGE can efficiently analyze large sample data, controlling for unbalanced case-control ratios and sample relatedness.

## Introduction

Decreases in genotyping cost allow large biobanks to genotype all participants, enabling genome-wide phenome-wide association studies (PheWAS) in hundreds of thousands of samples. In a typical genome-wide PheWAS, GWAS for tens of millions of variants are performed for thousands of phenotypes constructed from Electronic Health Records (EHR) and/or survey questionnaires from participants in large cohorts1–18. For binary traits based on disease/condition status in PheWAS, cases are typically defined as individuals with specific International Classification of Disease (ICD) codes within the EHR. Controls are usually all participants without the same or other related conditions1,3. Due to the low prevalence of many conditions/diseases, case-control ratios are often unbalanced (case:control = 1:10) or extremely unbalanced (case:control < 1:100). The scale of data and the unbalanced nature of binary traits pose substantial challenges for genome-wide PheWAS in biobanks.

Population structure and relatedness are major confounders in genetic association studies and also need to be controlled in PheWAS. Linear mixed models (LMM) are widely used to account for these issues in GWAS for both binary and quantitative traits19–25. However, since LMM is not designed to analyze binary traits, it can have inflated type I error rates, especially in the presence of unbalanced case-control ratios. Recently, Chen, H. *et al*. proposed using logistic mixed models and developed a score test called the generalized mixed model association test (GMMAT)26. GMMAT assumes that score test statistics asymptotically follow a Gaussian distribution to estimate asymptotic p-values. Although GMMAT test statistics are more robust than those of the LMM-based approaches, GMMAT can also suffer type I error rate inflation when case-control ratios are unbalanced, because unbalanced case-control ratios invalidate the asymptotic assumptions of logistic regression. In addition, since GMMAT requires O(*MN*²) computation and O(*N*²) memory space, where *M* is the number of genetic variants to be tested and *N* is the number of individuals, it cannot handle data with hundreds of thousands of samples.

Here, we propose a novel method that allows analysis of very large samples for binary traits with unbalanced case-control ratios, and that also infers and accounts for sample relatedness. Our method, Scalable and Accurate Implementation of GEneralized mixed model (SAIGE), uses the saddlepoint approximation (SPA)27–29 to calibrate unbalanced case-control ratios in score tests based on logistic mixed models. Since SPA uses all the cumulants, and hence all the moments, it is more accurate than using the Gaussian distribution, which uses only the first two moments. Similar to BOLT-LMM25, the large-sample-size method for linear mixed models, our method utilizes state-of-the-art optimization strategies, such as the preconditioned conjugate gradient (PCG) approach30,31 for solving linear systems for large cohorts without requiring a pre-computed genetic relationship matrix (GRM). The overall computation cost of the proposed method is O(*MN*), which is substantially lower than the computation cost of GMMAT26 and many popular LMM methods, such as GEMMA24. In addition, we reduce the memory use by compactly storing raw genotypes instead of calculating and storing the GRM.

We have demonstrated that SAIGE controls for the inflated type I error rates for binary traits with unbalanced case-control ratios in related samples through simulation and the UK Biobank data of 408,961 white British samples32,33. By evaluating its computation performance, we demonstrate the feasibility of SAIGE for large-scale PheWAS.

## RESULTS

### Overview of Methods

The SAIGE method contains two main steps: (1) fitting the null logistic mixed model to estimate the variance component and other model parameters, and (2) testing for association between each genetic variant and the phenotype by applying SPA to the score test statistics. Step 1 iteratively estimates the model parameters using the computationally efficient average information restricted maximum likelihood (AI-REML) algorithm34, which is also used in GMMAT26. Several optimization strategies have been applied in step 1 to make fitting the null logistic mixed model practical for large data sets, such as the UK Biobank32,33. First, the spectral decomposition has been replaced by the PCG method to solve linear systems without inverting the *N* × *N* GRM31 (as in BOLT-LMM25). The PCG method iteratively finds solutions of the linear system in a computation- and memory-efficient way. Thus, instead of requiring a pre-computed GRM, which takes a significant amount of time to calculate when sample sizes are large, SAIGE uses the raw genotypes as input. The computation time is about O(*M*₁*N*) times the number of iterations for the conjugate gradient to converge, where *M*₁ is the number of variants used for constructing the GRM. Second, to further reduce the memory usage during model fitting, the raw genotypes are stored in a binary vector and elements of the GRM are calculated when needed rather than being stored, so the memory usage is *M*₁*N*/4 bytes (as in BOLT-LMM25 and GenABEL35). For example, for the UK Biobank data with *M*₁ = 93,511 and *N* = 408,961 (white British participants), the memory usage drops from 669 gigabytes (GB) for storing the GRM as floating-point numbers to 9.56 GB for the raw genotypes in a binary vector.
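The memory figures quoted above can be checked with a few lines of arithmetic. The sketch below assumes 4 bytes per GRM entry and 2 bits per stored genotype (which is what the *M*₁*N*/4 bytes figure implies). The helper names and the packing layout are illustrative choices, not taken from the SAIGE source code.

```python
import math

def grm_bytes(n_samples, bytes_per_entry=4):
    """Bytes needed to hold a dense N x N GRM (assumed 4-byte floats)."""
    return n_samples * n_samples * bytes_per_entry

def packed_genotype_bytes(n_markers, n_samples):
    """Bytes needed when each genotype takes 2 bits (4 genotypes per byte)."""
    return n_markers * n_samples / 4

def pack_genotypes(genotypes):
    """Pack allele counts (0, 1, 2; 3 used here for missing) into 2 bits each."""
    packed = bytearray(math.ceil(len(genotypes) / 4))
    for i, g in enumerate(genotypes):
        packed[i // 4] |= (g & 0b11) << (2 * (i % 4))
    return packed

def unpack_genotypes(packed, n):
    """Recover the first n genotypes from a packed byte array."""
    return [(packed[i // 4] >> (2 * (i % 4))) & 0b11 for i in range(n)]

# Values reported for the UK Biobank analysis in the text.
M1, N = 93_511, 408_961
print(f"dense GRM:        {grm_bytes(N) / 1e9:.0f} GB")
print(f"packed genotypes: {packed_genotype_bytes(M1, N) / 1e9:.2f} GB")
```

With the values from the text, the two printed figures match the 669 GB and 9.56 GB quoted above. The packing helpers only illustrate why four genotypes fit in one byte.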

After fitting the null logistic mixed model, the estimate of the random effects for each individual is obtained. The ratio of the variances of the score statistics with and without incorporating the variance components for the random effects is calculated using a subset of randomly selected genetic variants, similar to BOLT-LMM25 and GRAMMAR-Gamma36. This ratio has been previously suggested to be constant for score tests based on LMMs36. We have shown that the ratio is also approximately constant for all genetic variants with MAC ≥ 20 in the scenario of the logistic mixed models through analytic derivation and simulations (**Supplementary Notes and Supplementary Figure 1**).

In step 2, for each variant, the variance ratio is used to calibrate the score statistic variance that does not incorporate variance components for random effects. Since GRM is no longer needed for this step, the computation time to obtain the score statistic for each variant is O(*N*). SAIGE next approximates the score test statistics using the SPA to obtain more accurate p-values than the normal distribution. A faster version of the SPA test, similar to the fastSPA method in the SPAtest R package that we recently developed29, is used to further improve the computation time, which exploits the sparsity in low frequency or rare variants to reduce the computation cost.
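As a minimal sketch of the variance-ratio calibration described above, the function below computes an adjusted score statistic in a single O(*N*) pass. It assumes the genotype vector has already been covariate-adjusted, that the fitted null means are available, and that the variance without random effects takes the usual logistic form (sum of squared genotype times μ(1 − μ)). The function and argument names are hypothetical.

```python
import math

def adjusted_score_statistic(g_adj, y, mu_hat, variance_ratio):
    """Score statistic scaled by the calibrated variance, in one O(N) pass.

    g_adj          -- covariate-adjusted genotypes (assumed precomputed)
    y              -- binary phenotypes (0/1)
    mu_hat         -- fitted case probabilities under the null model
    variance_ratio -- estimated ratio r from step 1
    """
    # Score statistic: sum of genotype times residual.
    t = sum(g * (yi - m) for g, yi, m in zip(g_adj, y, mu_hat))
    # Variance that ignores the random-effect variance component.
    var_star = sum(g * g * m * (1.0 - m) for g, m in zip(g_adj, mu_hat))
    # Calibrate the cheap variance with the ratio, then standardize.
    return t / math.sqrt(variance_ratio * var_star)

print(adjusted_score_statistic([0, 1, 2, 0], [0, 1, 1, 0], [0.1, 0.2, 0.3, 0.1], 1.2))
```

No GRM appears in this step: the relatedness information enters only through the fitted means and the single scalar ratio.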

### Computation and Memory Cost

The key features of SAIGE compared to other existing methods are presented in Table 1, showing that SAIGE is the only mixed-model association method that is able to account for unbalanced case-control ratios while remaining computationally practical for large data sets. To further evaluate the computational performance of SAIGE, we randomly sampled subsets from the 408,458 white British UK Biobank participants who are defined as either coronary artery disease (CAD) cases (31,355) or controls (377,103) based on the PheWAS Code 411 (refs. 3,32,33), followed by benchmarking association tests using SAIGE and other existing methods on 200,000 genetic markers randomly selected out of the 71 million with imputation info ≥ 0.3. The non-genetic covariates sex, birth year, and principal components 1 to 4 were adjusted for in all tests. The log10 of the memory usage and projected computation time for testing the full set of 71 million genetic variants are plotted against the sample size in **Supplementary Figure 2** and listed in **Supplementary Table 1**. Although SAIGE and BOLT-LMM have the same order of computational complexity (Table 1), SAIGE was slower than BOLT-LMM across all sample sizes (e.g., 517 vs. 360 CPU hours when *N* = 408,458). This is because fitting the logistic mixed model requires more iterative steps than fitting the linear mixed model, and applying SPA requires additional computation. SAIGE requires slightly less memory than BOLT-LMM (10 to 11 GB when *N* = 408,458), and the low memory usage makes both methods feasible for the large data set. In contrast, GMMAT and GEMMA require substantially more computation time and memory. For example, when *N* = 400,000, the projected memory usage of both GMMAT and GEMMA is more than 600 GB. The actual computation time and memory usage of association tests for the full UK Biobank data for CAD are given in Table 1. SAIGE required 517 CPU hours and 10.3 GB of memory to analyze 71 million variants that have imputation info ≥ 0.3 for 408,458 samples, which indicates that the analysis can be completed in ~26 hours with 20 CPU cores.

[Table 1.](http://biorxiv.org/content/early/2017/11/24/212357/T1)

Table 1. Comparison of different methods for GWAS with mixed effect models

### Association analysis of binary traits in UK Biobank data

We applied SAIGE to several randomly selected binary traits defined by the PheWAS Codes (PheCodes) of UK Biobank3,32,33 and compared the association results with those obtained from the method based on linear mixed models, BOLT-LMM25, and from SAIGE without the saddlepoint approximation (SAIGE-NoSPA), which is asymptotically equivalent to GMMAT26. Due to its computation and memory cost, the current GMMAT method cannot analyze the UK Biobank data. We restricted our analysis to markers directly genotyped or imputed by the Haplotype Reference Consortium (HRC)37 panel due to quality control issues of non-HRC markers reported by the UK Biobank. Approximately 28 million markers with minor allele counts (MAC) ≥ 20 and imputation info score > 0.3 were used in the analysis. Among 408,961 white British participants in the UK Biobank, 132,179 have at least one relative up to the third degree among the genotyped individuals32,33. We used 93,511 high-quality genotyped variants to construct the GRM. In the UK Biobank data, most binary phenotypes based on PheCodes (1,437 out of 1,663; 86.4%) have a case-control ratio lower than 1:99 (**Supplementary Figure 3**) and would likely demonstrate problematic inflation of association test statistics without SPA.

Association results of three exemplary binary traits that have various case-control ratios are plotted in Manhattan plots shown in Figure 1 and in the quantile-quantile (QQ) plots stratified by minor allele frequency (MAF) shown in Figure 2. The three binary traits are coronary artery disease (PheCode 411) with 31,355 cases and 377,103 controls (1:12), colorectal cancer (PheCode 153) with 4,562 cases and 382,756 controls (1:84) and thyroid cancer (PheCode 193) with 358 cases and 407,399 controls (1:1138). In the Manhattan plots in Figure 1, each locus that contains any variant with p-value < 5×10⁻⁸ is highlighted in blue or green to indicate whether or not this locus has been reported by previous studies.

![Figure 1.](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/11/24/212357/F1.medium.gif)

[Figure 1.](http://biorxiv.org/content/early/2017/11/24/212357/F1)

Figure 1. Manhattan plots of association p-values resulting from SAIGE, SAIGE-NoSPA (asymptotically equivalent to GMMAT) and BOLT-LMM for A. coronary artery disease (PheCode 411, case:control = 1:12), B. colorectal cancer (PheCode 153, case:control = 1:84), and C. thyroid cancer (PheCode 193, case:control = 1:1138). Blue: loci that have association p-value < 5×10⁻⁸, where the top hits are previously reported. Green: loci that have association p-value < 5×10⁻⁸ and have not been reported before. Since results from SAIGE-NoSPA and BOLT-LMM contain many false positive signals for colorectal cancer and thyroid cancer, the significant loci are not highlighted.

![Figure 2.](http://biorxiv.org/https://www.biorxiv.org/content/biorxiv/early/2017/11/24/212357/F2.medium.gif)

[Figure 2.](http://biorxiv.org/content/early/2017/11/24/212357/F2)

Figure 2. Quantile-quantile plots of association p-values resulting from SAIGE, SAIGE-NoSPA (asymptotically equivalent to GMMAT) and BOLT-LMM (non-infinitesimal mixed model association test p-value) for A. coronary artery disease (PheCode 411, case:control = 1:12), B. colorectal cancer (PheCode 153, case:control = 1:84), and C. thyroid cancer (PheCode 193, case:control = 1:1138).

**Supplementary Table 2** presents the number of all significant loci and those that have not been previously reported by each method for each trait and **Supplementary Table 3** lists all significant loci identified by SAIGE.

Both the Manhattan and QQ plots show that BOLT-LMM and SAIGE-NoSPA have greatly inflated type I error rates. The inflation becomes more severe as case-control ratios become more unbalanced and the MAF of the tested variants decreases. The genomic inflation factors (λ) at the 0.001 and 0.01 p-value percentiles are shown for several MAF categories in **Supplementary Table 4**. For the colorectal cancer GWAS, which has a case-control ratio of 1:84, λ at the 0.001 p-value percentile is 1.68 and 1.71 for variants with MAF < 0.01 by SAIGE-NoSPA and BOLT-LMM, respectively, while λ is 0.99 by SAIGE. The inflation is even more severe for the test results by SAIGE-NoSPA and BOLT-LMM for thyroid cancer, which has a case-control ratio of 1:1138, with λ at the 0.001 p-value percentile around 4 to 5 for variants with MAF < 0.01 and for all variants, respectively. With the unbalanced case-control ratio accounted for in SAIGE, λ is again very close to 1.
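To make the λ values above concrete, the sketch below computes a genomic inflation factor at a given p-value percentile as the ratio of the observed to the expected 1-df chi-square quantile. This is one common definition and is assumed here; the text does not spell out the exact estimator used for Supplementary Table 4. The function names are illustrative.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()

def chisq1_from_pvalue(p):
    """1-df chi-square statistic whose upper-tail probability is p (0 < p <= 1)."""
    z = _STD_NORMAL.inv_cdf(p / 2.0)  # lower-tail quantile, squared below
    return z * z

def lambda_at_percentile(pvalues, q):
    """Observed / expected chi-square quantile at the q-th p-value percentile."""
    ordered = sorted(pvalues)
    k = max(1, round(q * len(ordered)))  # rank of the q-th percentile
    return chisq1_from_pvalue(ordered[k - 1]) / chisq1_from_pvalue(q)

# Well-calibrated p-values give lambda close to 1 at both percentiles.
uniform = [(i + 0.5) / 100_000 for i in range(100_000)]
print(lambda_at_percentile(uniform, 0.001), lambda_at_percentile(uniform, 0.01))
```

Evaluating λ at small percentiles such as 0.001, rather than at the median, focuses on the tail of the distribution, which is where inflation from case-control imbalance is most visible.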

Results for all ~1,600 PheCode-derived binary traits in 408,961 UK Biobank white British European-ancestry samples are currently being generated using the SAIGE software and will be available in a public repository as soon as the analysis is complete (see below for URL).

### Simulation Studies

Through simulation studies, we investigated the type I error control and power of two logistic mixed model approaches, SAIGE and GMMAT, and of the linear mixed model method BOLT-LMM, which computes mixed model association statistics under the infinitesimal and non-infinitesimal models. We followed the steps described in the Methods section to simulate genotypes for 1,000 families, each with 10 family members (*N* = 10,000), based on the pedigree shown in **Supplementary Figure 4**.

### Type I error rates

The type I error rates for SAIGE, SAIGE-NoSPA, GMMAT, and BOLT-LMM were evaluated based on association tests performed on 10⁹ simulated genetic variants. The variants were simulated using the same MAF spectrum as the UK Biobank HRC imputation data, with case-control ratios 1:99, 1:9, and 1:1. The empirical type I error rates at α = 5×10⁻⁴ and α = 5×10⁻⁸ are shown in **Supplementary Table 5**. SAIGE-NoSPA, GMMAT, and BOLT-LMM all have greatly inflated type I error rates when the case-control ratios are moderately or extremely unbalanced and slightly deflated type I error rates when the case-control ratios are balanced. This is expected, as previous studies have suggested inflation of the score tests in the presence of unbalanced case-control ratios and deflation in balanced studies29,38. SAIGE corrects the inflation of GMMAT and BOLT-LMM when case-control ratios are unbalanced, as well as the deflation when the case-control ratios are balanced. However, compared to the nominal α levels, the empirical type I error rates of SAIGE are slightly deflated.

To further investigate the type I error rates by MAF and case-control ratio, we carried out additional simulations. **Supplementary Figure 5** shows QQ plots of 1,000,000 rare variants (MAF = 0.005) with various case-control ratios (1:1, 1:9, and 1:99) and **Supplementary Figure 6** shows QQ plots of 1,000,000 variants with different MAFs (0.005, 0.01, 0.05, 0.1 and 0.3) when the case-control ratio was 1:99. Consistent with what was observed in the real data study, GMMAT and SAIGE-NoSPA are more inflated for less frequent variants with more unbalanced case-control ratios. In contrast, SAIGE has successfully corrected this problem.

### Power

Next we evaluated empirical power. Since power simulation requires re-estimating a variance component parameter for each variant to be tested, we used SAIGE-NoSPA instead of the original GMMAT software to reduce the computational burden. Due to the inflated type I error rates of BOLT-LMM and GMMAT (and SAIGE-NoSPA), for a fair comparison, we estimated power at the test-specific empirical α levels that yield type I error rate α = 5×10⁻⁸ (**Supplementary Table 6**). **Supplementary Figure 7** shows the power curve by odds ratio for variants with MAF 0.05, 0.1 and 0.2. When the case-control ratio is balanced, the power of SAIGE, SAIGE-NoSPA and BOLT-LMM was nearly identical. For studies with a moderately unbalanced case-control ratio (case:control = 1:9), SAIGE has higher power than SAIGE-NoSPA and BOLT-LMM, which is due to the very small empirical α for SAIGE-NoSPA and BOLT-LMM resulting from type I error inflation. The power gap is much larger when the case-control ratios are extremely unbalanced.
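The idea of evaluating power at a test-specific empirical α can be sketched as follows: the p-value cutoff is chosen from null simulations so that the realized type I error rate equals the nominal level, and power is then the fraction of alternative p-values at or below that cutoff. The function names and the toy numbers are illustrative and are not results from the paper.

```python
import math

def empirical_alpha(null_pvalues, nominal_alpha):
    """P-value cutoff at which the null rejection rate equals nominal_alpha."""
    ordered = sorted(null_pvalues)
    # Number of null tests allowed to be rejected at the nominal level.
    k = math.floor(nominal_alpha * len(ordered) + 1e-9)
    if k < 1:
        raise ValueError("too few null tests to resolve this alpha level")
    return ordered[k - 1]

def empirical_power(alt_pvalues, cutoff):
    """Fraction of tests under the alternative rejected at the cutoff."""
    return sum(p <= cutoff for p in alt_pvalues) / len(alt_pvalues)

# Toy inflated null distribution: p-values are systematically too small.
null = [((i + 1) / 1000) ** 2 for i in range(1000)]
cutoff = empirical_alpha(null, 0.05)
print(cutoff, empirical_power([0.001, 0.002, 0.01, 0.2], cutoff))
```

For an inflated test the cutoff falls well below the nominal α, which is why such a test loses power once its type I error is held at the nominal level.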

Overall simulation studies show that SAIGE can control type I error rates even when case-control ratios are extremely unbalanced and can be more powerful than GMMAT and BOLT-LMM. In contrast, GMMAT and BOLT-LMM suffer type I error inflation, and the inflation is especially severe with low MAF and unbalanced case-control ratios.

### Code and data availability

SAIGE is implemented as an open-source R package available at [https://github.com/weizhouUMICH/SAIGE/](https://github.com/weizhouUMICH/SAIGE/). The GWAS results for 1,403 binary phenotypes with the PheCodes3 constructed based on ICD codes in UK Biobank using SAIGE are currently available for public download at [https://www.dropbox.com/sh/wuj4y8wsqjz78om/AAACfAJK54KtvnzSTAoaZTLma?dl=0](https://www.dropbox.com/sh/wuj4y8wsqjz78om/AAACfAJK54KtvnzSTAoaZTLma?dl=0). **Supplementary Table 7** includes the phenotype information and URL links for downloading summary statistics, Q-Q plots, and Manhattan plots for the 1,403 phenotypes. We also display the results for 397 binary phenotypes in the Michigan PheWeb at [http://pheweb.sph.umich.edu/UKBiobank](http://pheweb.sph.umich.edu/UKBiobank), which consists of Manhattan plots, Q-Q plots, and regional association plots for each phenotype as well as the PheWAS plots for every genetic marker. We will populate the PheWeb with results for all UK Biobank phenotypes (> 1,400).

## DISCUSSION

In this paper, we have presented a method to perform association tests for binary traits in large cohorts in the presence of sample relatedness, which provides accurate p-value estimates even in extremely unbalanced case-control settings (with a prevalence < 0.1%). The dramatic decrease in genotyping cost over the last decade allows more and more large biobanks to genotype all of their participants, followed by genome-wide PheWAS, in which GWAS are performed for thousands of diseases/conditions characterized based on EHR and/or survey questionnaires to identify genetic risk factors across different phenotypes1–18. Several challenges exist for PheWAS in large cohorts. Statistically, inflated type I error rates caused by unbalanced case-control ratios and sample relatedness need to be corrected. Computationally, most existing mixed model association methods are not feasible for large sample sizes. Our method, SAIGE, uses the logistic mixed model to account for sample relatedness and applies the saddlepoint approximation (SPA) to correct the inflation caused by the unbalanced case-control ratio in score tests based on logistic mixed models.

SAIGE successfully corrects the inflation of type I error rates of low-frequency variants with binary traits that have unbalanced case-control ratios while also accounting for the relatedness among samples. Furthermore, our method uses several optimization strategies that are similar to those used by BOLT-LMM to improve its computational feasibility for large cohorts. For example, the preconditioned conjugate gradient algorithm is used to solve linear systems instead of the Cholesky decomposition method, so that the time complexity for fitting the null logistic model is decreased from O(*N*³) to approximately O(*M*₁*N*^1.5), where *M*₁ is the number of pruned markers used for estimating the genetic relationship matrix and *N* is the sample size. Compared to large *N*, *M*₁ is usually small. For instance, in the UK Biobank32,33, *M*₁ = 93,511 and *N* = 408,961 (white British participants).

SAIGE has several limitations. First, the time for algorithm convergence may vary among phenotypes and study samples given different heritability levels and sample relatedness. Second, SAIGE has been observed to be slightly conservative when case-control ratios are extremely unbalanced (**Supplementary Table 5**). Third, accurate odds ratio estimation requires fitting the model under the alternative and is not computationally efficient. Similar to several other mixed model methods20,25,36, SAIGE estimates odds ratios for genetic markers using the parameter estimates from the null model. Fourth, SAIGE estimates the genetic relationship matrix using genome-wide genetic markers instead of using the leave-one-chromosome-out (LOCO) scheme, which can avoid proximal contamination23,25,39,40. Last, SAIGE assumes that the effect sizes of genetic markers are normally distributed, which follows an infinitesimal architecture. With this assumption, SAIGE may sacrifice power to detect genetic signals whose genetic architecture is non-infinitesimal. In future work, we will incorporate the LOCO scheme, which is straightforward based on the current model and method, and model non-infinitesimal architecture as needed to improve power. In addition, we will extend the current single-variant test to gene- or region-based multiple-variant tests to improve power for identifying rare variants associated with disease susceptibility.

With the emergence of large-scale biobanks, PheWAS will be an important tool to identify genetic components of complex traits. Here we describe a scalable and accurate method, SAIGE, for the analysis of binary phenotypes in genome-wide PheWAS. Currently, SAIGE is the only available approach to adjust for both case-control imbalance and family relatedness, which are commonly observed in PheWAS datasets. In addition, the optimization approaches used in SAIGE make it scalable for the current largest dataset (UK Biobank) and for much larger future datasets. Through simulation and real data analysis, we have demonstrated that our method can efficiently analyze a dataset with 400,000 samples and control type I error rates even when the case-control ratios are extremely unbalanced. Our method will provide an accurate and scalable solution for large-scale biobank data analysis and ultimately contribute to identifying the genetic mechanisms of complex diseases.

## METHODS

### Generalized linear mixed model for binary traits

In a case-control study with sample size *N*, we denote the status of the *i*th individual using $y_i$ = 1 or 0 for being a case or a control. Let the 1 × (1 + *p*) vector $X_i$ represent *p* covariates including the intercept and $G_i$ represent the allele counts (0, 1 or 2) for the variant to test. The logistic mixed model can be written as

$$\operatorname{logit}(\mu_i) = X_i\alpha + G_i\beta + b_i$$

where $\mu_i = P(y_i = 1 \mid X_i, G_i, b_i)$ is the probability for the *i*th individual being a case given the covariates and genotypes as well as the random effect, which is denoted as $b_i$. The vector of random effects $b = (b_1, \ldots, b_N)$ is assumed to be distributed as $N(0, \tau\psi)$, where $\psi$ is an *N* × *N* genetic relationship matrix (GRM) and $\tau$ is the additive genetic variance. The $\alpha$ is a (1 + *p*) × 1 coefficient vector of fixed effects and $\beta$ is a coefficient of the genetic effect.

### Estimate variance component and other model parameters(Step 1)

To fit the null model, $\operatorname{logit}(\mu_i) = X_i\alpha + b_i$, the penalized quasi-likelihood (PQL) method41 and the AI-REML algorithm34 are used to iteratively estimate ![Graphic][2]. At iteration *k*, let ![Graphic][3] be estimated ![Graphic][4] the estimated mean of $y_i$, ![Graphic][5], and ![Graphic][6] be an *N* × *N* matrix of the variance of working vector ![Graphic][7]. To obtain the log quasi-likelihood and average information at each iteration, the current GMMAT approach calculates the inverse of ![Graphic][8]. Since this is computationally too expensive for large *N*, we use the preconditioned conjugate gradient (PCG) method30,31, which allows calculating the quasi-likelihood and average information without calculating ![Graphic][9] (see Supplementary materials for details). PCG is a numerical method for finding solutions of linear systems and is particularly useful when the system is very large. BOLT-LMM25 successfully used it to estimate the variance component in linear mixed models.
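The following is a minimal, matrix-free sketch of a preconditioned conjugate gradient solve of the kind described above. It assumes the system matrix has the form Σ = W⁻¹ + τψ with ψ = ZZᵀ/*M*₁ built from column-standardized genotypes Z, and it uses a diagonal (Jacobi) preconditioner. Both choices are assumptions made for illustration, and the toy data and function names are hypothetical rather than taken from the SAIGE implementation.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def standardize_columns(genotypes):
    """Return the columns of an N x M genotype matrix, centered and scaled."""
    n = len(genotypes)
    columns = []
    for j in range(len(genotypes[0])):
        col = [row[j] for row in genotypes]
        mean = sum(col) / n
        sd = math.sqrt(sum((v - mean) ** 2 for v in col) / n)
        columns.append([(v - mean) / sd for v in col])
    return columns

def make_sigma_matvec(z_cols, w, tau):
    """v -> (W^-1 + tau * Z Z^T / M) v in O(M*N), never forming the GRM."""
    m = len(z_cols)
    def matvec(v):
        out = [vi / wi for vi, wi in zip(v, w)]
        for col in z_cols:
            proj = tau * dot(col, v) / m
            for i, z in enumerate(col):
                out[i] += proj * z
        return out
    return matvec

def sigma_diagonal(z_cols, w, tau):
    """Diagonal of Sigma, used as the Jacobi preconditioner."""
    m = len(z_cols)
    return [1.0 / wi + tau * sum(col[i] ** 2 for col in z_cols) / m
            for i, wi in enumerate(w)]

def pcg(matvec, b, diag, tol=1e-10, max_iter=500):
    """Solve Sigma x = b by preconditioned conjugate gradient."""
    x = [0.0] * len(b)
    b_norm = math.sqrt(dot(b, b))
    if b_norm == 0.0:
        return x
    r = list(b)
    z = [ri / di for ri, di in zip(r, diag)]
    p = list(z)
    rz = dot(r, z)
    for _ in range(max_iter):
        ap = matvec(p)
        alpha = rz / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        if math.sqrt(dot(r, r)) <= tol * b_norm:
            break
        z = [ri / di for ri, di in zip(r, diag)]
        rz_next = dot(r, z)
        p = [zi + (rz_next / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_next
    return x

# Toy data: 6 individuals, 4 markers (all polymorphic).
genotypes = [[0, 1, 2, 0], [1, 0, 1, 1], [2, 1, 0, 0],
             [0, 2, 1, 1], [1, 1, 0, 2], [0, 0, 1, 1]]
mu = [0.1, 0.2, 0.05, 0.3, 0.15, 0.25]
w = [m * (1.0 - m) for m in mu]
z_cols = standardize_columns(genotypes)
sigma = make_sigma_matvec(z_cols, w, tau=1.0)
rhs = [1.0, 0.0, -1.0, 2.0, 0.5, -0.5]
solution = pcg(sigma, rhs, sigma_diagonal(z_cols, w, tau=1.0))
```

Each iteration touches the genotype columns once, so the cost per iteration is O(*M*₁*N*) and neither the GRM nor its inverse is stored.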

A score test statistic for $H_0: \beta = 0$ is ![Graphic][10] where *G* and *Y* are *N* × 1 genotype and phenotype vectors, respectively, ![Graphic][11] is the estimated mean of *Y* under the null hypothesis, and ![Graphic][12] is the covariate-adjusted genotype vector. The variance of ![Graphic][13], where ![Graphic][14]. For each variant, given ![Graphic][15], calculation of Var(*T*) requires O(*N*²) computation. In addition, since our approach does not calculate ![Graphic][16], and hence ![Graphic][17], obtaining Var(*T*) requires applying PCG for each variant, which can be computationally very expensive. To reduce the computation cost, we use the same approximation approach used in BOLT-LMM and GRAMMAR-Gamma36, in which we estimate the variance of *T* assuming that the true random effect *b* is given, and then calculate the ratio between the two variances. Suppose ![Graphic][18], which is a variance estimate of *T* assuming ![Graphic][19] is given. Let $r = \mathrm{Var}(T)/\mathrm{Var}^{*}(T)$ be the ratio of these two different types of variance estimates. In the Supplementary materials, we have shown that the ratio is approximately constant for all variants. Using this fact, we can estimate *r* using a relatively small number of variants. In all the numerical studies in this paper, we used 30 variants to estimate *r*.
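The formula images in this subsection are not reproduced in the text. As a reading aid, the block below writes out the standard score-test quantities for a logistic mixed model that match the verbal definitions above. The symbols $\tilde{G}$, $\hat{\mu}$, $\hat{W}$, $\hat{P}$ and $\hat{\Sigma}$ are assumed notation, and the expressions should be checked against the original figures before being relied on.

```latex
\begin{aligned}
T &= \tilde{G}^{\top}(Y - \hat{\mu}), \\
\mathrm{Var}(T) &= \tilde{G}^{\top}\hat{P}\,\tilde{G},
\qquad
\hat{P} = \hat{\Sigma}^{-1}
  - \hat{\Sigma}^{-1}X\,(X^{\top}\hat{\Sigma}^{-1}X)^{-1}X^{\top}\hat{\Sigma}^{-1}, \\
\mathrm{Var}^{*}(T) &= \tilde{G}^{\top}\hat{W}\,\tilde{G},
\qquad
\hat{W} = \operatorname{diag}\{\hat{\mu}_i(1-\hat{\mu}_i)\}, \\
r &= \mathrm{Var}(T)\,/\,\mathrm{Var}^{*}(T).
\end{aligned}
```

Under these forms, evaluating $\mathrm{Var}(T)$ for a variant needs a product with $\hat{\Sigma}^{-1}$ (hence a PCG solve when the inverse is not stored), whereas $\mathrm{Var}^{*}(T)$ is a weighted sum over individuals and costs O(*N*). This is why estimating *r* once from a small set of variants is attractive.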

### Score test with SPA (Step 2)

Suppose ![Graphic][20] is the estimated ratio (i.e., *r*) in Step 1. Now the variance-adjusted test statistic is ![Formula][21] which has mean zero and variance 1 under the null hypothesis. The computation of $T_{\mathrm{adj}}$ requires O(*N*) computation. The traditional score tests assume that *T* (and hence $T_{\mathrm{adj}}$) asymptotically follows a Gaussian distribution under $H_0: \beta = 0$, which uses only the first two moments (mean and variance). When the case-control ratios are unbalanced and variants have low MAC, the underlying distribution of $T_{\mathrm{adj}}$ can be substantially different from the Gaussian distribution. To obtain accurate p-values, we use the saddlepoint approximation (SPA) method27–29, which approximates the distribution using the entire cumulant generating function (CGF). A fast version of SPA (fastSPA)29 has recently been developed and applied to PheWAS, and provides accurate p-values even when case-control ratios are extremely unbalanced (e.g., case:control = 1:600).

To apply fastSPA to $T_{\mathrm{adj}}$, we first need to obtain the CGF of $T_{\mathrm{adj}}$. To do this, we use the fact that given ![Graphic][22], $T_{\mathrm{adj}}$ is a weighted sum of independent Bernoulli random variables. The approximated cumulant generating function is ![Formula][23] where the constant $c = \mathrm{Var}^{*}(T)^{-1/2}$. Let $K'(t)$ and $K''(t)$ be the first and second derivatives of *K* with respect to *t*. To calculate the probability that $T_{\mathrm{adj}} < q$, where *q* is an observed test statistic, we use the following formula27,28,42 ![Formula][24] where ![Graphic][25] and ![Graphic][26] is the solution of the equation ![Graphic][27]. As in fastSPA29, we exploit the sparsity of the genotype vector when the MAF of variants is low. In addition, since the normal approximation works well when the test statistic is close to the mean, we use the normal distribution when the test statistic is within two standard deviations of the mean.
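The formula images for the CGF and the tail probability are not reproduced in this text, so the sketch below should be read as an illustration of the general technique rather than the exact expressions used in SAIGE. It treats the standardized statistic as a weighted sum of independent Bernoulli variables, uses the standard CGF of such a sum, finds the saddlepoint by bisection, and applies the Barndorff-Nielsen form of the saddlepoint tail formula. The variance ratio is taken as 1 for simplicity, and all function names are hypothetical.

```python
import math
from statistics import NormalDist

_PHI = NormalDist()

def _log_mgf(m, a):
    """log(1 - m + m*exp(a)), written to avoid overflow for large |a|."""
    if a > 0:
        return a + math.log(m + (1.0 - m) * math.exp(-a))
    return math.log(1.0 - m + m * math.exp(a))

def _tilted_prob(m, a):
    """m*exp(a) / (1 - m + m*exp(a)), written to avoid overflow."""
    if a > 0:
        return m / (m + (1.0 - m) * math.exp(-a))
    e = m * math.exp(a)
    return e / (1.0 - m + e)

def spa_cdf(g, mu, q, normal_cutoff=2.0):
    """Approximate P(T < q) for T = c * sum_i g_i (y_i - mu_i), y_i ~ Bernoulli(mu_i)."""
    var_star = sum(gi * gi * m * (1.0 - m) for gi, m in zip(g, mu))
    c = 1.0 / math.sqrt(var_star)
    mean_term = c * sum(gi * m for gi, m in zip(g, mu))

    def k0(t):  # cumulant generating function of T
        return sum(_log_mgf(m, c * gi * t) for gi, m in zip(g, mu)) - t * mean_term

    def k1(t):  # first derivative
        return sum(c * gi * _tilted_prob(m, c * gi * t) for gi, m in zip(g, mu)) - mean_term

    def k2(t):  # second derivative
        total = 0.0
        for gi, m in zip(g, mu):
            p = _tilted_prob(m, c * gi * t)
            total += (c * gi) ** 2 * p * (1.0 - p)
        return total

    # Near the mean the normal approximation is adequate (as stated in the text).
    if abs(q) < normal_cutoff:
        return _PHI.cdf(q)

    # Bracket the saddlepoint (k1 is increasing), then bisect k1(zeta) = q.
    lo, hi = (0.0, 1.0) if q > 0 else (-1.0, 0.0)
    for _ in range(8):
        if k1(lo) <= q <= k1(hi):
            break
        lo, hi = (hi, 2.0 * hi) if q > 0 else (2.0 * lo, lo)
    else:
        raise ValueError("saddlepoint not bracketed; q may lie outside the support")
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if k1(mid) < q:
            lo = mid
        else:
            hi = mid
    zeta = 0.5 * (lo + hi)

    w = math.copysign(math.sqrt(2.0 * (zeta * q - k0(zeta))), zeta)
    v = zeta * math.sqrt(k2(zeta))
    return _PHI.cdf(w + math.log(v / w) / w)

# 1,000 carriers of one allele each, all with a 1% case probability.
upper_tail = 1.0 - spa_cdf([1] * 1000, [0.01] * 1000, 4.0)
normal_tail = 1.0 - _PHI.cdf(4.0)
print(upper_tail, normal_tail)
```

In this rare-outcome example the saddlepoint upper tail is several times larger than the Gaussian tail at the same statistic, which illustrates how a Gaussian reference distribution can overstate significance when cases are rare.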

### Data simulation

We carried out a series of simulations to evaluate and compare the performance of SAIGE to GMMAT. We randomly simulated a set of 1,000,000 base-pair “pseudo” sequences, in which variants are independent of each other. Alleles for each variant were randomly drawn from Binomial(n = 2, p = MAF). Then we performed the gene-dropping simulation43 using these sequences as founder haplotypes that were propagated through the pedigree of 10 family members shown in **Supplementary Figure 4**. Binary phenotypes were generated from the following logistic mixed model ![Formula][28] where $G_i$ is the genotype value, $\beta$ is the genetic log odds ratio, and $b_i$ is the random effect simulated from $N(0, \tau\psi)$, with $\tau$ = 1. Two covariates, $X_1$ and $X_2$, were simulated from Bernoulli(0.5) and N(0,1), respectively. The intercept $\alpha$ was determined by the given prevalence (i.e., case-control ratio).
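The step of choosing the intercept to match a target prevalence can be sketched as below. Since the model formula image is not reproduced here, the sketch makes several simplifying assumptions: unit coefficients on both covariates, unrelated individuals (so the random effects are independent N(0, τ) draws instead of following the pedigree-based ψ), and a bisection search for the intercept. All names are hypothetical.

```python
import math
import random

def expit(x):
    return 1.0 / (1.0 + math.exp(-x))

def solve_intercept(linear_terms, prevalence, lo=-30.0, hi=30.0, iters=60):
    """Bisection for the intercept that makes the mean case probability equal prevalence."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        mean_p = sum(expit(mid + e) for e in linear_terms) / len(linear_terms)
        if mean_p < prevalence:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def simulate_phenotypes(n, prevalence, beta=0.0, maf=0.05, tau=1.0, seed=1):
    """Draw genotypes, covariates, random effects and binary phenotypes."""
    rng = random.Random(seed)
    g = [(rng.random() < maf) + (rng.random() < maf) for _ in range(n)]  # Binomial(2, MAF)
    x1 = [1.0 if rng.random() < 0.5 else 0.0 for _ in range(n)]          # Bernoulli(0.5)
    x2 = [rng.gauss(0.0, 1.0) for _ in range(n)]                         # N(0, 1)
    b = [rng.gauss(0.0, math.sqrt(tau)) for _ in range(n)]               # assumes psi = I
    eta = [beta * gi + a + c + bi for gi, a, c, bi in zip(g, x1, x2, b)]
    alpha0 = solve_intercept(eta, prevalence)
    probs = [expit(alpha0 + e) for e in eta]
    y = [1 if rng.random() < p else 0 for p in probs]
    return y, probs, alpha0

y, probs, alpha0 = simulate_phenotypes(n=2000, prevalence=0.1)
print(sum(y), round(alpha0, 3))
```

Lower prevalences push the intercept further negative, which is how the 1:9 and 1:99 case-control ratios are obtained from the same covariate and random-effect distributions.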

To evaluate the type I error rates at genome-wide α = 5×10⁻⁸, 10 million markers along with 100 sets of phenotypes with different random seeds for case-control ratios 1:99, 1:9, and 1:1 were simulated with *β* = 0. Given τ = 1, the estimated heritability is 0.015, 0.092, and 0.17 for phenotypes with case-control ratios 1:99, 1:9, and 1:1, respectively44. Association tests were performed on the 10 million genetic markers for each of the 100 sets of phenotypes using SAIGE, GMMAT, and BOLT-LMM, so that 10⁹ tests were performed in total. To have a realistic MAF spectrum, MAFs were randomly sampled from the MAF spectrum in UK Biobank data (**Supplementary Figure 8**). Additional type I error simulations were carried out for five different MAFs (0.005, 0.01, 0.05, 0.1 and 0.3) to evaluate type I error rates by MAF. For the power simulation, phenotypes were generated under the alternative hypothesis *β* ≠ 0. For each of the MAFs 0.05 and 0.2, we simulated 1,000 datasets, and power was evaluated at the test-specific empirical α, which yields nominal α = 5×10⁻⁸. The empirical α was estimated from the previous type I error simulations. As in the type I error simulations, three different case-control ratios (1:99, 1:9, and 1:1) were considered.

Note that since we evaluated the empirical type I error rates and power based on genetic variants that were simulated independently, the LD Score regression45 calibration and the leave-one-chromosome-out (LOCO) scheme were not applied in BOLT-LMM.

### Phenotype definition in UK Biobank

We used a previously published scheme to define disease-specific binary phenotypes by combining hospital ICD-9 codes into hierarchical PheCodes, each representing a more or less specific disease group3. ICD-10 codes were mapped to PheCodes using a combination of available maps through the Unified Medical Language System ([https://www.nlm.nih.gov/research/umls/](https://www.nlm.nih.gov/research/umls/)) and other sources, string matching, and manual review. Study participants were labeled with a PheCode if they had one or more of the PheCode-specific ICD codes. Cases were all study participants with the PheCode of interest, and controls were all study participants without the PheCode of interest or any related PheCodes. Gender checks were performed, so PheCodes specific for one gender could not mistakenly be assigned to the other gender.
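The case and control rules above can be expressed as a short Python sketch. All code labels (`ICD_A`, `PC_100`, and so on), the function name `define_phenotype`, and the data layout are hypothetical placeholders, not actual ICD or PheCode entries. Treating participants whose sex does not match a sex-specific PheCode as excluded is also an assumption; the text only states that such PheCodes are not assigned to them.

```python
def define_phenotype(target, participant_icd, icd_to_phecode, related,
                     sex=None, sex_specific=None):
    """Return {participant: 1 (case), 0 (control), or None (excluded)}."""
    sex = sex or {}
    sex_specific = sex_specific or {}
    related_codes = set(related.get(target, ()))
    required_sex = sex_specific.get(target)
    status = {}
    for pid, icd_codes in participant_icd.items():
        # Map each ICD code to its PheCode, ignoring unmapped codes.
        phecodes = {icd_to_phecode[c] for c in icd_codes if c in icd_to_phecode}
        if required_sex is not None and sex.get(pid) != required_sex:
            status[pid] = None      # ASSUMPTION: sex mismatch means excluded
        elif target in phecodes:
            status[pid] = 1         # case: has the PheCode of interest
        elif phecodes & related_codes:
            status[pid] = None      # has a related PheCode: not a valid control
        else:
            status[pid] = 0         # control
    return status

# Hypothetical toy data.
icd_to_phecode = {"ICD_A": "PC_100", "ICD_B": "PC_100.1", "ICD_C": "PC_200"}
related = {"PC_100": {"PC_100.1"}}
participant_icd = {
    "p1": {"ICD_A"},   # carries the target PheCode
    "p2": {"ICD_B"},   # carries only a related PheCode
    "p3": {"ICD_C"},   # carries an unrelated PheCode
    "p4": set(),       # no codes at all
}
status = define_phenotype("PC_100", participant_icd, icd_to_phecode, related)
```

Because controls are everyone without the target or a related PheCode, the control group is usually far larger than the case group, which is the source of the unbalanced case-control ratios discussed throughout.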

## Author contributions

W.Z., C.W. and S.L. designed experiments. W.Z. and S.L. performed experiments. J.B.N., L.G.F., A.G., L.A.B., W.-Q.W. and J.C.D. constructed phenotypes for the UK Biobank data. W.Z., J.L., H.M.K., C.W., S.L. and G.R.A. analyzed UK Biobank data. P.V. created the PheWeb. M.B.E. and K.H. provided data. W.Z., J.B.N., A.G., J.C.D., R.D., C.W. and S.L. wrote the manuscript.

## Acknowledgements

This research has been conducted using the UK Biobank Resource under application number 24460. SL and RD were supported by NIH R01 HG008773. CJW was supported by NIH R35 HL135824. WZ was supported by the University of Michigan Rackham Predoctoral Fellowship. JBN was supported by the Danish Heart Foundation and the Lundbeck Foundation. JCD, AG, LB, and WQW were supported by NIH R01 LM010685 and U2C OD023196.

*   Received October 31, 2017.
*   Revision received November 24, 2017.
*   Accepted November 24, 2017.


*   © 2017, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  Bush, W. S., Oetjens, M. T. & Crawford, D. C. Unravelling the human genome-phenome relationship using phenome-wide association studies. Nat Rev Genet 17, 129–145 (2016).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nrg.2015.36&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26875678&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

2.  Cronin, R. M. et al. Phenome-wide association studies demonstrating pleiotropy of genetic variants within FTO with and without adjustment for body mass index. Front Genet 5, 250 (2014).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.3389/fgene.2014.00250&link_type=DOI) 

3.  Denny, J. C. et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat Biotechnol 31, 1102–1110 (2013).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nbt.2749&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24270849&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

4.  Denny, J. C. et al. Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies. Am J Hum Genet 89, 529–542 (2011).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2011.09.008&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21981779&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

5.  Dumitrescu, L. et al. Towards a phenome-wide catalog of human clinical traits impacted by genetic ancestry. BioData Min 8, 35 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1186/s13040-015-0068-y&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26566401&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

6.  Hall, M. A. et al. Detection of pleiotropy through a Phenome-wide association study (PheWAS) of epidemiologic data as part of the Environmental Architecture for Genes Linked to Environment (EAGLE) study. PLoS Genet 10, e1004678 (2014).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1004678&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25474351&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

7.  Hebbring, S. J. et al. Application of clinical text data for phenome-wide association studies (PheWASs). Bioinformatics 31, 1981–1987 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btv076&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25657332&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

8.  Hebbring, S. J. et al. A PheWAS approach in studying HLA-DRB1*1501. Genes Immun 14, 187–191 (2013).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/gene.2013.2&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23392276&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000318062400007&link_type=ISI) 

9.  Liao, K. P. et al. Associations of autoantibodies, autoimmune risk alleles, and clinical diagnoses from the electronic medical records in rheumatoid arthritis cases and non-rheumatoid arthritis controls. Arthritis Rheum 65, 571–581 (2013).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1002/art.37801&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23233247&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000315452400005&link_type=ISI) 

10. Millard, L. A. et al. MR-PheWAS: hypothesis prioritization among potential causal effects of body mass index on many outcomes, using Mendelian randomization. Sci Rep 5, 16645 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/srep16645&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=26568383&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

11. Moore, C. B. et al. Phenome-wide Association Study Relating Pretreatment Laboratory Parameters With Human Genetic Variants in AIDS Clinical Trials Group Protocols. Open Forum Infect Dis 2, ofu113 (2015).
    
    
12. Namjou, B. et al. Phenome-wide association study (PheWAS) in EMR-linked pediatric cohorts, genetically links PLCL1 to speech language development and IL5-IL13 to Eosinophilic Esophagitis. Front Genet 5, 401 (2014).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.3389/fgene.2014.00401&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25477900&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

13. Namjou, B. et al. A GWAS Study on Liver Function Test Using eMERGE Network Participants. PLoS One 10, e0138677 (2015).
    
    
14. Neuraz, A. et al. Phenome-wide association studies on a quantitative trait: application to TPMT enzyme activity and thiopurine therapy in pharmacogenomics. PLoS Comput Biol 9, e1003405 (2013).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pcbi.1003405&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24385893&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

15. Pendergrass, S. A. et al. Phenome-wide association study (PheWAS) for detection of pleiotropy within the Population Architecture using Genomics and Epidemiology (PAGE) Network. PLoS Genet 9, e1003087 (2013).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1003087&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23382687&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

16. Ritchie, M. D. et al. Genome- and phenome-wide analyses of cardiac conduction identifies markers of arrhythmia risk. Circulation 127, 1377–1385 (2013).
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjExOiIxMjcvMTMvMTM3NyI7czo0OiJhdG9tIjtzOjM3OiIvYmlvcnhpdi9lYXJseS8yMDE3LzExLzI0LzIxMjM1Ny5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

17. Shameer, K. et al. A genome- and phenome-wide association study to identify genetic variants influencing platelet count and volume and their pleiotropic effects. Hum Genet 133, 95–109 (2014).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1007/s00439-013-1355-7&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24026423&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

18. Ye, Z. et al. Phenome-wide association studies (PheWASs) for functional variants. Eur J Hum Genet 23, 523–529 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ejhg.2014.123&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25074467&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

19. Aulchenko, Y. S., de Koning, D. J. & Haley, C. Genomewide Rapid Association Using Mixed Model and Regression: A Fast and Simple Method For Genomewide Pedigree-Based Quantitative Trait Loci Association Analysis. Genetics 177, 577–585 (2007).
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXRpY3MiO3M6NToicmVzaWQiO3M6OToiMTc3LzEvNTc3IjtzOjQ6ImF0b20iO3M6Mzc6Ii9iaW9yeGl2L2Vhcmx5LzIwMTcvMTEvMjQvMjEyMzU3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

20. Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet 42, 348–354 (2010).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.548&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20208533&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000276150500016&link_type=ISI) 

21. Zhang, Z. et al. Mixed linear model approach adapted for genome-wide association studies. Nat Genet 42, 355–360 (2010).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.546&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=20208535&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000276150500017&link_type=ISI) 

22. Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88, 76–82 (2011).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2010.11.011&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21167468&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

23. Lippert, C. et al. FaST linear mixed models for genome-wide association studies. Nat Methods 8, 833–835 (2011).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nmeth.1681&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=21892150&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000295358000017&link_type=ISI) 

24. Zhou, X. & Stephens, M. Genome-wide efficient mixed-model analysis for association studies. Nat Genet 44, 821–824 (2012).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.2310&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22706312&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

25. Loh, P. R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat Genet 47, 284–290 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.3190&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25642633&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

26. Chen, H. et al. Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models. Am J Hum Genet 98, 653–666 (2016).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2016.02.012&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=27018471&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

27. Kuonen, D. Saddlepoint approximations for distributions of quadratic forms in normal variables. Biometrika 86, 929–935 (1999).
    
    
28. Imhof, J. P. Computing the Distribution of Quadratic Forms in Normal Variables. Biometrika 48, 419–426 (1961).
    
    
29. Dey, R., Schmidt, E. M., Abecasis, G. R. & Lee, S. A fast and accurate algorithm to test for binary phenotypes and its application to PheWAS. bioRxiv (2017). doi:10.1101/109876
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=doi:10.1101/109876&link_type=DOI) 

30. Kaasschieter, E. F. Preconditioned conjugate gradients for solving singular systems. J. Comput. Appl. Math. 24, 265–275 (1988).
    
    
31. Hestenes, M. R. & Stiefel, E. Methods of conjugate gradients for solving linear systems. 49, (NBS, 1952).
    
    
32. Sudlow, C. et al. UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age. PLOS Med. 12, e1001779 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1371/journal.pmed.1001779&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25826379&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

33. Bycroft, C. et al. Genome-wide genetic data on ~500,000 UK Biobank participants. bioRxiv 166298 (2017). doi:10.1101/166298
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=doi:10.1101/166298&link_type=DOI) 

34. Gilmour, A. R., Thompson, R. & Cullis, B. R. Average Information REML: An Efficient Algorithm for Variance Parameter Estimation in Linear Mixed Models. Biometrics 51, 1440 (1995).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.2307/2533274&link_type=DOI) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1995TQ15600022&link_type=ISI) 

35. Aulchenko, Y. S., Ripke, S., Isaacs, A. & van Duijn, C. M. GenABEL: an R library for genome-wide association analysis. Bioinformatics 23, 1294–1296 (2007).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1093/bioinformatics/btm108&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=17384015&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000247348300017&link_type=ISI) 

36. Svishcheva, G. R., Axenovich, T. I., Belonogova, N. M., van Duijn, C. M. & Aulchenko, Y. S. Rapid variance components-based method for whole-genome association analysis. Nat. Genet. 44, 1166–1170 (2012).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.2410&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22983301&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

37. McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 48, 1279–1283 (2016).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.3643&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=27548312&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

38. Ma, C., Blackwell, T., Boehnke, M., Scott, L. J. & GoT2D investigators. Recommended Joint and Meta-Analysis Strategies for Case-Control Association Testing of Single Low-Count Variants. Genet. Epidemiol. 37, 539–550 (2013).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1002/gepi.21742&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=23788246&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

39. Listgarten, J. et al. Improved linear mixed models for genome-wide association studies. Nat. Methods 9, 525–526 (2012).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/nmeth.2037&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=22669648&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000304778500006&link_type=ISI) 

40. Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 46, 100–106 (2014).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.2876&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=24473328&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 

41. Breslow, N. E. & Clayton, D. G. Approximate Inference in Generalized Linear Mixed Models. J. Am. Stat. Assoc. 88, 9 (1993).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.2307/2290687&link_type=DOI) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=A1993KP99300002&link_type=ISI) 

42. Johnson, N. L. & Kotz, S. Distributions in Statistics: Continuous Univariate Distributions. 2, p. 152 (Wiley, 1970).
    
    
43. Abecasis, G. R., Cherny, S. S., Cookson, W. O. & Cardon, L. R. Merlin–rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30, 97–101 (2002).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng786&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=11731797&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom) 
    
    [Web of Science](http://biorxiv.org/lookup/external-ref?access_num=000173105600021&link_type=ISI) 

44. de Villemereuil, P., Schielzeth, H., Nakagawa, S. & Morrissey, M. General Methods for Evolutionary Quantitative Genetic Inference from Generalized Mixed Models. Genetics 204, 1281–1294 (2016).
    
    [Abstract/FREE Full Text](http://biorxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6ODoiZ2VuZXRpY3MiO3M6NToicmVzaWQiO3M6MTA6IjIwNC8zLzEyODEiO3M6NDoiYXRvbSI7czozNzoiL2Jpb3J4aXYvZWFybHkvMjAxNy8xMS8yNC8yMTIzNTcuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

45. Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
    
    [CrossRef](http://biorxiv.org/lookup/external-ref?access_num=10.1038/ng.3211&link_type=DOI) 
    
    [PubMed](http://biorxiv.org/lookup/external-ref?access_num=25642630&link_type=MED&atom=%2Fbiorxiv%2Fearly%2F2017%2F11%2F24%2F212357.atom)

 [1]: /embed/graphic-4.gif
 [2]: /embed/inline-graphic-1.gif
 [3]: /embed/inline-graphic-2.gif
 [4]: /embed/inline-graphic-3.gif
 [5]: /embed/inline-graphic-4.gif
 [6]: /embed/inline-graphic-5.gif
 [7]: /embed/inline-graphic-6.gif
 [8]: /embed/inline-graphic-7.gif
 [9]: /embed/inline-graphic-8.gif
 [10]: /embed/inline-graphic-9.gif
 [11]: /embed/inline-graphic-10.gif
 [12]: /embed/inline-graphic-11.gif
 [13]: /embed/inline-graphic-12.gif
 [14]: /embed/inline-graphic-13.gif
 [15]: /embed/inline-graphic-14.gif
 [16]: /embed/inline-graphic-15.gif
 [17]: /embed/inline-graphic-16.gif
 [18]: /embed/inline-graphic-17.gif
 [19]: /embed/inline-graphic-18.gif
 [20]: /embed/inline-graphic-19.gif
 [21]: /embed/graphic-5.gif
 [22]: /embed/inline-graphic-20.gif
 [23]: /embed/graphic-6.gif
 [24]: /embed/graphic-7.gif
 [25]: /embed/inline-graphic-21.gif
 [26]: /embed/inline-graphic-22.gif
 [27]: /embed/inline-graphic-23.gif
 [28]: /embed/graphic-8.gif