TY - JOUR T1 - Estimating degree of polygenicity, causal effect size variance, and confounding bias in GWAS summary statistics JF - bioRxiv DO - 10.1101/133132 SP - 133132 AU - Dominic Holland AU - Chun-Chieh Fan AU - Oleksandr Frei AU - Alexey A. Shadrin AU - Olav B. Smeland AU - V. S. Sundar AU - Enhancing Neuro Imaging Genetics through Meta Analysis Consortium AU - Ole A. Andreassen AU - Anders M. Dale Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/05/24/133132.abstract N2 - Of signal interest in the genetics of traits are estimating the proportion, π1, of causally associated single nucleotide polymorphisms (SNPs), and their effect size variance, , which are components of the mean heritabilities captured by the causal SNP. Here we present the first model, using detailed linkage disequilibrium structure, to estimate these quantities from genome-wide association studies (GWAS) summary statistics, assuming a Gaussian distribution of SNP effect sizes, β. We apply the model to three diverse phenotypes – schizophrenia, putamen volume, and educational attainment – and validate it with extensive simulations. We find that schizophrenia is highly polygenic, with ~ 5 × 104 causal SNPs distributed with small effect size variance, (in units where the phenotype variance is normalized to 1), requiring a GWAS study with more than 1/2-million samples in each arm for full discovery. In contrast, putamen volume involves only ≃ 3 × 102 causal SNPs, but with , indicating a much larger proportion of the causal SNPs that are strongly associated. Educational attainment has similar polygenicity to schizophrenia, but with effects that are substantially weaker, , leading to much lower heritability. Thus the model is able to describe the broad genetic architecture of phenotypes where both polygenicity and effect size variance range over several orders of magnitude, shows why only small proportions of heritability have been explained for discovered SNPs, and provides a roadmap for future GWAS discoveries. ER -