Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits

Nat Genet. 2018 Sep;50(9):1318-1326. doi: 10.1038/s41588-018-0193-x. Epub 2018 Aug 13.

Abstract

We developed a likelihood-based approach for analyzing summary-level statistics and external linkage disequilibrium information to estimate effect-size distributions of common variants, characterized by the proportion of underlying susceptibility SNPs and a flexible normal-mixture model for their effects. Analysis of results available across 32 genome-wide association studies showed that, while all traits are highly polygenic, there is wide diversity in the degree and nature of polygenicity. Psychiatric diseases and traits related to mental health and ability appear to be most polygenic, involving a continuum of small effects. Most other traits, including major chronic diseases, involve clusters of SNPs that have distinct magnitudes of effects. We predict that the sample sizes needed to identify SNPs that explain most heritability found in genome-wide association studies will range from a few hundred thousand to multiple millions, depending on the underlying effect-size distributions of the traits. Accordingly, we project the risk-prediction ability of polygenic risk scores across a wide variety of diseases.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genetic Predisposition to Disease / genetics*
  • Genome / genetics*
  • Genome-Wide Association Study / methods
  • Humans
  • Likelihood Functions
  • Linkage Disequilibrium / genetics
  • Mental Disorders / genetics
  • Models, Genetic
  • Multifactorial Inheritance / genetics
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Risk Factors