Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits and implications for the future

Yan Zhang, Guanghao Qi, Ju-Hyun Park, Nilanjan Chatterjee
doi: https://doi.org/10.1101/175406
Yan Zhang
1Department of Biostatistics, Bloomberg School of Public Health
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Guanghao Qi
1Department of Biostatistics, Bloomberg School of Public Health
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ju-Hyun Park
2Department of Statistics, Dongguk University, Republic of Korea
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nilanjan Chatterjee
1Department of Biostatistics, Bloomberg School of Public Health
3Department of Oncology, School of Medicine, Johns Hopkins University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: [email protected]
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Summary-level statistics from genome-wide association studies are now widely used to estimate heritability and co-heritability of traits using the popular linkage-disequilibrium-score (LD-score) regression method. We develop a likelihood-based approach for analyzing summary-level statistics and external LD information to estimate common variants effect-size distributions, characterized by proportion of underlying susceptibility SNPs and a flexible normal-mixture model for their effects. Analysis of summary-level results across 32 GWAS reveals that while all traits are highly polygenic, there is wide diversity in the degrees of polygenicity. The effect-size distributions for susceptibility SNPs could be adequately modeled by a single normal distribution for traits related to mental health and ability and by a mixture of two normal distributions for all other traits. Among quantitative traits, we predict the sample sizes needed to identify SNPs which explain 80% of GWAS heritability to be between 300K-500K for some of the early growth traits, between 1-2 million for some anthropometric and cholesterol traits and multiple millions for body mass index and some others. The corresponding predictions for disease traits are between 200K-400K for inflammatory bowel diseases, close to one million for a variety of adult onset chronic diseases and between 1-2 million for psychiatric diseases.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted August 11, 2017.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits and implications for the future
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits and implications for the future
Yan Zhang, Guanghao Qi, Ju-Hyun Park, Nilanjan Chatterjee
bioRxiv 175406; doi: https://doi.org/10.1101/175406
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Estimation of complex effect-size distributions using summary-level statistics from genome-wide association studies across 32 complex traits and implications for the future
Yan Zhang, Guanghao Qi, Ju-Hyun Park, Nilanjan Chatterjee
bioRxiv 175406; doi: https://doi.org/10.1101/175406

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (6038)
  • Biochemistry (13742)
  • Bioengineering (10473)
  • Bioinformatics (33263)
  • Biophysics (17156)
  • Cancer Biology (14221)
  • Cell Biology (20184)
  • Clinical Trials (138)
  • Developmental Biology (10898)
  • Ecology (16064)
  • Epidemiology (2067)
  • Evolutionary Biology (20384)
  • Genetics (13432)
  • Genomics (18676)
  • Immunology (13801)
  • Microbiology (32252)
  • Molecular Biology (13408)
  • Neuroscience (70229)
  • Paleontology (528)
  • Pathology (2200)
  • Pharmacology and Toxicology (3748)
  • Physiology (5894)
  • Plant Biology (12040)
  • Scientific Communication and Education (1817)
  • Synthetic Biology (3374)
  • Systems Biology (8183)
  • Zoology (1846)