Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

A flexible modeling and inference framework for estimating variant effect sizes from GWAS summary statistics

View ORCID ProfileJeffrey P. Spence, View ORCID ProfileNasa Sinnott-Armstrong, View ORCID ProfileThemistocles L. Assimes, View ORCID ProfileJonathan K. Pritchard
doi: https://doi.org/10.1101/2022.04.18.488696
Jeffrey P. Spence
1Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jeffrey P. Spence
  • For correspondence: jspence@stanford.edu pritch@stanford.edu
Nasa Sinnott-Armstrong
1Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305
2Herbold Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109
3VA Palo Alto Health Care System, Palo Alto, CA, 94550
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nasa Sinnott-Armstrong
Themistocles L. Assimes
3VA Palo Alto Health Care System, Palo Alto, CA, 94550
4Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Themistocles L. Assimes
Jonathan K. Pritchard
1Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305
5Department of Biology, Stanford University, Stanford, CA, 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jonathan K. Pritchard
  • For correspondence: jspence@stanford.edu pritch@stanford.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Genome-wide association studies (GWAS) have highlighted that almost any trait is affected by many variants of relatively small effect. On one hand this presents a challenge for inferring the effect of any single variant as the signal-to-noise ratio is high for variants of small effect. This challenge is compounded when combining information across many variants in polygenic scores for predicting trait values. On the other hand, the large number of contributing variants provides an opportunity to learn about the average behavior of variants encoded in the distribution of variant effect sizes. Many approaches have looked at aspects of this problem, but no method has unified the inference of the effects of individual variants with the inference of the distribution of effect sizes while requiring only GWAS summary statistics and properly accounting for linkage disequilibrium between variants. Here we present a flexible, unifying framework that combines information across variants to infer a distribution of effect sizes and uses this distribution to improve the estimation of the effects of individual variants. We also develop a variational inference (VI) scheme to perform efficient inference under this framework. We show this framework is useful by constructing polygenic scores (PGSs) that outperform the state-of-the-art. Our modeling framework easily extends to jointly inferring effect sizes across multiple cohorts, where we show that building PGSs using additional cohorts of differing ancestries improves predictive accuracy and portability. We also investigate the inferred distributions of effect sizes across many traits and find that these distributions have effect sizes ranging over multiple orders of magnitude, in contrast to the assumptions implicit in many commonly-used statistical genetics methods.

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted April 19, 2022.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A flexible modeling and inference framework for estimating variant effect sizes from GWAS summary statistics
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A flexible modeling and inference framework for estimating variant effect sizes from GWAS summary statistics
Jeffrey P. Spence, Nasa Sinnott-Armstrong, Themistocles L. Assimes, Jonathan K. Pritchard
bioRxiv 2022.04.18.488696; doi: https://doi.org/10.1101/2022.04.18.488696
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
A flexible modeling and inference framework for estimating variant effect sizes from GWAS summary statistics
Jeffrey P. Spence, Nasa Sinnott-Armstrong, Themistocles L. Assimes, Jonathan K. Pritchard
bioRxiv 2022.04.18.488696; doi: https://doi.org/10.1101/2022.04.18.488696

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3686)
  • Biochemistry (7766)
  • Bioengineering (5666)
  • Bioinformatics (21234)
  • Biophysics (10552)
  • Cancer Biology (8157)
  • Cell Biology (11902)
  • Clinical Trials (138)
  • Developmental Biology (6736)
  • Ecology (10387)
  • Epidemiology (2065)
  • Evolutionary Biology (13838)
  • Genetics (9693)
  • Genomics (13054)
  • Immunology (8120)
  • Microbiology (19932)
  • Molecular Biology (7824)
  • Neuroscience (42955)
  • Paleontology (318)
  • Pathology (1276)
  • Pharmacology and Toxicology (2256)
  • Physiology (3350)
  • Plant Biology (7207)
  • Scientific Communication and Education (1309)
  • Synthetic Biology (1998)
  • Systems Biology (5528)
  • Zoology (1126)