Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Reconstructing SNP Allele and Genotype Frequencies from GWAS Summary Statistics

Zhiyu Yang, Peristera Paschou, Petros Drineas
doi: https://doi.org/10.1101/2021.04.02.438281
Zhiyu Yang
1Department of Biological Sciences, Purdue University, West Lafayette, Indiana
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peristera Paschou
1Department of Biological Sciences, Purdue University, West Lafayette, Indiana
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Petros Drineas
2Department of Computer Science, Purdue University, West Lafayette, Indiana
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: pdrinease@purdue.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The emergence of genomewide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcoming barriers that often exist in individual-level data access while also offering significant computational savings. Here, we propose a novel framework that can reconstruct allelic and genotypic counts/frequencies for each SNP from case-control GWAS summary statistics. Our framework is simple and efficient without the need of any complicated underlying assumptions. Illustrating the great potential of this framework we also propose three summary-statistics-based applications implemented in a new software package (ReACt): GWAS meta-analysis (with and without sample overlap), case-case GWAS, and, for the first time, group polygenic risk score (PRS) estimation. We evaluate our methods against the current state-of-the-art on both synthetic data and real genotype data and show high performance in power and error control. Our novel group PRS method based on summary statistics could not be achieved prior to our proposed framework. We demonstrate here the potential applications and advantages of this approach. Our work further highlights the great potential of summary-statistics-based methodologies towards elucidating the genetic background of complex disease and opens up new avenues for research.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • ↵* ppaschou{at}purdue.edu

  • ↵† pdrineas{at}purdue.edu

  • Updated title and added discussion

  • https://github.com/Paschou-Lab/ReAct

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted April 07, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Reconstructing SNP Allele and Genotype Frequencies from GWAS Summary Statistics
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Reconstructing SNP Allele and Genotype Frequencies from GWAS Summary Statistics
Zhiyu Yang, Peristera Paschou, Petros Drineas
bioRxiv 2021.04.02.438281; doi: https://doi.org/10.1101/2021.04.02.438281
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Reconstructing SNP Allele and Genotype Frequencies from GWAS Summary Statistics
Zhiyu Yang, Peristera Paschou, Petros Drineas
bioRxiv 2021.04.02.438281; doi: https://doi.org/10.1101/2021.04.02.438281

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2639)
  • Biochemistry (5231)
  • Bioengineering (3653)
  • Bioinformatics (15748)
  • Biophysics (7226)
  • Cancer Biology (5604)
  • Cell Biology (8060)
  • Clinical Trials (138)
  • Developmental Biology (4748)
  • Ecology (7477)
  • Epidemiology (2059)
  • Evolutionary Biology (10535)
  • Genetics (7708)
  • Genomics (10093)
  • Immunology (5168)
  • Microbiology (13844)
  • Molecular Biology (5361)
  • Neuroscience (30629)
  • Paleontology (213)
  • Pathology (873)
  • Pharmacology and Toxicology (1520)
  • Physiology (2236)
  • Plant Biology (4992)
  • Scientific Communication and Education (1039)
  • Synthetic Biology (1382)
  • Systems Biology (4135)
  • Zoology (808)