Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Leveraging correlations between polygenic risk score predictors to detect heterogeneity in GWAS cohorts

Jie Yuan, Henry Xing, Alexandre Lamy, The Schizophrenia Working Group of the Psychiatric Genomics Consortium, Todd Lencz, Itsik Pe’er
doi: https://doi.org/10.1101/827162
Jie Yuan
1Department of Computer Science, Columbia University, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jyuan@cs.columbia.edu
Henry Xing
1Department of Computer Science, Columbia University, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandre Lamy
1Department of Computer Science, Columbia University, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Todd Lencz
2The Center for Psychiatric Neuroscience,, Feinstein Institutes for Medical Research, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Itsik Pe’er
1Department of Computer Science, Columbia University, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Evidence from both GWAS and clinical observation has suggested that certain psychiatric, metabolic, and autoimmune diseases are heterogeneous, comprising multiple subtypes with distinct genomic etiologies and Polygenic Risk Scores (PRS). However, the presence of subtypes within many phenotypes is frequently unknown. We present CLiP (Correlated Liability Predictors), a method to detect heterogeneity in single GWAS cohorts. CLiP calculates a weighted sum of correlations between SNPs contributing to a PRS on the case/control liability scale. We demonstrate mathematically and through simulation that among i.i.d. homogeneous cases, significant anti-correlations are expected between otherwise independent predictors due to ascertainment on the hidden liability score. In the presence of heterogeneity from distinct etiologies, confounding by covariates, or mislabeling, these correlation patterns are altered predictably. We further extend our method to two additional association study designs: CLiP-X for quantitative predictors in applications such as transcriptome-wide association, and CLiP-Y for quantitative phenotypes, where there is no clear distinction between cases and controls. Through simulations, we demonstrate that CLiP and its extensions reliably distinguish between homogeneous and heterogeneous cohorts when the PRS explains as low as 5% of variance on the liability scale and cohorts comprise 50, 000 − 100, 000 samples, an increasingly practical size for modern GWAS. We apply CLiP to heterogeneity detection in schizophrenia cohorts totaling > 50, 000 cases and controls collected by the Psychiatric Genomics Consortium. We observe significant heterogeneity in mega-analysis of the combined PGC data (p-value 8.54e-4), as well as in individual cohorts meta-analyzed using Fisher’s method (p-value 0.03), based on significantly associated variants.

Footnotes

  • https://github.com/jyuan1322/CLiP

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted November 01, 2019.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Leveraging correlations between polygenic risk score predictors to detect heterogeneity in GWAS cohorts
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Leveraging correlations between polygenic risk score predictors to detect heterogeneity in GWAS cohorts
Jie Yuan, Henry Xing, Alexandre Lamy, The Schizophrenia Working Group of the Psychiatric Genomics Consortium, Todd Lencz, Itsik Pe’er
bioRxiv 827162; doi: https://doi.org/10.1101/827162
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Leveraging correlations between polygenic risk score predictors to detect heterogeneity in GWAS cohorts
Jie Yuan, Henry Xing, Alexandre Lamy, The Schizophrenia Working Group of the Psychiatric Genomics Consortium, Todd Lencz, Itsik Pe’er
bioRxiv 827162; doi: https://doi.org/10.1101/827162

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4231)
  • Biochemistry (9123)
  • Bioengineering (6769)
  • Bioinformatics (23971)
  • Biophysics (12110)
  • Cancer Biology (9511)
  • Cell Biology (13754)
  • Clinical Trials (138)
  • Developmental Biology (7623)
  • Ecology (11678)
  • Epidemiology (2066)
  • Evolutionary Biology (15495)
  • Genetics (10633)
  • Genomics (14312)
  • Immunology (9474)
  • Microbiology (22825)
  • Molecular Biology (9087)
  • Neuroscience (48922)
  • Paleontology (355)
  • Pathology (1480)
  • Pharmacology and Toxicology (2566)
  • Physiology (3842)
  • Plant Biology (8322)
  • Scientific Communication and Education (1468)
  • Synthetic Biology (2295)
  • Systems Biology (6183)
  • Zoology (1299)