Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries

Zhili Zheng, Shouye Liu, Julia Sidorenko, Loic Yengo, Patrick Turley, Alireza Ani, Rujia Wang, Ilja M. Nolte, Harold Snieder, Lifelines Cohort Study, View ORCID ProfileJian Yang, View ORCID ProfileNaomi R Wray, Michael E Goddard, Peter M Visscher, View ORCID ProfileJian Zeng
doi: https://doi.org/10.1101/2022.10.12.510418
Zhili Zheng
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
2Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
3Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
4Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shouye Liu
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Julia Sidorenko
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Loic Yengo
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Patrick Turley
5Center for Economic and Social Research, University of Southern California, Los Angeles, CA, USA
6Department of Economics, University of Southern California, Los Angeles, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alireza Ani
7Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
8Department of Bioinformatics, Isfahan University of Medical Sciences, Isfahan, Iran
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rujia Wang
7Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ilja M. Nolte
7Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harold Snieder
7Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jian Yang
10School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
11Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jian Yang
Naomi R Wray
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
12Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Naomi R Wray
Michael E Goddard
13Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, Victoria, Australia
14Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, Victoria, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Peter M Visscher
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jian Zeng
1Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jian Zeng
  • For correspondence: j.zeng@uq.edu.au
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

We develop a new method, SBayesRC, that integrates GWAS summary statistics with functional genomic annotations to improve polygenic prediction of complex traits. Our method is scalable to whole-genome variant analysis and refines signals from functional annotations by allowing them to affect both causal variant probability and causal effect distribution. We analyse 28 traits in the UK Biobank using ∼7 million common SNPs and 96 annotations. SBayesRC improves prediction accuracy by 14% in European ancestry and by up to 33% in trans-ancestry prediction, compared to the baseline method SBayesR which does not use annotations, and outperforms state-of-the-art methods LDpred-funct, PolyPred-S and PRS-CSx by 12-15%. Investigation of factors affecting prediction accuracy identified a significant interaction between SNP density and annotation information, encouraging future use of whole-genome sequence variants for prediction. Functional partitioning analysis highlights a major contribution of evolutionary constrained regions to prediction accuracy and the largest per-SNP contribution from non-synonymous SNPs.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • ↵9 A full list of members and affiliations appears in the Supplementary Note

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted October 14, 2022.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries
Zhili Zheng, Shouye Liu, Julia Sidorenko, Loic Yengo, Patrick Turley, Alireza Ani, Rujia Wang, Ilja M. Nolte, Harold Snieder, Lifelines Cohort Study, Jian Yang, Naomi R Wray, Michael E Goddard, Peter M Visscher, Jian Zeng
bioRxiv 2022.10.12.510418; doi: https://doi.org/10.1101/2022.10.12.510418
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Leveraging functional genomic annotations and genome coverage to improve polygenic prediction of complex traits within and between ancestries
Zhili Zheng, Shouye Liu, Julia Sidorenko, Loic Yengo, Patrick Turley, Alireza Ani, Rujia Wang, Ilja M. Nolte, Harold Snieder, Lifelines Cohort Study, Jian Yang, Naomi R Wray, Michael E Goddard, Peter M Visscher, Jian Zeng
bioRxiv 2022.10.12.510418; doi: https://doi.org/10.1101/2022.10.12.510418

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4383)
  • Biochemistry (9601)
  • Bioengineering (7097)
  • Bioinformatics (24868)
  • Biophysics (12621)
  • Cancer Biology (9959)
  • Cell Biology (14358)
  • Clinical Trials (138)
  • Developmental Biology (7954)
  • Ecology (12110)
  • Epidemiology (2067)
  • Evolutionary Biology (15989)
  • Genetics (10929)
  • Genomics (14745)
  • Immunology (9871)
  • Microbiology (23680)
  • Molecular Biology (9486)
  • Neuroscience (50884)
  • Paleontology (369)
  • Pathology (1540)
  • Pharmacology and Toxicology (2683)
  • Physiology (4019)
  • Plant Biology (8657)
  • Scientific Communication and Education (1510)
  • Synthetic Biology (2397)
  • Systems Biology (6439)
  • Zoology (1346)