Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Mendelian imputation of parental genotypes for genome-wide estimation of direct and indirect genetic effects

View ORCID ProfileAlexander I. Young, Seyed Moeen Nehzati, View ORCID ProfileChanwook Lee, View ORCID ProfileStefania Benonisdottir, View ORCID ProfileDavid Cesarini, Daniel J. Benjamin, Patrick Turley, View ORCID ProfileAugustine Kong
doi: https://doi.org/10.1101/2020.07.02.185199
Alexander I. Young
1Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
2Center for Economic and Social Research, University of Southern California, Los Angeles, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alexander I. Young
  • For correspondence: alextisyoung@gmail.com augustine.kong@bdi.ox.ac.uk
Seyed Moeen Nehzati
2Center for Economic and Social Research, University of Southern California, Los Angeles, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chanwook Lee
3Department of Economics, Harvard University, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Chanwook Lee
Stefania Benonisdottir
1Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Stefania Benonisdottir
David Cesarini
4National Bureau of Economic Research, Cambridge, MA, USA
5Department of Economics, New York University, New York, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for David Cesarini
Daniel J. Benjamin
2Center for Economic and Social Research, University of Southern California, Los Angeles, CA, USA
4National Bureau of Economic Research, Cambridge, MA, USA
6Department of Economics, University of Southern California, Los Angeles, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Patrick Turley
7Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA
8Stanley Centre for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Augustine Kong
1Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Augustine Kong
  • For correspondence: alextisyoung@gmail.com augustine.kong@bdi.ox.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Associations between genotype and phenotype derive from four sources: direct genetic effects, indirect genetic effects from relatives, population stratification, and correlations with other variants affecting the phenotype through assortative mating. Genome-wide association studies (GWAS) of unrelated individuals have limited ability to distinguish the different sources of genotype-phenotype association, confusing interpretation of results and potentially leading to bias when those results are applied – in genetic prediction of traits, for example. With genetic data on families, the randomisation of genetic material during meiosis can be used to distinguish direct genetic effects from other sources of genotype-phenotype association. Genetic data on siblings is the most common form of genetic data on close relatives. We develop a method that takes advantage of identity-by-descent sharing between siblings to impute missing parental genotypes. Compared to no imputation, this increases the effective sample size for estimation of direct genetic effects and indirect parental effects by up to one third and one half respectively. We develop a related method for imputing missing parental genotypes when a parent-offspring pair is observed. We provide the imputation methods in a software package, SNIPar (single nucleotide imputation of parents), that also estimates genome-wide direct and indirect effects of SNPs. We apply this to a sample of 45,826 White British individuals in the UK Biobank who have at least one genotyped first degree relative. We estimate direct and indirect genetic effects for ∼5 million genome-wide SNPs for five traits. We estimate the correlation between direct genetic effects and effects estimated by standard GWAS to be 0.61 (S.E. 0.09) for years of education, 0.68 (S.E. 0.10) for neuroticism, 0.72 (S.E. 0.09) for smoking initiation, 0.87 (S.E. 0.04) for BMI, and 0.96 (S.E. 0.01) for height. These results suggest that GWAS based on unrelated individuals provides an inaccurate picture of direct genetic effects for certain human traits.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://github.com/AlexTISYoung/SNIPar

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted July 03, 2020.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Mendelian imputation of parental genotypes for genome-wide estimation of direct and indirect genetic effects
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Mendelian imputation of parental genotypes for genome-wide estimation of direct and indirect genetic effects
Alexander I. Young, Seyed Moeen Nehzati, Chanwook Lee, Stefania Benonisdottir, David Cesarini, Daniel J. Benjamin, Patrick Turley, Augustine Kong
bioRxiv 2020.07.02.185199; doi: https://doi.org/10.1101/2020.07.02.185199
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Mendelian imputation of parental genotypes for genome-wide estimation of direct and indirect genetic effects
Alexander I. Young, Seyed Moeen Nehzati, Chanwook Lee, Stefania Benonisdottir, David Cesarini, Daniel J. Benjamin, Patrick Turley, Augustine Kong
bioRxiv 2020.07.02.185199; doi: https://doi.org/10.1101/2020.07.02.185199

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4230)
  • Biochemistry (9123)
  • Bioengineering (6766)
  • Bioinformatics (23968)
  • Biophysics (12109)
  • Cancer Biology (9510)
  • Cell Biology (13753)
  • Clinical Trials (138)
  • Developmental Biology (7623)
  • Ecology (11674)
  • Epidemiology (2066)
  • Evolutionary Biology (15492)
  • Genetics (10631)
  • Genomics (14310)
  • Immunology (9473)
  • Microbiology (22821)
  • Molecular Biology (9086)
  • Neuroscience (48919)
  • Paleontology (355)
  • Pathology (1480)
  • Pharmacology and Toxicology (2566)
  • Physiology (3840)
  • Plant Biology (8322)
  • Scientific Communication and Education (1468)
  • Synthetic Biology (2295)
  • Systems Biology (6180)
  • Zoology (1299)