Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

The population genetics of human disease: the case of recessive, lethal mutations

Carlos Eduardo G. Amorim, Ziyue Gao, Zachary Baker, José Francisco Diesel, Yuval B. Simons, Imran S. Haque, Joseph Pickrell, Molly Przeworski
doi: https://doi.org/10.1101/091579
Carlos Eduardo G. Amorim
1Department of Biological Sciences, Columbia University, New York, NY
2CAPES Foundation, Ministry of Education of Brazil, Brasília, DF, Brazil
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: guerraamorim@gmail.com
Ziyue Gao
3Howard Hughes Medical Institution, Stanford University, Stanford, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zachary Baker
4Department of Systems Biology, Columbia University, New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
José Francisco Diesel
5Universidade Federal de Santa Maria, Santa Maria, RS, Brazil
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuval B. Simons
1Department of Biological Sciences, Columbia University, New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Imran S. Haque
6Counsyl, 180 Kimball Way, South San Francisco, CA. Current address: Freenome, 201 Gateway Blvd, South San Francisco, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joseph Pickrell
1Department of Biological Sciences, Columbia University, New York, NY
7New York Genome Center, New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Molly Przeworski
1Department of Biological Sciences, Columbia University, New York, NY
4Department of Systems Biology, Columbia University, New York, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Do the frequencies of disease mutations in human populations reflect a simple balance between mutation and purifying selection? What other factors shape the prevalence of disease mutations? To begin to answer these questions, we focused on one of the simplest cases: recessive mutations that alone cause lethal diseases or complete sterility. To this end, we generated a hand-curated set of 417 Mendelian mutations in 32 genes, reported to cause a recessive, lethal Mendelian disease. We then considered analytic models of mutation-selection balance in infinite and finite populations of constant sizes and simulations of purifying selection in a more realistic demographic setting, and tested how well these models fit allele frequencies estimated from 33,370 individuals of European ancestry. In doing so, we distinguished between CpG transitions, which occur at a substantially elevated rate, and three other mutation types. The observed frequency for CpG transitions is slightly higher than expectation but close, whereas the frequencies observed for the three other mutation types are an order of magnitude higher than expected. This discrepancy is even larger when subtle fitness effects in heterozygotes or lethal compound heterozygotes are taken into account. In principle, higher than expected frequencies of disease mutations could be due to widespread errors in reporting causal variants, compensation by other mutations, or balancing selection. It is unclear why these factors would have a greater impact on variants with lower mutation rates, however. We argue instead that the unexpectedly high frequency of disease mutations and the relationship to the mutation rate likely reflect an ascertainment bias: of all the mutations that cause recessive lethal diseases, those that by chance have reached higher frequencies are more likely to have been identified and thus to have been included in this study. Beyond the specific application, this study highlights the parameters likely to be important in shaping the frequencies of Mendelian disease alleles.

Author Summary What determines the frequencies of disease mutations in human populations? To begin to answer this question, we focus on one of the simplest cases: mutations that cause completely recessive, lethal Mendelian diseases. We first review theory about what to expect from mutation and selection in a population of finite size and further generate predictions based on simulations using a realistic demographic scenario of human evolution. For a highly mutable type of mutations, such as transitions at CpG sites, we find that the predictions are close to the observed frequencies of recessive lethal disease mutations. For less mutable types, however, predictions substantially under-estimate the observed frequency. We discuss possible explanations for the discrepancy and point to a complication that, to our knowledge, is not widely appreciated: that there exists ascertainment bias in disease mutation discovery. Specifically, we suggest that alleles that have been identified to date are likely the ones that by chance have reached higher frequencies and are thus more likely to have been mapped. More generally, our study highlights the factors that influence the frequencies of Mendelian disease alleles.

Footnotes

  • ↵+ These authors co-supervised this work

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted May 18, 2017.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The population genetics of human disease: the case of recessive, lethal mutations
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The population genetics of human disease: the case of recessive, lethal mutations
Carlos Eduardo G. Amorim, Ziyue Gao, Zachary Baker, José Francisco Diesel, Yuval B. Simons, Imran S. Haque, Joseph Pickrell, Molly Przeworski
bioRxiv 091579; doi: https://doi.org/10.1101/091579
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
The population genetics of human disease: the case of recessive, lethal mutations
Carlos Eduardo G. Amorim, Ziyue Gao, Zachary Baker, José Francisco Diesel, Yuval B. Simons, Imran S. Haque, Joseph Pickrell, Molly Przeworski
bioRxiv 091579; doi: https://doi.org/10.1101/091579

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3479)
  • Biochemistry (7318)
  • Bioengineering (5296)
  • Bioinformatics (20196)
  • Biophysics (9976)
  • Cancer Biology (7701)
  • Cell Biology (11249)
  • Clinical Trials (138)
  • Developmental Biology (6417)
  • Ecology (9915)
  • Epidemiology (2065)
  • Evolutionary Biology (13276)
  • Genetics (9352)
  • Genomics (12551)
  • Immunology (7673)
  • Microbiology (18937)
  • Molecular Biology (7417)
  • Neuroscience (40887)
  • Paleontology (298)
  • Pathology (1226)
  • Pharmacology and Toxicology (2125)
  • Physiology (3140)
  • Plant Biology (6837)
  • Scientific Communication and Education (1270)
  • Synthetic Biology (1891)
  • Systems Biology (5296)
  • Zoology (1084)