Estimating the selective effects of heterozygous protein-truncating variants from human exome data

Nat Genet. 2017 May;49(5):806-810. doi: 10.1038/ng.3831. Epub 2017 Apr 3.

Abstract

The evolutionary cost of gene loss is a central question in genetics and has been investigated in model organisms and human cell lines. In humans, tolerance of the loss of one or both functional copies of a gene is related to the gene's causal role in disease. However, estimates of the selection and dominance coefficients in humans have been elusive. Here we analyze exome sequence data from 60,706 individuals to make genome-wide estimates of selection against heterozygous loss of gene function. Using this distribution of selection coefficients for heterozygous protein-truncating variants (PTVs), we provide corresponding Bayesian estimates for individual genes. We find that genes under the strongest selection are enriched in embryonic lethal mouse knockouts, Mendelian disease-associated genes, and regulators of transcription. Screening by essentiality, we find a large set of genes under strong selection that are likely to have crucial functions but have not yet been thoroughly characterized.

MeSH terms

  • Algorithms
  • Animals
  • Bayes Theorem
  • Exome / genetics*
  • Gene Frequency
  • Genetic Predisposition to Disease / genetics
  • Genetic Variation*
  • Genome-Wide Association Study / methods*
  • Genotype
  • Heterozygote
  • Humans
  • Mice, Knockout
  • Models, Genetic
  • Mutation
  • Selection, Genetic*
  • Sequence Analysis, DNA / methods
  • Sequence Analysis, DNA / statistics & numerical data