PT - JOURNAL ARTICLE AU - Christopher A. Cassa AU - Donate Weghorn AU - Daniel J. Balick AU - Daniel M. Jordan AU - David Nusinow AU - Kaitlin E. Samocha AU - Anne O'Donnell Luria AU - Daniel G. MacArthur AU - Mark J. Daly AU - David R. Beier AU - Shamil R. Sunyaev TI - Estimating the Selective Effect of Heterozygous Protein Truncating Variants from Human Exome Data AID - 10.1101/075523 DP - 2016 Jan 01 TA - bioRxiv PG - 075523 4099 - http://biorxiv.org/content/early/2016/09/16/075523.short 4100 - http://biorxiv.org/content/early/2016/09/16/075523.full AB - The dispensability of individual genes for viability has interested generations of geneticists. For some genes it is essential to maintain two functional chromosomal copies, while other genes may tolerate the loss of one or both copies. Exome sequence data from 60,706 individuals provide sufficient observations of rare protein truncating variants (PTVs) to make genome-wide estimates of selection against heterozygous loss of gene function. The cumulative frequency of rare deleterious PTVs is primarily determined by the balance between incoming mutations and purifying selection rather than genetic drift. This enables the estimation of the genome-wide distribution of selection coefficients for heterozygous PTVs and corresponding Bayesian estimates for individual genes. The strength of selection can help discriminate the severity, age of onset, and mode of inheritance in Mendelian exome sequencing cases. We find that genes under the strongest selection are enriched in embryonic lethal mouse knockouts, putatively cell-essential genes inferred from human tumor cells, Mendelian disease genes, and regulators of transcription. Using an essentiality screen, we find a large set of genes under strong selection that are likely to have critical function but that have not yet been studied extensively.