Integrating healthcare and research genetic data empowers the discovery of 49 novel developmental disorders
Summary
De novo mutations (DNMs) in protein-coding genes are a well-established cause of developmental disorders (DD). However, known DD-associated genes only account for a minority of the observed excess of such DNMs. To identify novel DD-associated genes, we integrated healthcare and research exome sequences on 31,058 DD parent-offspring trios, and developed a simulation-based statistical test to identify gene-specific enrichments of DNMs. We identified 299 significantly DD-associated genes, including 49 not previously robustly associated with DDs. Despite detecting more DD-associated genes than in any previous study, much of the excess of DNMs of protein-coding genes remains unaccounted for. Modelling suggests that over 500 novel DD-associated genes await discovery, many of which are likely to be less penetrant than the currently known genes. Research access to clinical diagnostic datasets will be critical for completing the map of dominant DDs.
Subject Area
- Biochemistry (12930)
- Bioengineering (9831)
- Bioinformatics (31535)
- Biophysics (16248)
- Cancer Biology (13331)
- Cell Biology (19012)
- Clinical Trials (138)
- Developmental Biology (10306)
- Ecology (15312)
- Epidemiology (2067)
- Evolutionary Biology (19545)
- Genetics (12975)
- Genomics (17930)
- Immunology (13049)
- Microbiology (30486)
- Molecular Biology (12711)
- Neuroscience (66537)
- Paleontology (489)
- Pathology (2059)
- Pharmacology and Toxicology (3547)
- Physiology (5527)
- Plant Biology (11396)
- Synthetic Biology (3166)
- Systems Biology (7829)
- Zoology (1769)