Abstract
Most patients with hereditary rare diseases do not receive a molecular diagnosis and the aetiological variants and mediating genes for half such disorders remain to be discovered. We implemented whole-genome sequencing (WGS) in a national healthcare system to streamline diagnosis and to discover unknown aetiological variants, in the coding and non-coding regions of the genome. In a pilot study for the 100,000 Genomes Project, we generated WGS data for 13,037 participants, of whom 9,802 had a rare disease, and provided a genetic diagnosis to 1,040 of the 7,065 patients with detailed phenotypic data. We identified 99 Mendelian associations between genes and rare diseases, of which at least 80 are confirmed aetiological. Using WGS of UK Biobank, we showed that rare alleles can explain the presence of some individuals in the tails of a quantitative red blood cell (RBC) trait 1. Finally, we reported novel non-coding variants which cause disease through the disruption of transcription of ARPC1B, GATA1, LRBA and MPL. Our study demonstrates a synergy by using WGS for diagnosis and aetiological discovery in routine healthcare.