Project MinE: study design and pilot analyses of a large-scale whole genome sequencing study in amyotrophic lateral sclerosis
Abstract
The most recent genome-wide association study in amyotrophic lateral sclerosis (ALS) demonstrates a disproportionate contribution from low-frequency variants to genetic susceptibility of disease. We have therefore begun Project MinE, an international collaboration that seeks to analyse whole-genome sequence data of at least 15,000 ALS patients and 7,500 controls. Here, we report on the design of Project MinE and pilot analyses of newly whole-genome sequenced 1,264 ALS patients and 611 controls drawn from the Netherlands. As has become characteristic of sequencing studies, we find an abundance of rare genetic variation (minor allele frequency < 0.1 %), the vast majority of which is absent in public data sets. Principal component analysis reveals local geographical clustering of these variants within The Netherlands. We use the whole-genome sequence data to explore the implications of poor geographical matching of cases and controls in a sequence-based disease study and to investigate how ancestry-matched, externally sequenced controls can induce false positive associations. Also, we have publicly released genome-wide minor allele counts in cases and controls, as well as results from genic burden tests.
Subject Area
- Biochemistry (11730)
- Bioengineering (8743)
- Bioinformatics (29179)
- Biophysics (14964)
- Cancer Biology (12080)
- Cell Biology (17399)
- Clinical Trials (138)
- Developmental Biology (9417)
- Ecology (14174)
- Epidemiology (2067)
- Evolutionary Biology (18294)
- Genetics (12233)
- Genomics (16791)
- Immunology (11858)
- Microbiology (28051)
- Molecular Biology (11575)
- Neuroscience (60919)
- Paleontology (451)
- Pathology (1870)
- Pharmacology and Toxicology (3238)
- Physiology (4955)
- Plant Biology (10422)
- Synthetic Biology (2881)
- Systems Biology (7338)
- Zoology (1650)