Whole genome association testing in 333,100 individuals across three biobanks identifies rare non-coding single variant and genomic aggregate associations with height
Abstract
The role of rare non-coding variation in complex human phenotypes is still largely unknown. To elucidate the impact of rare variants in regulatory elements, we performed a whole-genome sequencing association analysis for height using 333,100 individuals from three datasets: UK Biobank (N=200,003), TOPMed (N=87,652) and All of Us (N=45,445). We performed rare (minor allele frequency <0.1%) single-variant and aggregate testing of non-coding variants in regulatory regions based on proximal, intergenic and deep-intronic annotation. We observed 29 independent variants associated with height at P < 6 × 10⁻¹⁰ after conditioning on previously reported variants, with effect sizes ranging from −7 cm to +4.7 cm. We also identified and replicated non-coding aggregate-based associations proximal to HMGA1, containing variants associated with 5 cm greater height, and of highly conserved variants in MIR497HG on chromosome 17. We have developed a novel approach for identifying large-effect rare non-coding variants in regulatory regions that are associated with complex traits from whole-genome sequencing data.
Competing Interest Statement
Bruce M. Psaty serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. Xihong Lin is a consultant of AbbVie Pharmaceuticals and Verily Life Sciences. The remaining authors declare no competing interests.