PT - JOURNAL ARTICLE AU - Dalia Mizikovsky AU - Marina Naval Sanchez AU - Christian M. Nefzger AU - Gabriel Cuellar Partida AU - Nathan J. Palpant TI - Organisation of gene programs revealed by unsupervised analysis of diverse gene-trait associations AID - 10.1101/2022.04.07.487559 DP - 2022 Jan 01 TA - bioRxiv PG - 2022.04.07.487559 4099 - http://biorxiv.org/content/early/2022/05/27/2022.04.07.487559.short 4100 - http://biorxiv.org/content/early/2022/05/27/2022.04.07.487559.full AB - Genome wide association studies provide statistical measures of gene-trait associations that reveal how genetic variation influences phenotypes. This study develops an unsupervised dimensionality reduction method called UnTANGLeD (Unsupervised Trait Analysis of Networks from Gene Level Data) which organises 16,849 genes into discrete gene programs by measuring the statistical association between genetic variants and 1,393 diverse complex traits. UnTANGLeD reveals 173 gene clusters enriched for protein-protein interactions and highly distinct biological processes governing development, signalling, disease, and homeostasis. We identify diverse gene networks with robust interactions but not associated with known biological processes. Analysis of independent disease traits shows that UnTANGLeD gene clusters are conserved across all complex traits, providing a simple and powerful framework to predict novel gene candidates and programs influencing orthogonal disease phenotypes. Collectively, this study demonstrates that gene programs co-ordinately orchestrating cell functions can be identified without reliance on prior knowledge, providing a method for use in functional annotation, hypothesis generation, machine learning and prediction algorithms, and the interpretation of diverse genomic data.Competing Interest StatementGabriel Cuellar Partida is currently an employee of 23andMe Inc. and holds stock options for the company.