TY - JOUR T1 - Pathway based factor analysis of gene expression data produces highly heritable phenotypes that associate with age JF - bioRxiv DO - 10.1101/016154 SP - 016154 AU - Andrew Brown AU - Zhihao Ding AU - Ana Viñuela AU - Dan Glass AU - Leopold Parts AU - Tim Spector AU - John Winn AU - Richard Durbin Y1 - 2015/01/01 UR - http://biorxiv.org/content/early/2015/03/06/016154.abstract N2 - Statistical factor analysis methods have previously been used to remove noise components from high dimensional data prior to genetic association mapping, and in a guided fashion to summarise biologically relevant sources of variation. Here we show how the derived factors summarising pathway expression can be used to analyse the relationships between expression, heritability and ageing. We used skin gene expression data from 647 twins from the MuTHER Consortium and applied factor analysis to concisely summarise patterns of gene expression, both to remove broad confounding influences and to produce concise pathway-level phenotypes. We derived 930 “pathway phe-notypes” which summarised patterns of variation across 186 KEGG pathways (five phenotypes per pathway). We identified 69 significant associations of age with phenotype from 57 distinct KEGG pathways at a stringent Bon-ferroni threshold (P < 5.38 × 10−5). These phenotypes are more heritable (h2 = 0.32) than gene expression levels. On average, expression levels of 16% of genes within these pathways are associated with age. Several significant pathways relate to metabolising sugars and fatty acids, others with insulin signalling. We have demonstrated that factor analysis methods combined with biological knowledge can produce more reliable phenotypes with less stochastic noise than the individual gene expression levels, which increases our power to discover biologically relevant associations. These phenotypes could also be applied to discover associations with other environmental factors. ER -