ABSTRACT
We do not know the extent to which genetic interactions affect the observed phenotype in diseases, because the current interaction detection approaches are limited: they only consider interactions between the top SNPs of each gene, and only simple forms of interaction. We introduce methods for increasing the statistical power of interaction detection by taking into account all SNPs and complex interactions between them, beyond only the currently considered multiplicative relationships. In brief, the relation between SNPs and a phenotype is captured by a gene interaction neural network (NN), and the interactions are quantified by the Shapley score between hidden nodes, which are gene representations that optimally combine information from all SNPs in the gene. Additionally, we design a new permutation procedure tailored for NNs to assess the significance of interactions. The new approach outperformed existing alternatives on simulated datasets, and in a cholesterol study on the UK Biobank it detected six interactions which replicated on an independent FINRISK dataset, four of them novel findings.
Competing Interest Statement
The authors have declared no competing interest.