Abstract
Biological networks constructed from varied data, including protein-protein interactions, gene expression data, and genetic interactions can be used to map cellular function, but each data type has individual limitations such as bias and incompleteness. Network integration promises to address these limitations by combining and automatically weighting input information to obtain a more accurate and comprehensive representation of the underlying biology. However, existing network integration methods may fail to adequately scale to the number of nodes and networks present in genome-scale data, may perform poorly, and may not handle partial network overlap. To address these issues, we developed a deep learning-based network integration algorithm that incorporates a graph convolutional network (GCN) framework to effectively learn dependencies between any input network. Our method, BIONIC (Biological Network Integration using Convolutions), learns features which contain substantially more functional information compared to existing approaches, linking genes that share diverse functional relationships, including co-complex and shared bioprocess annotation. BIONIC can integrate networks in a fully unsupervised manner if functional gene annotations are not available, and it can also leverage available annotations in a semi-supervised manner. BIONIC is scalable in both size and quantity of the input networks, making it feasible to integrate numerous networks on the scale of the human genome. To demonstrate the utility of BIONIC in identifying novel biology, we predicted essential gene chemical-genetic interactions from a small set of diagnostic non-essential gene profiles in yeast, and experimentally validated these predictions. BIONIC correctly predicted many chemical-genetic interactions, and it correctly predicted genes that are required for proper β-1,6-glucan synthesis as significant interactions with the bioactive compound pseudojervine.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
https://figshare.com/articles/dataset/Hyperparameter_Optimization_Results/15025995
https://figshare.com/articles/dataset/Integrated_Networks_Details_xlsx/14866917
https://figshare.com/articles/dataset/Figure_2_Evaluation_Standards_Details/14983683
https://figshare.com/articles/dataset/Figure_2_Module_Detection_Results/14975298
https://figshare.com/articles/dataset/Fig_S3_S4_Module_Detection_Results/14976747
https://figshare.com/articles/dataset/50_Compound_TS_Allele_Screen_Results/16645894
https://figshare.com/articles/dataset/Essential_Gene_Compound_Sensitivity_Predictions/15158037
https://figshare.com/articles/dataset/BIONIC_Yeast_Features/16632286
https://figshare.com/articles/dataset/Evaluation_Standards/16629139