Interpretable Drug Target Predictions using Self-Expressiveness

Diego Galeano; Santiago Noto; Ruben Jimenez; Alberto Paccanaro

doi:10.1101/2021.03.01.433365

Abstract

The identification of missing drug targets is critical for the development of treatments and for the molecular elucidation of drug side effects. Drug targets have been predicted by exploiting molecular, biological or pharmacological features of drugs and protein targets. Yet, developing integrative and interpretable machine learning models for predicting drug targets remains a challenging task. We present Inception, an integrative and interpretable matrix completion model for predicting drug targets. Inception is a self-expressive model that learns two similarity matrices: one for drugs and another for protein targets. These learned similarity matrices are key for our models’ interpretability: they can explain how a predicted drug-target interaction can be explain in terms of a linear combination of chemical, biological and pharmacological similarities. We develop a novel objective function with efficient closed-form solution. To demonstrate the ability of Inception at recovering missing drug-target interactions (DTIs), we perform cross-validation experiments with stringent controls of data imbalance, chemical similarities between drugs and sequence similarities between targets. We also assess the performance of our model using a simulated prospective approach. Having trained our model with DTIs from a snapshot 2011 of the DrugBank database, we test whether we could predict DTIs from a 2020 snapshot of DrugBank. Inception outperforms two state-of-the-art drug target prediction models in all the scenarios. This suggests that Inception could be useful for predicting missing drug target interactions while providing interpretable predictions.