TY - JOUR T1 - DEEPScreen: High Performance Drug-Target Interaction Prediction with Convolutional Neural Networks Using 2-D Structural Compound Representations JF - bioRxiv DO - 10.1101/491365 SP - 491365 AU - Ahmet Sureyya Rifaioglu AU - Volkan Atalay AU - Maria Jesus Martin AU - Rengul Cetin-Atalay AU - Tunca Doğan Y1 - 2018/01/01 UR - http://biorxiv.org/content/early/2018/12/09/491365.abstract N2 - The identification of physical interactions between drug candidate chemical substances and target biomolecules is an important step in the process of drug discovery, where the standard procedure is the systematic screening of chemical compounds against pre-selected target proteins. However, experimental screening procedures are expensive and time consuming, therefore, it is not possible to carry out comprehensive tests. Within the last decade, computational approaches have been developed with the objective of aiding experimental studies by predicting novel drug-target interactions (DTI), via the construction and application of statistical models. In this study, we propose a large-scale DTI interaction prediction system, DEEPScreen, for early stage drug discovery, using convolutional deep neural networks. One of the main advantages of DEEPScreen is employing readily available simple 2-D images of compounds at the input level instead of engineered complex feature vectors that displayed limited performance in DTI prediction tasks previously. DEEPScreen learns complex features inherently from the 2-D molecular representations, thus producing highly accurate predictions. DEEPScreen system was trained for 704 target proteins (using ChEMBL curated bioactivity data) and finalized with rigorous hyper-parameter optimization tests. We compared the performance of DEEPScreen against shallow classifiers such as the random forest, logistic regression and support vector machines, to indicate the effectiveness of the proposed deep learning approach. Additionally, we compared DEEPScreen with other deep learning based state-of-the-art DTI predictors on widely used benchmark datasets and showed that DEEPScreen produces better or comparable results to the top performers. The method proposed here can be employed to computationally scan a large portion of the recorded drug candidate compound and protein spaces to aid the experimentalists working in the field of drug discovery and repurposing by providing a preselection of interesting novel DTIs. ER -