Protein-Ligand Scoring with Convolutional Neural Networks

J Chem Inf Model. 2017 Apr 24;57(4):942-957. doi: 10.1021/acs.jcim.6b00740. Epub 2017 Apr 11.

Abstract

Computational approaches to drug discovery can reduce the time and cost associated with experimental assays and enable the screening of novel chemotypes. Structure-based drug design methods rely on scoring functions to rank and predict binding affinities and poses. The ever-expanding amount of protein-ligand binding and structural data enables the use of deep machine learning techniques for protein-ligand scoring. We describe convolutional neural network (CNN) scoring functions that take as input a comprehensive three-dimensional (3D) representation of a protein-ligand interaction. A CNN scoring function automatically learns the key features of protein-ligand interactions that correlate with binding. We train and optimize our CNN scoring functions to discriminate between correct and incorrect binding poses and known binders and nonbinders. We find that our CNN scoring function outperforms the AutoDock Vina scoring function when ranking poses both for pose prediction and virtual screening.

MeSH terms

  • Computational Biology / methods*
  • Drug Evaluation, Preclinical
  • Ligands
  • Models, Molecular
  • Neural Networks, Computer*
  • Protein Conformation
  • Proteins / chemistry
  • Proteins / metabolism*
  • User-Computer Interface

Substances

  • Ligands
  • Proteins