Variability in docking success rates due to dataset preparation

J Comput Aided Mol Des. 2012 Jun;26(6):775-86. doi: 10.1007/s10822-012-9570-1. Epub 2012 May 8.

Abstract

The results of cognate docking with the prepared Astex dataset provided by the organizers of the "Docking and Scoring: A Review of Docking Programs" session at the 241st ACS National Meeting are presented. The MOE software with the newly developed GBVI/WSA dG scoring function is used throughout the study. For 80% of the Astex targets, the MOE docker produces a top-scoring pose within 2 Å of the X-ray structure. For 91% of the targets, a pose within 2 Å of the X-ray structure is found among the top 30 poses. Docking failures, defined as cases where the top-scoring pose is more than 2 Å from the experimental structure, are shown to be largely due to the absence of bound waters in the source dataset, highlighting the need to include this and other crucial information in future standardized sets. Docking success is shown to depend heavily on data preparation. A "dataset preparation" error of 0.5 kcal/mol is shown to cause fluctuations of over 20% in docking success rates.
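The success criteria in the abstract (top-scoring pose within 2 Å, or any pose within 2 Å among the top N) can be illustrated with a minimal Python sketch. It is not the authors' protocol and does not call MOE. The function names, the toy data, and the Gaussian noise model used to mimic a 0.5 kcal/mol "dataset preparation" error are all assumptions made for illustration, as is the convention that a lower score means a better pose.

```python
import random

def success_rate(poses_by_target, cutoff=2.0, top_n=1):
    """Fraction of targets with a pose at or below `cutoff` Å RMSD
    among the `top_n` best-scored poses.

    poses_by_target maps a target name to a list of (score, rmsd) pairs.
    Assumption: lower score = better pose.
    """
    hits = 0
    for poses in poses_by_target.values():
        ranked = sorted(poses, key=lambda p: p[0])  # best score first
        if any(rmsd <= cutoff for _, rmsd in ranked[:top_n]):
            hits += 1
    return hits / len(poses_by_target)

def perturbed_success_rates(poses_by_target, sigma=0.5, trials=100, seed=0):
    """Re-rank poses after adding Gaussian noise (std. dev. `sigma`, in the
    score's units) to every score, and return the (min, max) top-1 success
    rate over `trials` repetitions. The noise model is hypothetical.
    """
    rng = random.Random(seed)
    rates = []
    for _ in range(trials):
        noisy = {
            target: [(score + rng.gauss(0.0, sigma), rmsd) for score, rmsd in poses]
            for target, poses in poses_by_target.items()
        }
        rates.append(success_rate(noisy))
    return min(rates), max(rates)

# Made-up (score, RMSD in Å) pairs for four hypothetical targets.
toy = {
    "target_a": [(-9.1, 0.8), (-8.9, 4.2), (-7.0, 1.5)],
    "target_b": [(-7.4, 3.6), (-7.3, 1.1), (-6.0, 5.0)],
    "target_c": [(-10.2, 1.9), (-6.5, 6.3)],
    "target_d": [(-8.0, 2.7), (-7.9, 3.1), (-5.5, 1.4)],
}

top1 = success_rate(toy)            # top-scoring pose only
top3 = success_rate(toy, top_n=3)   # any of the three best-scored poses
low, high = perturbed_success_rates(toy, sigma=0.5)
print(f"top-1: {top1:.2f}, top-3: {top3:.2f}, noisy top-1 range: {low:.2f}-{high:.2f}")
```

In the toy data, target_b has a correct pose scored only 0.1 units worse than an incorrect one, so a small score perturbation can flip its ranking. This is the kind of sensitivity the abstract attributes to dataset preparation, though the magnitudes here say nothing about the published results.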

MeSH terms

  • Algorithms*
  • Binding Sites
  • Computer Simulation
  • Crystallography, X-Ray
  • Hydrogen Bonding
  • Ligands*
  • Models, Molecular
  • Protein Binding
  • Protein Conformation
  • Proteins / chemistry*
  • Software*

Substances

  • Ligands
  • Proteins