Looking at enzymes from the inside out: the proximity of catalytic residues to the molecular centroid can be used for detection of active sites and enzyme-ligand interfaces

J Mol Biol. 2005 Aug 12;351(2):309-26. doi: 10.1016/j.jmb.2005.06.047.

Abstract

Analysis of the distances of the exposed residues in 175 enzymes from the centroids of the molecules indicates that catalytic residues are very often found among the 5% of residues closest to the enzyme centroid. This property of catalytic residues is implemented in a new prediction algorithm (named EnSite) for locating the active sites of enzymes and in a new scheme for re-ranking enzyme-ligand docking solutions. EnSite examines only 5% of the molecular surface (represented by surface dots) that is closest to the centroid, identifying continuous surface segments and ranking them by their area size. EnSite ranks the correct prediction 1-4 in 97% of the cases in a dataset of 65 monomeric enzymes (rank 1 for 89% of the cases) and in 86% of the cases in a dataset of 176 monomeric and multimeric enzymes from all six top-level enzyme classifications (rank 1 in 74% of the cases). Importantly, identification of buried or flat active sites is straightforward because EnSite "looks" at the molecular surface from the inside out. Detailed examination of the results indicates that the proximity of the catalytic residues to the centroid is a property of the functional unit, defined as the assembly of domains or chains that form the active site (in most cases the functional unit corresponds to a single whole polypeptide chain). Using the functional unit in the prediction further improves the results. The new property of active sites is also used for re-evaluating enzyme-inhibitor unbound docking results. Sorting the docking solutions by the distance of the interface to the centroid of the enzyme improves remarkably the ranks of nearly correct solutions compared to ranks based on geometric-electrostatic-hydrophobic complementarity scores.

MeSH terms

  • Algorithms
  • Bacterial Proteins / chemistry
  • Binding Sites
  • Catalysis
  • Catalytic Domain
  • Cluster Analysis
  • Databases as Topic
  • Databases, Protein
  • Enzymes / chemistry*
  • Ligands
  • Models, Molecular
  • Protein Binding
  • Protein Conformation
  • Protein Engineering
  • Protein Structure, Tertiary
  • Proteomics / methods
  • Solvents
  • Static Electricity

Substances

  • Bacterial Proteins
  • Enzymes
  • Ligands
  • Solvents