ABSTRACT
We present a structure-based method for finding and evaluating structural similarities in protein regions relevant to ligand binding. PDBspheres comprises an exhaustive library of protein structure regions (“spheres”) adjacent to complexed ligands derived from the Protein Data Bank (PDB), along with methods to find and evaluate structural matches between a protein of interest and spheres in the library. PDBspheres uses the LGA structure alignment algorithm as the main engine for detecting structure similarities between the protein of interest and template spheres from the library, which currently contains more than 2 million spheres. To assess confidence in structural matches an all-atom-based similarity metric takes sidechain placement into account. Here, we describe the PDBspheres method, demonstrate its ability to detect and characterize binding sites in protein structures, show how PDBspheres - a strictly structure-based method - performs on a curated dataset of 2,528 ligand-bound and ligand-free crystal structures, and use PDBspheres to cluster pockets and assess structural similarities among protein binding sites of 4,876 structures in the “refined set” of the PDBbind 2019 dataset.
Competing Interest Statement
The authors have declared no competing interest.