Backbone cluster identification in proteins by a graph theoretical method

Biophys Chem. 2000 Feb 14;84(1):13-25. doi: 10.1016/s0301-4622(99)00134-9.

Abstract

A graph theoretical algorithm has been developed to identify backbone clusters of residues in proteins. The identified clusters show protein sites with the highest degree of interactions. An adjacency matrix is constructed from the non-bonded connectivity information in proteins. The diagonalization of such a matrix yields eigenvalues and eigenvectors, which contain the information on clusters. In graph theory, distinct clusters can be obtained from the second lowest eigenvector components of the matrix. However, in an interconnected graph, all the points appear as one single cluster. We have developed a method of identifying highly interacting centers (clusters) in proteins by truncating the vector components of high eigenvalues. This paper presents in detail the method adopted for identifying backbone clusters and the application of the algorithm to families of proteins like RNase-A and globin. The objective of this study was to show the efficiency of the algorithm as well as to detect conserved or similar backbone packing regions in a particular protein family. Three clusters in topologically similar regions in the case of the RNase-A family and three clusters around the porphyrin ring in the globin family were observed. The predicted clusters are consistent with the features of the family of proteins such as the topology and packing density. The method can be applied to problems such as identification of domains and recognition of structural similarities in proteins.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Animals
  • Aplysia
  • Binding Sites
  • Cattle
  • Cluster Analysis*
  • Data Display
  • Globins / chemistry*
  • Humans
  • Molecular Sequence Data
  • Protein Conformation
  • Protein Folding*
  • Protein Structure, Secondary
  • Rana pipiens
  • Ribonuclease, Pancreatic / chemistry*
  • Seals, Earless
  • Sequence Homology, Amino Acid

Substances

  • Globins
  • Ribonuclease, Pancreatic