Development and implementation of an algorithm for detection of protein complexes in large interaction networks

BMC Bioinformatics. 2006 Apr 14;7:207. doi: 10.1186/1471-2105-7-207.

Abstract

Background: With a number of genomes now completely sequenced, the focus has turned to proteomics. Advanced proteomics technologies such as the two-hybrid assay and mass spectrometry are producing huge data sets of protein-protein interactions, which can be portrayed as networks. A pressing problem is to find protein complexes in such networks. The enormous size of protein-protein interaction (PPI) networks warrants the development of efficient computational methods for extracting significant complexes.

Results: This paper presents an algorithm for detection of protein complexes in large interaction networks. In a PPI network, a node represents a protein and an edge represents an interaction. The input to the algorithm is the associated matrix of an interaction network and the outputs are protein complexes. The complexes are determined by finding clusters, i.e. densely connected regions of the network. We also show and analyze some protein complexes generated by the proposed algorithm from typical PPI networks of Escherichia coli and Saccharomyces cerevisiae. A comparison between a PPI network and a random network is also performed in the context of the proposed algorithm.
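
The abstract does not give the steps of the algorithm, so the following is only a minimal sketch of the general idea of finding a densely connected region from a matrix representation of a network. It is not the authors' method. It assumes the "associated matrix" can be treated as a symmetric 0/1 adjacency matrix, and the function names `density` and `greedy_dense_cluster`, the `min_density` threshold, and the toy network are all hypothetical.

```python
from itertools import combinations


def density(adj, nodes):
    """Fraction of possible edges that are present among `nodes` (0.0 to 1.0)."""
    nodes = list(nodes)
    n = len(nodes)
    if n < 2:
        return 0.0
    edges = sum(1 for u, v in combinations(nodes, 2) if adj[u][v])
    return 2.0 * edges / (n * (n - 1))


def greedy_dense_cluster(adj, seed, min_density=0.8):
    """Grow a cluster from `seed` (illustrative greedy heuristic, an assumption).

    At each step, add the neighbouring node with the most links into the
    current cluster, provided the cluster density stays >= min_density.
    Stop when no neighbour qualifies.
    """
    cluster = {seed}
    while True:
        # Nodes outside the cluster that interact with at least one member.
        candidates = {
            v
            for u in cluster
            for v in range(len(adj))
            if adj[u][v] and v not in cluster
        }
        best, best_links = None, 0
        for v in sorted(candidates):  # sorted for deterministic tie-breaking
            links = sum(1 for u in cluster if adj[u][v])
            if links > best_links and density(adj, cluster | {v}) >= min_density:
                best, best_links = v, links
        if best is None:
            return cluster
        cluster.add(best)


# Toy network (made up): proteins 0-3 all interact with each other,
# and a sparse chain 3-4-5 hangs off the dense region.
adj = [
    [0, 1, 1, 1, 0, 0],
    [1, 0, 1, 1, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [1, 1, 1, 0, 1, 0],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 0, 1, 0],
]

cluster = greedy_dense_cluster(adj, seed=0, min_density=0.8)
print(sorted(cluster))
```

In this toy case the cluster stops growing at the fully connected group, because adding protein 4 would drop the density to 0.7, below the threshold. A real implementation would also need a rule for choosing seeds and for handling overlapping clusters, neither of which the abstract describes.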

Conclusion: The proposed algorithm makes it possible to detect clusters of proteins in PPI networks, most of which represent molecular biological functional units. Protein complexes determined solely from interaction data can therefore help to predict the functions of proteins, and they are also useful for understanding and explaining certain biological processes.

MeSH terms

  • Algorithms*
  • Cell Physiological Phenomena*
  • Cluster Analysis*
  • Computer Simulation
  • Escherichia coli / metabolism
  • Escherichia coli Proteins / metabolism
  • Models, Biological*
  • Pattern Recognition, Automated
  • Protein Interaction Mapping / methods*
  • Proteome / metabolism*
  • Saccharomyces cerevisiae / metabolism
  • Saccharomyces cerevisiae Proteins / metabolism
  • Signal Transduction / physiology*

Substances

  • Escherichia coli Proteins
  • Proteome
  • Saccharomyces cerevisiae Proteins