Prediction of protein-protein interactions using random decision forest framework

Bioinformatics. 2005 Dec 15;21(24):4394-400. doi: 10.1093/bioinformatics/bti721. Epub 2005 Oct 18.

Abstract

Motivation: Protein interactions are of biological interest because they orchestrate a number of cellular processes such as metabolic pathways and immunological recognition. Domains are the building blocks of proteins; therefore, proteins are assumed to interact as a result of their interacting domains. Many domain-based models for protein interaction prediction have been developed, and preliminary results have demonstrated their feasibility. Most of the existing domain-based methods, however, consider only single-domain pairs (one domain from one protein) and assume independence between domain-domain interactions.

Results: In this paper, we introduce a domain-based random forest of decision trees to infer protein interactions. Our proposed method is capable of exploring all possible domain interactions and making predictions based on all the protein domains. Experimental results on Saccharomyces cerevisiae dataset demonstrate that our approach can predict protein-protein interactions with higher sensitivity (79.78%) and specificity (64.38%) compared with that of the maximum likelihood approach. Furthermore, our model can be used to infer interactions not only for single-domain pairs but also for multiple domain pairs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology
  • Databases, Factual
  • Decision Theory
  • Likelihood Functions
  • Multiprotein Complexes
  • Protein Binding
  • Protein Structure, Tertiary
  • Proteins / chemistry*
  • Proteins / metabolism*
  • Saccharomyces cerevisiae / metabolism
  • Saccharomyces cerevisiae Proteins / metabolism
  • Sensitivity and Specificity

Substances

  • Multiprotein Complexes
  • Proteins
  • Saccharomyces cerevisiae Proteins