Predicting protein function from protein/protein interaction data: a probabilistic approach

Bioinformatics. 2003:19 Suppl 1:i197-204. doi: 10.1093/bioinformatics/btg1026.

Abstract

Motivation: The development of experimental methods for genome scale analysis of molecular interaction networks has made possible new approaches to inferring protein function. This paper describes a method of assigning functions based on a probabilistic analysis of graph neighborhoods in a protein-protein interaction network. The method exploits the fact that graph neighbors are more likely to share functions than nodes which are not neighbors. A binomial model of local neighbor function labeling probability is combined with a Markov random field propagation algorithm to assign function probabilities for proteins in the network.

Results: We applied the method to a protein-protein interaction dataset for the yeast Saccharomyces cerevisiae using the Gene Ontology (GO) terms as function labels. The method reconstructed known GO term assignments with high precision, and produced putative GO assignments to 320 proteins that currently lack GO annotation, which represents about 10% of the unlabeled proteins in S. cerevisiae.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Algorithms*
  • Databases, Protein*
  • Models, Biological*
  • Models, Statistical
  • Protein Interaction Mapping / methods*
  • Saccharomyces cerevisiae / metabolism*
  • Saccharomyces cerevisiae Proteins / classification*
  • Saccharomyces cerevisiae Proteins / metabolism*
  • Structure-Activity Relationship

Substances

  • Saccharomyces cerevisiae Proteins