Labeling nodes using three degrees of propagation

PLoS One. 2012;7(12):e51947. doi: 10.1371/journal.pone.0051947. Epub 2012 Dec 28.

Abstract

The properties (or labels) of nodes in networks can often be predicted based on their proximity and their connections to other labeled nodes. So-called "label propagation algorithms" predict the labels of unlabeled nodes by propagating information about local label density iteratively through the network. These algorithms are fast, simple and scale to large networks but nonetheless regularly perform better than slower and much more complex algorithms on benchmark problems. We show here, however, that these algorithms have an intrinsic limitation that prevents them from adapting to some common patterns of network node labeling; we introduce a new algorithm, 3Prop, that retains all their advantages but is much more adaptive. As we show, 3Prop performs very well on node labeling problems ill-suited to label propagation, including predicting gene function in protein and genetic interaction networks and gender in friendship networks, and also performs slightly better on problems already well-suited to label propagation such as labeling blogs and patents based on their citation networks. 3Prop gains its adaptability by assigning separate weights to label information from different steps of the propagation. Surprisingly, we found that for many networks, the third iteration of label propagation receives a negative weight.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Gene Regulatory Networks
  • Humans
  • Models, Theoretical*
  • Protein Binding
  • Protein Interaction Mapping
  • Proteins / genetics
  • Proteins / metabolism*
  • Social Networking

Substances

  • Proteins

Grants and funding

SM, AG, and QM gratefully acknowledge the support from an NSERC operating grant to QM. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.