Robust data-driven incorporation of prior knowledge into the inference of dynamic regulatory networks

Bioinformatics. 2013 Apr 15;29(8):1060-7. doi: 10.1093/bioinformatics/btt099. Epub 2013 Mar 21.

Abstract

Motivation: Inferring global regulatory networks (GRNs) from genome-wide data is a computational challenge central to the field of systems biology. Although the primary data currently used to infer GRNs consist of gene expression and proteomics measurements, there is a growing abundance of alternate data types that can reveal regulatory interactions, e.g. ChIP-Chip, literature-derived interactions, protein-protein interactions. GRN inference requires the development of integrative methods capable of using these alternate data as priors on the GRN structure. Each source of structure priors has its unique biases and inherent potential errors; thus, GRN methods using these data must be robust to noisy inputs.

Results: We developed two methods for incorporating structure priors into GRN inference. Both methods [Modified Elastic Net (MEN) and Bayesian Best Subset Regression (BBSR)] extend the previously described Inferelator framework, enabling the use of prior information. We test our methods on one synthetic and two bacterial datasets, and show that both MEN and BBSR infer accurate GRNs even when the structure prior used has significant amounts of error (>90% erroneous interactions). We find that BBSR outperforms MEN at inferring GRNs from expression data and noisy structure priors.

Availability and implementation: Code, datasets and networks presented in this article are available at http://bonneaulab.bio.nyu.edu/software.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Bacillus subtilis / genetics
  • Bacillus subtilis / metabolism
  • Bayes Theorem
  • Escherichia coli / genetics
  • Escherichia coli / metabolism
  • Gene Expression
  • Gene Regulatory Networks*
  • Models, Genetic
  • Regression Analysis
  • Systems Biology / methods
  • Transcription Factors / metabolism

Substances

  • Transcription Factors