Efficient sampling for Bayesian inference of conjunctive Bayesian networks

Bioinformatics. 2012 Sep 15;28(18):2318-24. doi: 10.1093/bioinformatics/bts433. Epub 2012 Jul 10.

Abstract

Motivation: Cancer development is driven by the accumulation of advantageous mutations and subsequent clonal expansion of cells harbouring these mutations, but the order in which mutations occur remains poorly understood. Advances in genome sequencing and the soon-arriving flood of cancer genome data produced by large cancer sequencing consortia hold the promise to elucidate cancer progression. However, new computational methods are needed to analyse these large datasets.

Results: We present a Bayesian inference scheme for Conjunctive Bayesian Networks, a probabilistic graphical model in which mutations accumulate according to partial order constraints and cancer genotypes are observed subject to measurement noise. We develop an efficient MCMC sampling scheme specifically designed to overcome local optima induced by dependency structures. We demonstrate the performance advantage of our sampler over traditional approaches on simulated data and show the advantages of adopting a Bayesian perspective when reanalyzing cancer datasets and comparing our results to previous maximum-likelihood-based approaches.

Availability: An R package including the sampler and examples is available at http://www.cbg.ethz.ch/software/bayes-cbn.

Contacts: niko.beerenwinkel@bsse.ethz.ch.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Genomics
  • Genotype
  • Humans
  • Likelihood Functions
  • Models, Statistical*
  • Mutation
  • Neoplasms / genetics*