Guided tree topology proposals for Bayesian phylogenetic inference

Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.

Abstract

Increasingly, large data sets pose a challenge for computationally intensive phylogenetic methods such as Bayesian Markov chain Monte Carlo (MCMC). Here, we investigate the performance of common MCMC proposal distributions in terms of median and variance of run time to convergence on 11 data sets. We introduce two new Metropolized Gibbs Samplers for moving through "tree space." MCMC simulation using these new proposals shows faster average run time and dramatically improved predictability in performance, with a 20-fold reduction in the variance of the time to estimate the posterior distribution to a given accuracy. We also introduce conditional clade probabilities and demonstrate that they provide a superior means of approximating tree topology posterior probabilities from samples recorded during MCMC.

Publication types

  • Evaluation Study

MeSH terms

  • Algorithms
  • Bayes Theorem*
  • Classification / methods*
  • Data Interpretation, Statistical
  • Markov Chains
  • Models, Genetic
  • Monte Carlo Method
  • Phylogeny
  • Probability