Bayesian model adequacy and choice in phylogenetics

Jonathan P Bollback

doi:10.1093/oxfordjournals.molbev.a004175

Bayesian model adequacy and choice in phylogenetics

Mol Biol Evol. 2002 Jul;19(7):1171-80. doi: 10.1093/oxfordjournals.molbev.a004175.

Author

Jonathan P Bollback¹

Affiliation

¹ Department of Biology, University of Rochester, NY 14627, USA. bollback@brahms.biology.rochester.edu

PMID: 12082136
DOI: 10.1093/oxfordjournals.molbev.a004175

Abstract

Bayesian inference is becoming a common statistical approach to phylogenetic estimation because, among other reasons, it allows for rapid analysis of large data sets with complex evolutionary models. Conveniently, Bayesian phylogenetic methods use currently available stochastic models of sequence evolution. However, as with other model-based approaches, the results of Bayesian inference are conditional on the assumed model of evolution: inadequate models (models that poorly fit the data) may result in erroneous inferences. In this article, I present a Bayesian phylogenetic method that evaluates the adequacy of evolutionary models using posterior predictive distributions. By evaluating a model's posterior predictive performance, an adequate model can be selected for a Bayesian phylogenetic study. Although I present a single test statistic that assesses the overall (global) performance of a phylogenetic model, a variety of test statistics can be tailored to evaluate specific features (local performance) of evolutionary models to identify sources failure. The method presented here, unlike the likelihood-ratio test and parametric bootstrap, accounts for uncertainty in the phylogeny and model parameters.

Publication types

Comparative Study
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Animals
Bayes Theorem*
Computer Simulation
DNA / genetics
Evolution, Molecular*
Humans
Likelihood Functions
Models, Genetic
Models, Statistical
Phylogeny*
Probability

Substances

DNA