Reducing model complexity of the general Markov model of evolution

Mol Biol Evol. 2011 Nov;28(11):3045-59. doi: 10.1093/molbev/msr128. Epub 2011 May 18.

Abstract

The selection of an optimal model for data analysis is an important component of model-based molecular phylogenetic studies. Owing to the large number of Markov models that can be used for data analysis, model selection is a combinatorial problem that cannot be solved by performing an exhaustive search of all possible models. Currently, model selection is based on a small subset of the available Markov models, namely those that assume the evolutionary process to be globally stationary, reversible, and homogeneous. This forces the optimal model to be time reversible even though the actual data may not satisfy these assumptions. This problem can be alleviated by including more complex models during the model selection. We present a novel heuristic that evaluates a small fraction of these complex models and identifies the optimal model.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Classification / methods
  • Evolution, Molecular*
  • Markov Chains*
  • Models, Genetic*
  • Phylogeny*