CodonPhyML: fast maximum likelihood phylogeny estimation under codon substitution models

Mol Biol Evol. 2013 Jun;30(6):1270-80. doi: 10.1093/molbev/mst034. Epub 2013 Feb 23.

Abstract

Markov models of codon substitution naturally incorporate the structure of the genetic code and the selection intensity at the protein level, providing a more realistic representation of protein-coding sequences compared with nucleotide or amino acid models. Thus, for protein-coding genes, phylogenetic inference is expected to be more accurate under codon models. So far, phylogeny reconstruction under codon models has been elusive due to computational difficulties of dealing with high dimension matrices. Here, we present a fast maximum likelihood (ML) package for phylogenetic inference, CodonPhyML offering hundreds of different codon models, the largest variety to date, for phylogeny inference by ML. CodonPhyML is tested on simulated and real data and is shown to offer excellent speed and convergence properties. In addition, CodonPhyML includes most recent fast methods for estimating phylogenetic branch supports and provides an integral framework for models selection, including amino acid and DNA models.

Keywords: codon model; evolution; maximum likelihood; phylogeny inference; selection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bacteria
  • Codon*
  • Computational Biology / methods*
  • Computer Simulation
  • Eukaryota
  • Evolution, Molecular
  • Markov Chains
  • Models, Genetic*
  • Phylogeny*
  • Proteins / genetics
  • Selection, Genetic

Substances

  • Codon
  • Proteins