The effects of partitioning on phylogenetic inference

Mol Biol Evol. 2015 Jun;32(6):1611-27. doi: 10.1093/molbev/msv026. Epub 2015 Feb 6.

Abstract

Partitioning is a commonly used method in phylogenetics that aims to accommodate variation in substitution patterns among sites. Despite its popularity, there have been few systematic studies of its effects on phylogenetic inference, and there have been no studies that compare the effects of different approaches to partitioning across many empirical data sets. In this study, we applied four commonly used approaches to partitioning to each of 34 empirical data sets, and then compared the resulting tree topologies, branch-lengths, and bootstrap support estimated using each approach. We find that the choice of partitioning scheme often affects tree topology, particularly when partitioning is omitted. Most notably, we find occasional instances where the use of a suboptimal partitioning scheme produces highly supported but incorrect nodes in the tree. Branch-lengths and bootstrap support are also affected by the choice of partitioning scheme, sometimes dramatically so. We discuss the reasons for these effects and make some suggestions for best practice.

Keywords: molecular evolution; partitioning; tree inference; tree topology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Classification / methods*
  • Databases, Genetic
  • Empirical Research
  • Evolution, Molecular
  • Models, Genetic*
  • Phylogeny*