Practical performance of tree comparison metrics

Syst Biol. 2015 Mar;64(2):205-14. doi: 10.1093/sysbio/syu085. Epub 2014 Nov 4.

Abstract

The phylogenetic literature contains numerous measures for assessing differences between two phylogenetic trees. Individual measures have been criticized on various grounds, but little is known about their comparative performance in typical applications. We evaluate the performance of nine tree distance measures on two tasks: 1) distinguishing trees separated by lesser versus greater numbers of recombinations, and 2) distinguishing trees inferred with lower versus higher quality data. We find that when the trees being compared are similar, measures that make use of branch lengths are superior, with the branch-length version of the Robinson-Foulds metric performing best. In contrast, for dissimilar trees topology-only measures are superior, with the Alignment metric of Nye et al. performing best. We also apply the measures to a mammalian dataset and observe that the best metric depends on whether branch-length information is of interest. We give practical recommendations for choosing a tree distance metric in different applications.

Keywords: Phylogenetics; tree comparison; tree distance metrics.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Classification / methods*
  • Computer Simulation
  • Mammals / classification
  • Mammals / genetics
  • Phylogeny*