A New Orthology Assessment Method for Phylogenomic Data: Unrooted Phylogenetic Orthology

Mol Biol Evol. 2016 Aug;33(8):2117-34. doi: 10.1093/molbev/msw069. Epub 2016 Apr 6.

Abstract

Current sequencing technologies are making available unprecedented amounts of genetic data for a large variety of species including nonmodel organisms. Although many phylogenomic surveys spend considerable time finding orthologs from the wealth of sequence data, these results do not transcend the original study and after being processed for specific phylogenetic purposes these orthologs do not become stable orthology hypotheses. We describe a procedure to detect and document the phylogenetic distribution of orthologs allowing researchers to use this information to guide selection of loci best suited to test specific evolutionary questions. At the core of this pipeline is a new phylogenetic orthology method that is neither affected by the position of the root nor requires explicit assignment of outgroups. We discuss the properties of this new orthology assessment method and exemplify its utility for phylogenomics using a small insects dataset. In addition, we exemplify the pipeline to identify and document stable orthologs for the group of orb-weaving spiders (Araneoidea) using RNAseq data. The scripts used in this study, along with sample files and additional documentation, are available at https://github.com/ballesterus/UPhO.

Keywords: Araneae; Markov cluster; genomics.; protein homology; spiders; transcriptomics.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Animals
  • Aphids / genetics
  • Evolution, Molecular
  • Genomics / methods*
  • Pediculus / genetics
  • Phylogeny
  • Sequence Analysis, DNA / methods*
  • Software