Genome-wide ultraconserved elements exhibit higher phylogenetic informativeness than traditional gene markers in percomorph fishes

Mol Phylogenet Evol. 2015 Nov:92:140-6. doi: 10.1016/j.ympev.2015.05.027. Epub 2015 Jun 12.

Abstract

Ultraconserved elements (UCEs) have become popular markers in phylogenomic studies because of their cost effectiveness and their potential to resolve problematic phylogenetic relationships. Although UCE datasets typically contain a much larger number of loci and sites than more traditional datasets of PCR-amplified, single-copy, protein coding genes, a fraction of UCE sites are expected to be part of a nearly invariant core, and the relative performance of UCE datasets versus protein coding gene datasets is poorly understood. Here we use phylogenetic informativeness (PI) to compare the resolving power of multi-locus and UCE datasets in a sample of percomorph fishes with sequenced genomes (genome-enabled). We compare three data sets: UCE core regions, flanking sequence adjacent to the UCE core and a set of ten protein coding genes commonly used in fish systematics. We found the net informativeness of UCE core and flank regions to be roughly ten-fold and 100-fold more informative than that of the protein coding genes. On a per locus basis UCEs and protein coding genes exhibited similar levels of phylogenetic informativeness. Our results suggest that UCEs offer enormous potential for resolving relationships across the percomorph tree of life.

Keywords: Molecular evolution; Next-generation sequencing; Non-coding DNA; Phylogenetic informativeness; Phylogenomics; Ultraconserved elements.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Confidence Intervals
  • Conserved Sequence / genetics*
  • Databases, Genetic
  • Fishes / genetics*
  • Genetic Loci
  • Genetic Markers
  • Genome*
  • Linear Models
  • Nucleotides / genetics
  • Open Reading Frames / genetics
  • Phylogeny*
  • Reproducibility of Results

Substances

  • Genetic Markers
  • Nucleotides