Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

RecPD: A Recombination-Aware Measure of Phylogenetic Diversity

Cedoljub Bundalovic-Torma, Darrell Desveaux, View ORCID ProfileDavid S. Guttman
doi: https://doi.org/10.1101/2021.10.01.462747
Cedoljub Bundalovic-Torma
aDepartment of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Darrell Desveaux
aDepartment of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
bCentre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
David S. Guttman
aDepartment of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
bCentre for the Analysis of Genome Evolution & Function, University of Toronto, Toronto, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for David S. Guttman
  • For correspondence: david.guttman@utoronto.ca
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

ABSTRACT

A critical step in studying biological features (e.g., genetic variants, gene families, metabolic capabilities, or taxa) underlying traits or outcomes of interest is assessing their diversity and distribution. Accurate assessments of these patterns are essential for linking features to traits or outcomes and understanding their functional impact. Consequently, it is of crucial importance that the metrics employed for quantifying feature diversity can perform robustly under any evolutionary scenario. However, the standard metrics used for quantifying and comparing the distribution of features, such as prevalence, phylogenetic diversity, and related approaches, either do not take into consideration evolutionary history, or assume strictly vertical patterns of inheritance. Consequently, these approaches cannot accurately assess diversity for features that have undergone recombination or horizontal transfer. To address this issue, we have devised RecPD, a novel recombination-aware phylogenetic-diversity metric for measuring the distribution and diversity of features under all evolutionary scenarios. RecPD utilizes ancestral-state reconstruction to map the presence / absence of features onto ancestral nodes in a species tree, and then identifies potential recombination events in the evolutionary history of the feature. We also derive a number of related metrics from RecPD that can be used to assess and quantify evolutionary dynamics and correlation of feature evolutionary histories. We used simulation studies to show that RecPD reliably identifies evolutionary histories under diverse recombination and loss scenarios. We then apply RecPD in a real-world scenario in a preliminary study type III effector protein families secreted by the plant pathogenic bacterium Pseudomonas syringae and demonstrate that prevalence is an inadequate metric that obscures the potential impact of recombination. We believe RecPD will have broad utility for revealing and quantifying complex evolutionary processes for features at any biological level.

AUTHOR SUMMARY Phylogenetic diversity is an important concept utilized in evolutionary ecology which has extensive applications in population genetics to help us understand how evolutionary processes have distributed genetic variation among individuals of a species, and how this impacts phenotypic diversification over time. However, existing approaches for studying phylogenetic diversity largely assume that the genetic features follow vertical inheritance, which is frequently violated in the case of microbial genomes due to horizontal transfer. To address this shortcoming, we present RecPD, a recombination-aware phylogenetic diversity metric, which incorporates ancestral state reconstruction to quantify the phylogenetic diversity of genetic features mapped onto a species phylogeny. Through simulation experiments we show that RecPD robustly reconstructs the evolutionary histories of features evolving under various scenarios of recombination and loss. When applied to a real-world example of type III secreted effector protein families from the plant pathogenic bacterium Pseudomonas syringae, RecPD reveals that horizontal transfer has played an important role in shaping the phylogenetic distributions of aa substantial proportion of families across the P. syringae species complex. Furthermore, we demonstrate that the traditional measures of feature prevalence are unsuitable as a metric for comparing feature diversity.

Competing Interest Statement

The authors have declared no competing interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted October 01, 2021.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
RecPD: A Recombination-Aware Measure of Phylogenetic Diversity
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
RecPD: A Recombination-Aware Measure of Phylogenetic Diversity
Cedoljub Bundalovic-Torma, Darrell Desveaux, David S. Guttman
bioRxiv 2021.10.01.462747; doi: https://doi.org/10.1101/2021.10.01.462747
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
RecPD: A Recombination-Aware Measure of Phylogenetic Diversity
Cedoljub Bundalovic-Torma, Darrell Desveaux, David S. Guttman
bioRxiv 2021.10.01.462747; doi: https://doi.org/10.1101/2021.10.01.462747

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (3603)
  • Biochemistry (7570)
  • Bioengineering (5526)
  • Bioinformatics (20798)
  • Biophysics (10329)
  • Cancer Biology (7985)
  • Cell Biology (11640)
  • Clinical Trials (138)
  • Developmental Biology (6606)
  • Ecology (10205)
  • Epidemiology (2065)
  • Evolutionary Biology (13620)
  • Genetics (9542)
  • Genomics (12847)
  • Immunology (7921)
  • Microbiology (19543)
  • Molecular Biology (7660)
  • Neuroscience (42113)
  • Paleontology (308)
  • Pathology (1258)
  • Pharmacology and Toxicology (2202)
  • Physiology (3267)
  • Plant Biology (7042)
  • Scientific Communication and Education (1294)
  • Synthetic Biology (1951)
  • Systems Biology (5426)
  • Zoology (1117)