Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Identification of hidden population structure in time-scaled phylogenies

View ORCID ProfileErik M. Volz, View ORCID ProfileCarsten Wiuf, View ORCID ProfileYonatan H. Grad, View ORCID ProfileSimon D.W. Frost, Ann M. Dennis, View ORCID ProfileXavier Didelot
doi: https://doi.org/10.1101/704528
Erik M. Volz
1Department of Infectious Disease Epidemiology and MRC Centre for Global Infectious Disease Analysis, Imperial College London
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Erik M. Volz
  • For correspondence: e.volz@imperial.ac.uk
Carsten Wiuf
2Department of Mathematical Sciences, University of Copenhagen
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Carsten Wiuf
Yonatan H. Grad
3Department of Immunology and Infectious Diseases, TH Chan School of Public Health, Harvard University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yonatan H. Grad
Simon D.W. Frost
4Department of Veterinary Medicine, University of Cambridge
5The Alan Turing Institute
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Simon D.W. Frost
Ann M. Dennis
6School of Medicine, University of North Carolina Chapel Hill
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xavier Didelot
7School of Life Sciences and Department of Statistics, University of Warwick
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Xavier Didelot
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Population structure influences genealogical patterns, however data pertaining to how populations are structured are often unavailable or not directly observable. Inference of population structure is highly important in molecular epidemiology where pathogen phylogenetics is increasingly used to infer transmission patterns and detect outbreaks. Discrepancies between observed and idealised genealogies, such as those generated by the coalescent process, can be quantified, and where significant differences occur, may reveal the action of natural selection, host population structure, or other demographic and epidemiological heterogeneities. We have developed a fast non-parametric statistical test for detection of cryptic population structure in time-scaled phylogenetic trees. The test is based on contrasting estimated phylogenies with the theoretically expected phylodynamic ordering of common ancestors in two clades within a coalescent framework. These statistical tests have also motivated the development of algorithms which can be used to quickly screen a phylogenetic tree for clades which are likely to share a distinct demographic or epidemiological history. Epidemiological applications include identification of outbreaks in vulnerable host populations or rapid expansion of genotypes with a fitness advantage. To demonstrate the utility of these methods for outbreak detection, we applied the new methods to large phylogenies reconstructed from thousands of HIV-1 partial pol sequences. This revealed the presence of clades which had grown rapidly in the recent past, and was significantly concentrated in young men, suggesting recent and rapid transmission in that group. Furthermore, to demonstrate the utility of these methods for the study of antimicrobial resistance, we applied the new methods to a large phylogeny reconstructed from whole genome Neisseria gonorrhoeae sequences. We find that population structure detected using these methods closely overlaps with the appearance and expansion of mutations conferring antimicrobial resistance.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted January 10, 2020.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Identification of hidden population structure in time-scaled phylogenies
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Identification of hidden population structure in time-scaled phylogenies
Erik M. Volz, Carsten Wiuf, Yonatan H. Grad, Simon D.W. Frost, Ann M. Dennis, Xavier Didelot
bioRxiv 704528; doi: https://doi.org/10.1101/704528
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Identification of hidden population structure in time-scaled phylogenies
Erik M. Volz, Carsten Wiuf, Yonatan H. Grad, Simon D.W. Frost, Ann M. Dennis, Xavier Didelot
bioRxiv 704528; doi: https://doi.org/10.1101/704528

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Evolutionary Biology
Subject Areas
All Articles
  • Animal Behavior and Cognition (4222)
  • Biochemistry (9096)
  • Bioengineering (6742)
  • Bioinformatics (23922)
  • Biophysics (12069)
  • Cancer Biology (9484)
  • Cell Biology (13722)
  • Clinical Trials (138)
  • Developmental Biology (7614)
  • Ecology (11646)
  • Epidemiology (2066)
  • Evolutionary Biology (15468)
  • Genetics (10613)
  • Genomics (14285)
  • Immunology (9451)
  • Microbiology (22756)
  • Molecular Biology (9057)
  • Neuroscience (48817)
  • Paleontology (354)
  • Pathology (1478)
  • Pharmacology and Toxicology (2559)
  • Physiology (3819)
  • Plant Biology (8307)
  • Scientific Communication and Education (1467)
  • Synthetic Biology (2285)
  • Systems Biology (6165)
  • Zoology (1296)