Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Quantifying bacterial evolution in the wild: a birthday problem for Campylobacter lineages

View ORCID ProfileJessica K. Calland, View ORCID ProfileBen Pascoe, View ORCID ProfileSion C. Bayliss, View ORCID ProfileEvangelos Mourkas, Elvire Berthenet, Harry A. Thorpe, Matthew D. Hitchings, Edward J. Feil, Martin J. Blaser, Daniel Falush, Samuel K. Sheppard
doi: https://doi.org/10.1101/2020.12.02.407999
Jessica K. Calland
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jessica K. Calland
Ben Pascoe
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ben Pascoe
Sion C. Bayliss
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sion C. Bayliss
Evangelos Mourkas
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Evangelos Mourkas
Elvire Berthenet
2French National Reference Center for Campylobacters and Helicobacters, University of Bordeaux, 33076, Bordeaux, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Harry A. Thorpe
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew D. Hitchings
3Institute of Life Sciences, Swansea University Medical School, Swansea University, Singleton Park, Swansea, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Edward J. Feil
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Martin J. Blaser
4Center for Advanced Biotechnology and Medicine, Rutgers University, New Brunswick, New Jersey, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel Falush
5Centre for Microbes, Development and Health, Institute Pasteur Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: s.k.sheppard@bath.ac.uk daniel.falush@ips.ac.cn
Samuel K. Sheppard
1The Milner Centre for Evolution, University of Bath, Claverton down, Bath, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: s.k.sheppard@bath.ac.uk daniel.falush@ips.ac.cn
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Measuring molecular evolution in bacteria typically requires estimation of the rate at which nucleotide changes accumulate in strains sampled at different times that share a common ancestor. This approach has been useful for dating ecological and evolutionary events that coincide with the emergence of important lineages, such as outbreak strains and obligate human pathogens. However, in multi-host (niche) transmission scenarios, where the pathogen is essentially an opportunistic environmental organism, sampling is often sporadic and rarely reflects the overall population, particularly when concentrated on clinical isolates. This means that approaches that assume recent common ancestry are not applicable. Here we present a new approach to estimate the molecular clock rate in Campylobacter that draws on the popular probability conundrum known as the ‘birthday problem’. Using large genomic datasets and comparative genomic approaches, we use isolate pairs that share recent common ancestry to estimate the rate of nucleotide change for the population. Identifying synonymous and non-synonymous nucleotide changes, both within and outside of recombined regions of the genome, we quantify clock-like diversification to estimate synonymous rates of nucleotide change for the common pathogenic bacteria Campylobacter coli (2.4 x 10-6 s/s/y) and Campylobacter jejuni (3.4 x 10-6 s/s/y). Finally, using estimated total rates of nucleotide change, we infer the number of effective lineages within the sample time-frame – analogous to a shared birthdays – and assess the rate of turnover of lineages in our sample set over short evolutionary timescales. This provides a generalizable approach to calibrating rates in populations of environmental bacteria and shows that multiple lineages are maintained, implying that large-scale clonal sweeps may take hundreds of years or more in these species.

Author Summary Growth and reproduction in living organisms require DNA replication but this process is error prone. Along with variation introduced by horizontal gene transfer, it can lead to alterations in the nucleotide sequence. These nucleotide changes accumulate over time in successive generations at an approximately constant rate termed the molecular clock. Therefore, if this rate is known, one can estimate the date when two or more lineages diverged. In bacteria, this can be informative for understanding the time-scale of emergence and spread of pathogenic strains. Such analyses are robust when the ancestral population is known, such as for obligate pathogens that only infect humans. However, when the bacterium inhabits multiple hosts or niches it is difficult to infer direct ancestry from one strain to another, reducing the accuracy of molecular clock estimates. Here we focus on one such multi-host organism, Campylobacter, a leading cause of food-borne gastroenteritis. Reconstructing the population history by estimating empirical nucleotide change rates from carefully selected isolate pairs, and evaluating the maintenance of multiple lineages over time, we provide information about strain diversification. Our method is a new addition to the bacterial genomics toolkit that will help in understanding the spread of opportunistic pathogens.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://www.ncbi.nlm.nih.gov/bioproject/PRJNA524315

  • https://doi.org/10.6084/m9.figshare.7886810

  • https://github.com/SionBayliss/CallandMolClock

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted May 27, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Quantifying bacterial evolution in the wild: a birthday problem for Campylobacter lineages
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Quantifying bacterial evolution in the wild: a birthday problem for Campylobacter lineages
Jessica K. Calland, Ben Pascoe, Sion C. Bayliss, Evangelos Mourkas, Elvire Berthenet, Harry A. Thorpe, Matthew D. Hitchings, Edward J. Feil, Martin J. Blaser, Daniel Falush, Samuel K. Sheppard
bioRxiv 2020.12.02.407999; doi: https://doi.org/10.1101/2020.12.02.407999
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Quantifying bacterial evolution in the wild: a birthday problem for Campylobacter lineages
Jessica K. Calland, Ben Pascoe, Sion C. Bayliss, Evangelos Mourkas, Elvire Berthenet, Harry A. Thorpe, Matthew D. Hitchings, Edward J. Feil, Martin J. Blaser, Daniel Falush, Samuel K. Sheppard
bioRxiv 2020.12.02.407999; doi: https://doi.org/10.1101/2020.12.02.407999

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4667)
  • Biochemistry (10332)
  • Bioengineering (7653)
  • Bioinformatics (26278)
  • Biophysics (13497)
  • Cancer Biology (10663)
  • Cell Biology (15392)
  • Clinical Trials (138)
  • Developmental Biology (8480)
  • Ecology (12800)
  • Epidemiology (2067)
  • Evolutionary Biology (16817)
  • Genetics (11380)
  • Genomics (15451)
  • Immunology (10591)
  • Microbiology (25141)
  • Molecular Biology (10189)
  • Neuroscience (54319)
  • Paleontology (399)
  • Pathology (1663)
  • Pharmacology and Toxicology (2889)
  • Physiology (4332)
  • Plant Biology (9223)
  • Scientific Communication and Education (1585)
  • Synthetic Biology (2552)
  • Systems Biology (6769)
  • Zoology (1459)