Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world

View ORCID ProfileJody Phelan, Wouter Deelder, Daniel Ward, Susana Campino, Martin L. Hibberd, Taane G Clark
doi: https://doi.org/10.1101/2020.04.28.066977
Jody Phelan
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jody Phelan
Wouter Deelder
2Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
3Dalberg Advisors, 7 Rue de Chantepoulet, CH-1201 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel Ward
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Susana Campino
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Martin L. Hibberd
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Taane.clark@lshtm.ac.uk martin.hibberd@lshtm.ac.uk
Taane G Clark
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
2Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Taane.clark@lshtm.ac.uk martin.hibberd@lshtm.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Background SARS-CoV-2 most likely evolved from a bat beta-coronavirus and started infecting humans in December 2019. Since then it has rapidly infected people around the world, with more than 3 million confirmed cases by the end of April 2020. Early genome sequencing of the virus has enabled the development of molecular diagnostics and the commencement of therapy and vaccine development. The analysis of the early sequences showed relatively few evolutionary selection pressures. However, with the rapid worldwide expansion into diverse human populations, significant genetic variations are becoming increasingly likely. The current limitations on social movement between countries also offers the opportunity for these viral variants to become distinct strains with potential implications for diagnostics, therapies and vaccines.

Methods We used the current sequencing archives (NCBI and GISAID) to investigate 5,349 whole genomes, looking for evidence of strain diversification and selective pressure.

Results We used 3,958 SNPs to build a phylogenetic tree of SARS-CoV-2 diversity and noted strong evidence for the existence of two major clades and six sub-clades, unevenly distributed across the world. We also noted that convergent evolution has potentially occurred across several locations in the genome, showing selection pressures, including on the spike glycoprotein where we noted a potentially critical mutation that could affect its binding to the ACE2 receptor. We also report on mutations that could prevent current molecular diagnostics from detecting some of the sub-clades.

Conclusions The worldwide whole genome sequencing effort is revealing the challenge of developing SARS-CoV-2 containment tools suitable for everyone and the need for data to be continually evaluated to ensure accuracy in outbreak estimations.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • https://www.gisaid.org/

  • https://submit.ncbi.nlm.nih.gov/sarscov2/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Back to top
PreviousNext
Posted April 29, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world
Jody Phelan, Wouter Deelder, Daniel Ward, Susana Campino, Martin L. Hibberd, Taane G Clark
bioRxiv 2020.04.28.066977; doi: https://doi.org/10.1101/2020.04.28.066977
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world
Jody Phelan, Wouter Deelder, Daniel Ward, Susana Campino, Martin L. Hibberd, Taane G Clark
bioRxiv 2020.04.28.066977; doi: https://doi.org/10.1101/2020.04.28.066977

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4223)
  • Biochemistry (9101)
  • Bioengineering (6748)
  • Bioinformatics (23929)
  • Biophysics (12080)
  • Cancer Biology (9488)
  • Cell Biology (13725)
  • Clinical Trials (138)
  • Developmental Biology (7614)
  • Ecology (11653)
  • Epidemiology (2066)
  • Evolutionary Biology (15471)
  • Genetics (10613)
  • Genomics (14289)
  • Immunology (9454)
  • Microbiology (22773)
  • Molecular Biology (9065)
  • Neuroscience (48824)
  • Paleontology (354)
  • Pathology (1479)
  • Pharmacology and Toxicology (2560)
  • Physiology (3820)
  • Plant Biology (8307)
  • Scientific Communication and Education (1467)
  • Synthetic Biology (2287)
  • Systems Biology (6168)
  • Zoology (1297)