Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world

View ORCID ProfileJody Phelan, Wouter Deelder, Daniel Ward, Susana Campino, Martin L. Hibberd, Taane G Clark
doi: https://doi.org/10.1101/2020.04.28.066977
Jody Phelan
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jody Phelan
Wouter Deelder
2Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
3Dalberg Advisors, 7 Rue de Chantepoulet, CH-1201 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daniel Ward
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Susana Campino
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Martin L. Hibberd
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Taane.clark@lshtm.ac.uk martin.hibberd@lshtm.ac.uk
Taane G Clark
1Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
2Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Taane.clark@lshtm.ac.uk martin.hibberd@lshtm.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Background SARS-CoV-2 most likely evolved from a bat beta-coronavirus and started infecting humans in December 2019. Since then it has rapidly infected people around the world, with more than 3 million confirmed cases by the end of April 2020. Early genome sequencing of the virus has enabled the development of molecular diagnostics and the commencement of therapy and vaccine development. The analysis of the early sequences showed relatively few evolutionary selection pressures. However, with the rapid worldwide expansion into diverse human populations, significant genetic variations are becoming increasingly likely. The current limitations on social movement between countries also offers the opportunity for these viral variants to become distinct strains with potential implications for diagnostics, therapies and vaccines.

Methods We used the current sequencing archives (NCBI and GISAID) to investigate 5,349 whole genomes, looking for evidence of strain diversification and selective pressure.

Results We used 3,958 SNPs to build a phylogenetic tree of SARS-CoV-2 diversity and noted strong evidence for the existence of two major clades and six sub-clades, unevenly distributed across the world. We also noted that convergent evolution has potentially occurred across several locations in the genome, showing selection pressures, including on the spike glycoprotein where we noted a potentially critical mutation that could affect its binding to the ACE2 receptor. We also report on mutations that could prevent current molecular diagnostics from detecting some of the sub-clades.

Conclusions The worldwide whole genome sequencing effort is revealing the challenge of developing SARS-CoV-2 containment tools suitable for everyone and the need for data to be continually evaluated to ensure accuracy in outbreak estimations.

Background

Within the last 20 years, three ‘spill-over events’ have occurred where a coronavirus has entered the human population from an animal host and cause significant morbidity and mortality. In the most recent event, the currently emerging SARS-CoV-2 virus – associated with disease COVID-19 – has spread rapidly worldwide to infect over 3,000,000 individuals and has been associated with more than 200,000 deaths across more than 180 countries [1]. The COVID-19 pandemic is a global crisis and control strategies; including the development of PCR-based diagnostics, serological assays, monoclonal antibodies and vaccines, as well as an increased understanding of transmission dynamics, will be informed by knowledge of SARS-CoV-2 sequence data. The sequences in the receptor-binding domain (RBD) of the SARS-CoV-2 viral spike (S) protein, used to bind to cell surface receptors and gain entry into human host cells, are evolutionary close to and possibly spilled over from bat (RaTG13) -like betacoronaviruses, and appear efficient for binding to human ACE2 receptors [1,2]. Structural analysis has revealed that the SARS-CoV-2 RBD contains a core and a receptor-binding motif (RBM) that mediates contact with ACE2 [3]. Several residue changes in SARS-CoV-2 RBD stabilize two virus-binding hotspots at the RBD and human ACE2 interface and enhance the virus’s binding affinity [3]. Therefore, it has been suggested that neutralising monoclonal antibodies targeting SARS-CoV-2 RBM are promising antiviral therapies, and that the RBD itself can function as a subunit vaccine [3,4].

Whilst COVID-19 has spread from China across the globe, the routes of travel are not always clear. Moreover, with this expansion, the likelihood of mutations and viral changes increases, which may impact the efficacy of diagnostics and potentially that of future therapies and vaccines. A growing number of SARS-CoV-2 whole genomic sequences are available in the public domain [5,6] and their phylogenetic analysis can assist researchers and policymakers in answering questions about the spread and evolution of the virus. In general, the phylogeny can be used to identify the transmission network through finding samples with identical sequences, characterising mutations that barcode clades, and inform on underlying host or pathogen phenotypes through genome-wide association methods. In some pathogens, it is possible to detect selection pressure that causes phenotypic traits, through identifying signatures of convergent evolution, that is, where the same causal mutations occur independently in unrelated branches of the phylogenetic tree [7]. An increased understanding of the genomic locations that might be under selective pressure could be of value to epidemiologists, disease modellers and vaccine developers. We used more than 5,300 whole genome SARS-CoV-2 virus sequences (size 29.6kbp) in the public domain, sourced globally [5,6] (Table 1). We explored the mutation diversity across 62 countries to provide insights into the evolutionary history and report any informative mutations for control tools. To assist the ongoing analysis of SARS-CoV-2 virus sequences and to facilitate the investigation of genetic variation, we provide an online tool – “Covid-Profiler” [8].

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

Clade frequency (%) across geographic regions

Results

An analysis was performed on 5,349 SAR-CoV-2 sequences, sourced from isolates collected between December 24, 2019 and April 6, 2020. A comparison with the Wuhan-Hu-1 reference (NC_045512) resulted in 3,502 unique SNPs in high quality regions, 3,434 (98.1%) of which in genic regions with 1,845 (52.7%) non-synonymous amino acid changes. The number of SNP differences per isolate was small (median 6; range 1 to 17) (S1 figure), and the nucleotide diversity across all isolates was low (π = 2.7×10−4). A Bayesian SNP-time dated phylogenetic tree (S2 figure) highlights the emergence of the virus in China, and its spread through multiple introductions into Europe, North America, and Oceania. It also reveals the clades of transmission involving (near-)identical sequences, initially both within and between countries, but more recently mostly within countries due to nation lockdowns. By looking at the ancestral splits within the tree, there appear to be two main clusters (denoted C1 and C2), which were further partitioned into six main clades (C1, C1.1, C2, C2.1, C2.1.1, and C2.1.2; see Figure 1a). Bayesian phylogenetic analysis revealed an association between clade and geographic location, with C2 being more common in all continents (Figure 1b, Table 1). The most ancestral cluster was determined to be C1 through phylogenetic reconstruction using the closest known relative to SARS-CoV-2 (MN996532.1). By using ancestral reconstruction of nucleotide states at the most recent common ancestors, we identified thirteen informative barcoding mutations for each clade (Table 2). Nine of the clade-defining SNPs are non-synonymous mutations and include D614S in the spike glycoprotein (S) (C2.1) and a triple mutation affecting two codons (R203K, G204R) in the nucleocapsid phosphoprotein (N) (C2.1.2). Interestingly, the three mutations separating C2.1.2 from other isolates are located in the primer binding site of a qPCR kit used by the Chinese CDC [9] (S3 figure).

S1 figure
  • Download figure
  • Open in new tab
S1 figure

Distribution of SNP differences from the Wuhan-Hu-1 reference (NC_045512) by clade

S2 figure
  • Download figure
  • Open in new tab
S2 figure Bayesian time-dated tree of the first 1,200 SARS-CoV-2 genomes, demonstrating origin of the clades

Ancestral reconstruction of the continent of isolation confirms known patterns of spread throughout the globe

S3 figure
  • Download figure
  • Open in new tab
S3 figure The global distribution of the mutation pattern for the PCR diagnostic primers and prob of the China-N assay.

Panel A shows the fraction of isolates with at least one mutation in the forward primer in different countries. Panel B shows the number of samples with different numbers of mismatches. Panel C shows the frequency and location of mismatches across the primer binding site. Panels D, E and F show the same metrics for the probe and panels G, H and I for the reverse primer.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2.

Barcoding SNPs for the six Clades

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

Phylogenetic and clade analysis of 5,349 whole genome SARS-CoV-2 sequences

To establish whether particular mutations were under selection we looked for convergent evolution across the whole genome. From a total of 3,455 variant positions, 728 sites may have evolved convergently. However, with the relatively large viral genome size and number of isolates, it is expected that random genetic drift will lead to some sites evolving in a convergent manner. An empirical approach was taken to find possible candidates that might be under selective pressure, rather than genetic drift, by looking for outliers from the expected distribution. In particular, using the 99.5 percentile of the distribution of the number of origins, led to a cut-off of 7 independent acquisitions, above which mutations may represent candidates for positive selection. Using this cut-off, we found 13 sites with missense mutations and 6 sites with synonymous mutations (Table 3). The strongest signal was for mutation L37F in nsp6, which had 41 independent mutation events. Nsp6 is involved in the formation of double-membrane vesicles (together with nsp3 and nsp4) and is thought to play a role in restricting autophagosome expansion [10]. The second strongest hit was the D614G mutation in the spike glycoprotein (S4 figure). These mutations are putatively the result of selection pressure and should be characterized further to understand how the virus is adapting to human transmission.

S4 figure
  • Download figure
  • Open in new tab
S4 figure Mutations visualised on the spike glycoprotein

SNP mutations are presented in green. The C2.1 defining SNP D614G is highlighted in red

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3.

Genetic variants with evidence of positive selection by homoplasy

A number of diagnostic primer sets have been defined by the WHO [9]. Ideally, primers should target a genomic region which is well conserved among global isolates, or at least so for the geographic catchment area of a diagnostic laboratory. To understand potential differences between primer sets, an analysis of the diversity in their binding sites was performed. For each diagnostic, the number of mismatches in their forward, reverse and probe primer sets was characterised across the 5,349 sequenced isolates (Table 4). This revealed that overall diversity was low in binding sites, however, there were some differences between geographic origins. The highest proportion of samples with >=1 mismatch in a primer was the China-N set for which 14.7% of global isolates had a mutation. When analysed on the country level there were marked differences with some countries in Latin America showing >70% of samples carrying a mutation in the primer binding site (S3 figure).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 4.

Reported COVID-19 PCR diagnostic assays, with their associated frequency of mismatches identified across the 5,349 SARS-CoV-2 sequences

The spike glycoprotein mediates cell entry by binding to host ACE2 and has been proposed as a target for antibody-mediated neutralisation [11]. It is important to take into account the variation in the amino acid sequence as this could influence the efficacy of the vaccine. Across the 5,349 SAR-CoV-2, we found 512 high quality mutations in the spike gene, with 252 missense and 260 synonymous mutations. The mutations were generally rare, with 73.6% occurring in only single isolates. One high-frequency spike mutation, D614G, was acquired on the branch defining C2.1, leading to just over half of all isolates containing this mutation (Figure 3A). The SNPs were spread throughout the gene, with a sliding window analysis of the SNP density (using windows of 50nt) reporting a generally even distribution (range: 2 to 17 SNPs, Figure 3A). Interestingly the RBM inside the RBD had higher SNP density than neighbouring regions although none of these mutations were currently common or under selection (Table 3). Analysis of the protein structural model indicated some of these mutations present on the interface between spike and ACE2 (Figure 3B, S4 Figure) and could potentially affect vaccine performance if selection pressure (such as vaccine escape) drove them to higher frequencies. Although these mutations are currently rare, they need to be evaluated in case they affect vaccine performance.

Figure 2.
  • Download figure
  • Open in new tab
Figure 2. The genetic diversity across the SARS-CoV-2 genome

Each SNP is represented by a circle. The number of times the SNP has been independently acquired is indicated on the Y-axis. The size of the circle represents the derived allele frequency. The circles are coloured to represent intergenic (yellow), synonymous (purple), missense (green) or a mix of the aforementioned (grey). The line plot represents the SNP density across the genome in windows of 100 nucleotides.

Figure 3.
  • Download figure
  • Open in new tab
Figure 3. The SNPs in the spike glycoprotein (S)

A) A gene level view of genetic variation in S is shown, with each SNP represented by a circle. The number of times the SNP has been independently acquired is indicated on the left Y-axis. The size of the circle represents the derived allele frequency. The circles are coloured to represent synonymous (purple), missense (green) or a mix of the aforementioned (grey). The line plots represents the SNP density, total (grey) and non-synonymous (green), across the gene in windows of 100 nucleotides (Y-axis right hand scale). The location of the receptor binding domain (RBD, in brown) and the receptor binding motif (RBM, in blue) are highlighted on the gene track. B) The structure of the receptor binding domain of S (brown) bound to ACE2 (grey). Mutation positions in S are highlighted in green.

Discussion

Our analysis, based on 5,349 SAR-CoV-2 genomes, shows a clear segregation into clades of transmission within and between countries. We report two main clades which are further clustered into six sub-clades, defined by thirteen informative barcoding mutations. There is overlap between our two main clades (C1 vs. non-C1) and the L and S types defined using only 103 isolates [12], where the two separating or barcoding mutations (in orf1ab and ORF8) are identical. Our phylogenetic analysis provides much finer-scale resolution on the phylogeny of C2 into sub-clades, especially in the context that SARS-CoV-2 has been accumulating an average of one to two mutations per month. Having arisen recently, the level of diversity observed are much lower than for other viruses (e.g. dengue) where subtypes or lineages exist with many more SNP differences and associated functional differences [13]. Whilst functional follow-up is required for the barcoding mutations, our SNPs (or updated versions) are likely to be useful once countries have attained greater control over domestic transmission chains, where the ability to understand and map the geographic sources of viral importation might become crucial for effective disease control.

Our analysis furthermore reveals thirteen non-synonymous sites that putatively might be subject to convergent evolution. Of these, the two strongest SNPs are located in nsp6 and the spike glycoprotein. Further research might be warranted to elucidate the role of these mutations and to further characterize the evolutionary benefits encoded by these mutations. In the future, when effective antivirals may have been developed and deployed, the tracking of the occurrence of convergent SNPs might further gain in importance and utility in monitoring the onset of antiviral resistance.

The limitations of our study are the relatively small proportion of sampled sequences (>5,000) compared to infections (>3,000,000), an over-reliance on sequences from relatively severe cases, and a potential bias towards collecting more isolates early in an outbreak when most genomes are still very similar. The resulting bias in cluster size, means we cannot infer that larger clades are more virulent or more transmissible. However, we can use the transmissibility phenotype, or others involving laboratory-based virulence or patient disease severity to identify related causal mutations, using GWAS or convergent evolution techniques [14]. One roadblock is the small number of observed polymorphisms observed in SARS-CoV-2, and the unknown contributing role of host genetics and immune system. Understanding the role of host genetics in host-virus interactions with regards to pathogenesis and disease progression is also important and await studies of sufficient size to be conclusive.

This study has also highlighted a number of SNPs that might cause misalignment with commonly used diagnostic PCR primer sets and might thus hamper screening programs and patient diagnosis through providing false negative test results. These mismatches were especially common in Latin America. Our “Covid-profiler” tool allows for the upload of SARS-CoV-2 sequence data to infer informative mutations, update phylogenies, and assess the robustness of diagnostics, as well as other control tools like vaccines. As SARS-CoV-2 virus is likely to become the fifth endemic coronavirus in the human population, and probably will not be the last coronavirus to jump the animal-to-human species boundary, our “Covid-profiler” is likely to have utility beyond the current outbreak.

Conclusions

Analysis of global SARS-CoV-2 viral sequences has led to insights into the robustness of existing, and development of new, tools for infection control. We have developed an online tool (“Covid-profiler”) to facilitate this type of analysis and profile SARS-CoV-2 viral sequences.

Methods

Full SARS-CoV-2 genome sequences were downloaded from the GISAID [6] and NCBI [5], covering isolates collected between December 24, 2019 and April 6, 2020. Sequences were aligned to the reference genome (NC_045512.2) using minimap2 software [15]. Variants were called using paftools.js from the minimap2 package and bcftools software [16], and transformed into a FASTA alignment. IQ-TREE [17] and BEAST [18] software were used to reconstruct the phylogeny tree, which was rooted using the closest known relative to SARS-CoV-2 sourced from bats (MN996532.1). The number of independent acquisitions of the mutation was counted using empirical Bayesian ancestral reconstruction methods and custom scripts utilising the ete3 python library [19]. A high number of acquisitions can be indicative of genomic regions under positive selection pressure. The SARS-CoV-2 spike glycoprotein structure was downloaded from the PDB (6m0J) [20] and visualised using chimera software [21].

DECLARATIONS

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and materials

The datasets analysed during the current study are available from GISAID (https://www.gisaid.org) and NCBI (https://www.ncbi.nlm.nih.gov)

Competing interests

The authors declare that they have no competing interests

Funding

JP is funded by Bloomsbury SET funding. DW is funded by a Bloomsbury Research PhD studentship. SC is funded by Medical Research Council UK grants (MR/M01360X/1, MR/R025576/1, and MR/R020973/1) and BBSRC (Grant no. BB/R013063/1). TGC is funded by the Medical Research Council UK (Grant no. MR/M01360X/1, MR/N010469/1, MR/R025576/1, and MR/R020973/1) and BBSRC (Grant no. BB/R013063/1).

Authors’ contributions

JP, SC, MLH, and TGC conceived and directed the project. JP, WD and DW coordinated data processing. JP and WD performed bioinformatic and statistical analyses under the supervision of MLH and TGC. All authors interpreted results. JP, WD, MLH and TGC wrote the first draft of the manuscript. All authors commented and edited on various versions of the draft manuscript and approved the final manuscript. JP compiled the final manuscript.

Acknowledgements

We thank Aleksei Ponomarev for providing coding support on the data processing. We gratefully acknowledge the laboratories who submitted the sequences to the NCBI and GISAID public databases on which this research is based. We also thank NCBI and GISAID for developing and curating their databases.

Footnotes

  • https://www.gisaid.org/

  • https://submit.ncbi.nlm.nih.gov/sarscov2/

References

  1. [1].↵
    https://www.worldometers.info/coronavirus/
  2. [2].↵
    Zhang Y-Z, Holmes EC. A Genomic Perspective on the Origin and Emergence of SARS-CoV-2. Cell, 2020 DOI: 10.1016/j.cell.2020.03.035
    OpenUrlCrossRef
  3. [3].↵
    Lam TT, Shum MH, Zhu HC, Tong YG, Ni XB, Liao YS, Wei W, Cheung WY, Li WJ, Li LF, Leung GM, Holmes EC, Hu YL, Guan Y. Identifying SARS-CoV-2 related coronaviruses in Malayan pangolins. Nature, 2020; DOI: 10.1038/s41586-020-2169-0
    OpenUrlCrossRefPubMed
  4. [4].↵
    Shang J, Ye G, Shi K, Wan Y, Luo C, Aihara H, Geng Q, Auerbach A, Li F. Structural basis of receptor recognition by SARS-CoV-2. Nature. 2020 Mar 30. doi: 10.1038/s41586-020-2179-y.
    OpenUrlCrossRefPubMed
  5. [5].↵
    https://www.ncbi.nlm.nih.gov
  6. [6].↵
    https://www.gisaid.org
  7. [7].↵
    Coll F, Phelan J, Hill-Cawthorne GA, Nair MB, Mallard K, Ali S, et al. Genome-wide analysis of multi- and extensively drug-resistant Mycobacterium tuberculosis. Nat Genet. 2018 Feb;50(2):307–316.
    OpenUrlCrossRef
  8. [8].↵
    genomics.lshtm.ac.uk
  9. [9].↵
    https://www.who.int/emergencies/diseases/novel-coronavirus-2019/technical-guidance/laboratory-guidance
  10. [10].↵
    Benvenuto D, Angeletti S, Giovanetti M, Bianchi M, Pascarella S, Cauda R, Ciccozzi M, Cassone A. Evolutionary analysis of SARS-CoV-2: how mutation of Non-Structural Protein 6 (NSP6) could affect viral autophagy. J Infect. 2020 Apr 10. pii: S0163–4453(20)30186-9.
  11. [11].↵
    Wrapp D, Wang N, Corbett KS, Goldsmith JA, Hsieh CL, Abiona O, Graham BS, McLellan JS. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020 Mar 13;367:1260–1263.
    OpenUrlAbstract/FREE Full Text
  12. [12].↵
    Xiaolu Tang, Changcheng Wu, Xiang Li, Yuhe Song, Xinmin Yao, Xinkai Wu, Yuange Duan, Hong Zhang, Yirong Wang, Zhaohui Qian. On the origin and continuing evolution of SARS-CoV-2. National Science Review, nwaa036, https://doi.org/10.1093/nsr/nwaa036
  13. [13].↵
    Bell SM, Katzelnick L, Bedford T. Dengue genetic divergence generates within-serotype antigenic variation, but serotypes dominate evolutionary dynamics. Elife. 2019 Aug 6;8. pii: e42496. doi: 10.7554/eLife.42496.
    OpenUrlCrossRef
  14. [14].↵
    Sobkowiak B, Banda L, Mzembe T, Crampin AC, Glynn JR, Clark TG. Bayesian reconstruction of Mycobacterium tuberculosis transmission networks in a high incidence area over two decades in Malawi reveals associated risk factors and genomic variants. Microb Genom. 2020 Apr 1. doi: 10.1099/mgen.0.000361.
    OpenUrlCrossRef
  15. [15].↵
    Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 2018 Sep 15;34(18):3094–3100.
    OpenUrlCrossRefPubMed
  16. [16].↵
    http://github.com/samtools/bcftools
  17. [17].↵
    http://www.iqtree.org
  18. [18].↵
    Suchard MA, Lemey P, Baele G, Ayres DL, Drummond AJ & Rambaut A (2018) Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10 Virus Evolution 4, vey016.
    OpenUrlCrossRefPubMed
  19. [19].↵
    http://etetoolkit.org
  20. [20].↵
    http://www.rcsb.org/structure/6m0j
  21. [21].↵
    https://www.cgl.ucsf.edu/chimera/
Back to top
PreviousNext
Posted April 29, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world
Jody Phelan, Wouter Deelder, Daniel Ward, Susana Campino, Martin L. Hibberd, Taane G Clark
bioRxiv 2020.04.28.066977; doi: https://doi.org/10.1101/2020.04.28.066977
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Controlling the SARS-CoV-2 outbreak, insights from large scale whole genome sequences generated across the world
Jody Phelan, Wouter Deelder, Daniel Ward, Susana Campino, Martin L. Hibberd, Taane G Clark
bioRxiv 2020.04.28.066977; doi: https://doi.org/10.1101/2020.04.28.066977

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4234)
  • Biochemistry (9128)
  • Bioengineering (6774)
  • Bioinformatics (23989)
  • Biophysics (12117)
  • Cancer Biology (9523)
  • Cell Biology (13773)
  • Clinical Trials (138)
  • Developmental Biology (7627)
  • Ecology (11686)
  • Epidemiology (2066)
  • Evolutionary Biology (15506)
  • Genetics (10638)
  • Genomics (14322)
  • Immunology (9479)
  • Microbiology (22832)
  • Molecular Biology (9089)
  • Neuroscience (48987)
  • Paleontology (355)
  • Pathology (1480)
  • Pharmacology and Toxicology (2568)
  • Physiology (3844)
  • Plant Biology (8327)
  • Scientific Communication and Education (1471)
  • Synthetic Biology (2296)
  • Systems Biology (6187)
  • Zoology (1300)