ABSTRACT
Objectives During the first months of SARS-CoV-2 evolution in a new host, contrasting hypotheses have been proposed about the way the virus has evolved and diversified worldwide. The aim of this study was to perform a comprehensive evolutionary analysis to describe the human outbreak and the evolutionary rate of different genomic regions of SARS-CoV-2.
Methods The molecular evolution in nine genomic regions of SARS-CoV-2 was analyzed using three different approaches: phylogenetic signal assessment, emergence of amino acid substitutions, and Bayesian evolutionary rate estimation in eight successive fortnights since the virus emergence.
Results All observed phylogenetic signals were very low and consistent trees were obtained. However, after four months of evolution, it was possible to identify regions revealing an incipient viral lineages formation despite the low phylogenetic signal, since fortnight 3. Finally, the SARS-CoV-2 evolutionary rate for regions nsp3 and S, the ones presenting greater variability, was estimated to range from 1.37 to 2.19 x 10−3 substitution/site/year.
Conclusions In conclusion, results obtained in this work about the variable diversity of crucial viral regions and the determination of the evolutionary rate are consequently decisive to understanding essential feature of viral emergence. In turn, findings may allow identifying the best targets for antiviral treatments and vaccines development.
Competing Interest Statement
The authors have declared no competing interest.