PT - JOURNAL ARTICLE
AU - Veronika Tchesnokova
AU - Hemantha Kulakesara
AU - Lydia Larson
AU - Victoria Bowers
AU - Elena Rechkina
AU - Dagmara Kisiela
AU - Yulia Sledneva
AU - Debarati Choudhury
AU - Iryna Maslova
AU - Kai Deng
AU - Kirthi Kutumbaka
AU - Hao Geng
AU - Curtis Fowler
AU - Dina Greene
AU - James Ralston
AU - Mansour Samadpour
AU - Evgeni Sokurenko
TI - Acquisition of the L452R mutation in the ACE2-binding interface of Spike protein triggers recent massive expansion of SARS-Cov-2 variants
AID - 10.1101/2021.02.22.432189
DP - 2021 Jan 01
TA - bioRxiv
PG - 2021.02.22.432189
4099 - http://biorxiv.org/content/early/2021/03/11/2021.02.22.432189.short
4100 - http://biorxiv.org/content/early/2021/03/11/2021.02.22.432189.full
AB - The recent rise in mutational variants of SARS-CoV-2, especially with changes in the Spike protein, is of significant concern due to the potential ability for these mutations to increase viral infectivity, virulence and/or ability to escape protective antibodies. Here, we investigated genetic variations in a 414-583 amino acid region of the Spike protein, partially encompassing the ACE2 receptor-binding domain (RBD), across a subset of 570 nasopharyngeal samples isolated between April 2020 and February 2021, from Washington, California, Arizona, Colorado, Minnesota and Illinois. We found that samples isolated since November have an increased number of amino acid mutations in the region, with L452R being the dominant mutation. This mutation is associated with a recently discovered CAL.20C viral variant from clade 20C, lineage B.1.429, that since November-December 2020 is associated with multiple outbreaks and is undergoing massive expansion across California.
In some samples, however, we found a distinct L452R-carrying variant of the virus that, upon detailed analysis of the GISAID database genomes, is also circulating primarily in California, but emerged even more recently. The newly identified variant derives from the clade 20A (lineage B.1.232) and is named CAL.20A. We also found that the SARS-CoV-2 strain that caused the only recorded case of infection in an ape - gorillas in the San Diego Zoo, reported in January 2021 - is CAL.20A. In contrast to CAL.20C that carries two additional to L452R mutations in the Spike protein, L452R is the only mutation found in CAL.20A. According to the phylogenetic analysis, however, emergence of CAL.20C was also specifically triggered by acquisition of the L452R mutation. Further analysis of GISAID-deposited genomes revealed that several independent L452R-carrying lineages have recently emerged across the globe, with over 90% of the isolates reported between December 2020 – February 2021. Taken together, these results indicate that the L452R mutation alone is of significant adaptive value to SARS-CoV-2 and, apparently, the positive selection for this mutation became particularly strong only recently, possibly reflecting viral adaptation to the containment measures or increasing population immunity. While the functional impact of L452R has not yet been extensively evaluated, leucine-452 is positioned in the receptor-binding motif of RBD, in the interface of direct contact with the ACE2 receptor. Its replacement with arginine is predicted to result in both a much stronger binding to the receptor and escape from neutralizing antibodies. If true, this in turn might lead to significantly increased infectivity of the L452R variants, warranting their close surveillance and in-depth functional studies. Competing Interest Statement: E. Sokurenko is Founder of ID Genomics.