TY - JOUR T1 - SARS-CoV-2 Genomic Surveillance in Costa Rica: Evidence of a Divergent Population and an Increased Detection of a Spike T1117I Mutation JF - bioRxiv DO - 10.1101/2020.12.21.423850 SP - 2020.12.21.423850 AU - Jose Arturo Molina-Mora AU - Estela Cordero-Laurent AU - Adriana Godínez AU - Melany Calderón-Osorno AU - Hebleen Brenes AU - Claudio Soto-Garita AU - Cristian Pérez-Corrales AU - COINGESA-CR Consorcio Interinstitucional de Estudios Genómicos del SARS-CoV-2 Costa Rica AU - Jan Felix Drexler AU - Andres Moreira-Soto AU - Eugenia Corrales-Aguilar AU - Francisco Duarte-Martínez Y1 - 2021/01/01 UR - http://biorxiv.org/content/early/2021/01/19/2020.12.21.423850.abstract N2 - Genome sequencing is a key strategy in the surveillance of SARS-CoV-2, the virus responsible for the COVID-19 pandemic. Latin America is the hardest hit region of the world, accumulating almost 20% of COVID-19 cases worldwide. Costa Rica was first exemplary for the region in its pandemic control, declaring a swift state of emergency on March 16th that led to a low quantity of cases, until measures were lifted in early May. From the first detected case in March 6th to December 31st almost 170 000 cases have been reported in Costa Rica, 99.5% of them from May onwards. We analyzed the genomic variability during the SARS-CoV-2 pandemic in Costa Rica using 185 sequences, 52 from the first months of the pandemic, and 133 from the current wave.Three GISAID clades (G, GH, and GR) and three PANGOLIN lineages (B.1, B.1.1, and B.1.291) are predominant, with phylogenetic relationships that are in line with the results of other Latin American countries, suggesting introduction and multiple re-introductions from other regions of the world. The whole-genome variant calling analysis identified a total of 283 distinct nucleotide variants. These correspond mostly to non-synonymous mutations (51.6%, 146) but 45.6% (129) corresponded to synonymous mutations. The 283 variants showed an expected power-law distribution: 190 single nucleotide mutations were identified in single sequences, only 16 single nucleotide mutations were found in >5% sequences, and only two mutations in >50% genomes. These mutations were distributed through the whole genome. However, 63.6% were present in ORF1ab, 11.7% in Spike gene and 10.6% in the Nucleocapsid gene. Additionally, the prevalence of worldwide-found variant D614G in the Spike (98.9% in Costa Rica), ORF8 L84S (1.1%) is similar to what is found elsewhere. Interestingly, the frequency of mutation T1117I in the Spike has increased during the current pandemic wave beginning in May 2020 in Costa Rica, reaching 29.2% detection in the full genome analyses in November 2020. This variant has been observed in less than 1% of the GISAID reported sequences worldwide in all the 2020. Structural modeling of the Spike protein with the T1117I mutation suggest a potential effect on the viral oligomerization needed for cell infection, but no differences with other genomes on transmissibility, severity nor vaccine effectiveness are predicted. Nevertheless, in-vitro experiments are required to support these in-silico findings. In conclusion, genome analyses of the SARS-CoV-2 sequences over the course of COVID-19 pandemic in Costa Rica suggest introduction of lineages from other countries as travel bans and measures were lifted, similar to results found in other studies, as well as an increase in the Spike-T1117I variant that needs to be monitored and studied in further analyses as part of the surveillance program during the pandemic.Competing Interest StatementThe authors have declared no competing interest. ER -