TY - JOUR T1 - Unraveling the molecular basis of host cell receptor usage in SARS-CoV-2 and other human pathogenic β-CoVs JF - bioRxiv DO - 10.1101/2020.08.21.260745 SP - 2020.08.21.260745 AU - Camila Pontes AU - Victoria Ruiz-Serra AU - Rosalba Lepore AU - Alfonso Valencia Y1 - 2020/01/01 UR - http://biorxiv.org/content/early/2020/08/21/2020.08.21.260745.abstract N2 - The recent emergence of the novel SARS-CoV-2 in China and its rapid spread in the human population has led to a public health crisis worldwide. Like in SARS-CoV, horseshoe bats currently represent the most likely candidate animal source for SARS-CoV-2. Yet, the specific mechanisms of cross-species transmission and adaptation to the human host remain unknown. Here we show that the unsupervised analysis of conservation patterns across the β-CoV spike protein family, using sequence information alone, can provide rich information on the molecular basis of the specificity of β-CoVs to different host cell receptors. More precisely, our results indicate that host cell receptor usage is encoded in the amino acid sequences of different CoV spike proteins in the form of a set of specificity determining positions (SDPs). Furthermore, by integrating structural data, in silico mutagenesis and coevolution analysis we could elucidate the role of SDPs in mediating ACE2 binding across the Sarbecovirus lineage, either by engaging the receptor through direct intermolecular interactions or by affecting the local environment of the receptor binding motif. Finally, by the analysis of coevolving mutations across a paired MSA we were able to identify key intermolecular contacts occurring at the spike-ACE2 interface. These results show that effective mining of the evolutionary records held in the sequence of the spike protein family can help tracing the molecular mechanisms behind the evolution and host-receptors adaptation of circulating and future novel β-CoVs.Significance Unraveling the molecular basis for host cell receptor usage among β-CoVs is crucial to our understanding of cross-species transmission, adaptation and for molecular-guided epidemiological monitoring of potential outbreaks. In the present study, we survey the sequence conservation patterns of the β-CoV spike protein family to identify the evolutionary constraints shaping the functional specificity of the protein across the β-CoV lineage. We show that the unsupervised analysis of statistical patterns in a MSA of the spike protein family can help tracing the amino acid space encoding the specificity of β-CoVs to their cognate host cell receptors. We argue that the results obtained in this work can provide a framework for monitoring the evolution of SARS-CoV-2 specificity to the hACE2 receptor, as the virus continues spreading in the human population and differential virulence starts to arise.Competing Interest StatementThe authors have declared no competing interest. ER -