ABSTRACT
The novel betacoronavirus (SARS-CoV-2) is highly contagious and can cause serious acute respiratory illness syndromes, often fatal, called covid-19. It is an urgent priority to better understand SARS-CoV-2 infection mechanisms that will help in the development of prophylactic vaccines and therapeutics that are very important to people health and socioeconomic stability around the world. The surface coronavirus spike (S) glycoprotein is considered as a key factor in host specificity because it mediates infection by receptor-recognition and membrane fusion. Here the analysis of CoV-2 S protein revealed in S1subunit a B56-binding LxxIxE-like motif that could recruit the host protein phosphatase 2A (PP2A). This motif is absent in SARS-CoV and MERS-CoV. PP2A is a major family of serine/threonine phosphatases in eukaryotic cells. Phosphatases and kinases are big players in the regulation of pro-inflammatory responses during pathogenic infections. Moreover, studies have shown that viruses use multiple strategies to target PP2A in order to manipulate host’s antiviral responses. The latest studies have indicated that SARS-CoV-2 is involved in sustained inflammation in the host. Therefore, by controlling acute inflammation; it is possible to eliminate its dangerous effects on the host. Among efforts to fight covid-19, the interaction between LxxIxE-like motif and PP2A-B56-binding pocket could be a target for the development of a bioactive peptide and ligand inhibitors for therapeutic purposes.
INTRODUCTION
In March 11th 2020, the World Health Organization (WHO) announced that covid-19 situation is a pandemic because of the speed and scale of transmission. Coronaviruses (CoVs) are a large family of enveloped single positive-stranded RNA viruses that can infect both mammalian and avian species because their rapid mutation and recombination facilitate their adaptation to new hosts (Graham and Baric, 2010; Li, 2013). They can cause severe, often fatal acute respiratory disease syndromes named covid-19. CoVs are classified into Alpha-, Beta-, Gamma-, and Deltacoronavirus genetic genera. The novel betacoronavirus (betaCoVs) SARS-CoV-2 is relatively close to other betaCoVs: severe acute respiratory syndrome coronavirus (SARS-CoV), Middle East respiratory syndrome coronavirus (MERS-CoV), bat coronavirus HKU4, mouse hepatitis coronavirus (MHV), bovine coronavirus (BCoV), and human OC43 coronavirus (HCoV-OC43). SARS-CoV emerged in China (2002–2003) and spread to other countries (more than 8,000 infection cases and a fatality rate of ∼10%) (Peiris et al., 2003). In 2012, MERS-CoV was detected in the Middle East. It spread to multiple countries, infecting more than 1,700 people with a fatality rate of ∼36%.
The surface-located SARS-CoV-2 spike glycoprotein S (S) is a 1273 amino acid residues. It is a homotrimeric, multidomain, and integral membrane protein that give coronaviruses the appearance of having crowns (Corona in Latin) (Li, 2016). It is a key piece of viral host recognition (receptor-recognition) and organ tropism and induces strongly the host immune reaction (Li, 2015). It is subdivided to S1 subunit that binds to a receptor on the host cell surface and S2 subunit that permits viral and host membranes fusion. S1 subunit is divided into two domains, an N-terminal domain (NTD) and a C-terminal receptor-binding domain (RBD) that can function as viral receptors-binding (Li, 2012). In addition, S1 subunit is normally more variable in sequence among different CoVs than is the S2 subunit (Masters, 2006).
Protein phosphatase 2A (PP2A) is a major family of serine/threonine phosphatases in eukaryotic cells and regulates diverse biological processes through dephosphorylation of numerous signaling molecules. PPA2 and phosphatase 1 (PP1), regulates over 90% of all ser/thr dephosphorylation events in eukaryotic cells (Eichhorn et al., 2009). PP2A is a heterotrimeric holoenzyme composed of a stable heterodimer of the scaffold A-subunit (PP2A-A) and catalytic C-subunit (PP2A-C) and a variable mutually exclusive regulatory subunit from four families (B (B55), B′ (B56), B″ and B‴) which provide substrate specificity. The human B56 family consists of at least five different members (α, β, γ, δ and ε). Phosphatases and kinases are big players in the regulation of pro-inflammatory responses during microbial infections. Moreover, studies have revealed that viruses use multiple strategies to target PP2A in the aim to manipulate host antiviral responses (Guergnon et al., 2011). Here, face to urgent priority to fight the novel SARS-CoV-2 due to its grave consequences in the human health and socioeconomic stability around the world, S protein was analyzed because its importance in mediating infection. This analysis revealed in S1subunit a B56-binding LxxIxE-like motif that could recruit the host PP2A. The interaction S1 subunit-host PP2A-B56 could be a target for the development of a bioactive peptide and ligand inhibitors for therapeutic purposes.
RESULTS AND DISCUSSION
Two LxxIxE-like motifs in S1 and S2 subunits of Spike S
Sequence analysis of SARS-CoV-2 spike protein by the eukaryotic linear motif (ELM) resource (http://elm.eu.org/) revealed short linear motifs (SLiMs) known as LxxIxE-like motif,293LDPLSE298 in S1 subunit and 1197LIDLQE1202 in S2 subunit (Fig. 1). SLiMs are few amino acid residues (3-15) in proteins that facilitate protein sequence modifications and protein-protein interactions (Davey et al., 2012; Van Roey et al., 2014). Viruses are known to mutate quickly and thus create mimic motifs, on very short time scales, that could hijack biological processes in the host cell such as cell signaling networks (Davey et al., 2015; Via et al., 2015; Davey et al., 2011). Interestingly, 293LDPLSET299 is only present in SARS-CoV-2 (Fig. 1A). It is absent in the other coronaviruses S protein analysed in this study. In order to interact with protein(s), 293LDPLSE298 must be present at the surface of S1 subunit. Indeed, it is exposed in the surface of S1 subunit in the end of NTD (Fig. 3B). Additionally, this motif could be an antigenic epitope to generate antibodies and/or can help the design of vaccine components and immuno-diagnostic reagents. A second motif 1197LIDLQEL1203 is present in S2 subunit. It is conserved in S2 subunit of SARS-CoV-2, SARS-CoV, SARS-like of bat from China and Kenya (Fig. 1B). These last betacoronaviruses are phylogenetically close (Fig. 2). Unfortunately, the region containing 1197LIDLQEL1203 peptide has not been resolved in all known structures of spike S protein of coronaviruses to know if it is exposed in the surface of S2 subunit.
Multiple alignment of the spike glycoprotein of betacoronaviruses using Clustal omega (Sievers et al., 2011). LxxIxE-like motifs are indicated by green stars. Numbers at the start of each sequence corresponding to the GenBank and UniProt accession number. Green stars indicated LxxIxE-like motif. The figure was prepared with ESPript (http://espript.ibcp.fr).
Unrooted phylogenetic tree of spike protein of representative betacoronaviruses. The tree was constructed using Mr Bayes method (Huelsenbeck and Ronquist, 2001) based on the multiple sequence alignment by Clustal omega (Sievers et al., 2011). Numbers at the start of each sequence corresponding to the GenBank and UniProt accession number. Red rectangle assembles betacoronaviruses with the same 1197LIDLQE1202. Green star indicated the only betacoronavirus with 293LDPLSE298.
(A) Diagram representation of the S1 subunit of spike protein of SARS-CoV-2 colored by domain. N-terminal domain (NTD, cyan), receptor-binding domain (RBD, green), subdomains 1 and 2 (SD1-2, orange) and the localization of 293LDPLSE298 in the end of NTD. (B) Surface structure representation of the S1subunit of spike protein (PDBid: 6VSB_A). 293LDPLSE298 peptide is localized in the surface (red).
Interactions of 293LDPLSET299and 1197LIDLQEL1203 with Subunit B56-PP2A
To compare the interactions between these peptides and B56 regulatory subunit of PP2A, molecular docking was performed with the software AutoDock vina (Trott and Olson, 2010). Fig. 4 shows that peptides are localized in the same region as pS-RepoMan peptide (PDBid: 5SW9) and important amino acids of LxxIxE-like motif are superposed with those of pS-RepoMan peptide (Fig. 4C). Interestingly, 293LDPLSET299 contains a serine and threonine that could be phosphorylated generating a negative charge that will interact with positive patch in subunit B56 of PP2A, enhancing binding affinity (Fig. 4A) (Nygren and Scott, 2015). According to Autodock software, binding affinity of 293LDPLSET299is -4.8 Kcal/mol and this of 1197LIDLQEL1203is -3.5 Kcal/mol. The difference of binding affinity may explained by the phosphorylation of serine and threonine. It is known that the binding affinity of SLiMs is relatively weak (low µmolar range) (Gouw et al., 2018). This knowledge of molecular interactions of 293LDPLSET299 and B56-PP2A will pave the way to design a peptide able to mimic the surface of B56-PP2A and strongly bind to 293LDPLSET299 surface precluding PP2A’s recruitment (Zaidman and Wolfson, 2016).
Electrostatic potential surface representation of the region of the regulatory subunit B56 of PP2A (PDBid: 5SWF_A) with docked peptides. (A) 293LDPLpSEpT299 (green), (B) 1197LIDLQEL1203 (cyan) and (C) 293LDPLpSEpT299 superposed to pS-RepoMan (581RDIASKKPLLpSPIPELPEVPE601) peptide (orange, PDBid: 5SW9_B). The surfaces are colored by electrostatic potential with negative charge shown in red and positive charge in blue. Images were generated using PyMol (www.pymol.org).
Protein phosphatase 2A and single RNA viruses
It has been shown in single RNA viruses, Ebola virus (EBOV) and Dengue fever virus (DENV) that they recruit the host PP2A through its regulatory subunit B56-binding LxxIxE motif to activate transcription and replication (Kruse et al., 2018; Oliveira et al., 2018). In addition, it has been shown in mice infected with rhinovirus 1B (the most common viral infectious agent in humans) an exacerbation of lung inflammation. Administrating Salmeterol (beta-agonist) treatment exerts anti-inflammatory effects by increasing PP2A activity. It is probable that beta-agonists have the potential to target distinct proinflammatory pathways unresponsive to corticosteroids in patients with rhinovirus-induced exacerbations. (Hatchwell et al., 2014). It is interesting to learn about Salmeterol drug and the possibility of using it in covid-19’s patients with sustained and dangerous inflammatory reaction.
MATERIALS AND METHODS
Sequence analysis
To search probable short linear motifs (SLiMs), SARS-CoV-2 spike protein sequence was scanned with the eukaryotic linear motif (ELM) resource (http://elm.eu.org/).
3D modeling and molecular docking
For docking, the coordinates of the 293LDPLSET299 peptide were extracted from spike S protein of CoV-2 structure (PDBid: 6VSB_A). Unfortunately, the region containing 1197LIDLQEL1203 peptide has not been resolved in all known structures of spike S protein. So, Pep-Fold (Thevenet et al., 2012) software was used to model de novo this peptide. The model quality of the peptide was assessed by analysis of a Ramachandran plot through PROCHECK (Vaguine et al., 1999). The docking of the two peptides into regulatory subunit B56 of PPA2 (PDBid: 5SWF_A) was performed with the software AutoDock vina (Trott and Olson, 2010). The 3D complex containing regulatory subunit B56 of PPA2 and peptides was refined by using FlexPepDock (London et al., 2011), which allows full flexibility to the peptide and side-chain flexibility to the receptor. The electrostatic potential surface of the regulatory subunit B56 of PPA2 was realized with PyMOL software (http://pymol.org/).
Phylogeny
To establish the phylogenetic relationships between spike S protein of SARS-CoV-2 and representative betacoronaviruses, amino acid residues sequences were aligned with Clustal omega (Sievers et al., 2011) and a phylogenetic tree was constructed with MrBayes (Huelsenbeck and Ronquist, 2001) using: Likelihood model (Number of substitution types: 6(GTR); Substitution model: Poisson; Rates variation across sites: Invariable + gamma); Markov Chain Monte Carlo parameters (Number of generations: 100 000; Sample a tree every: 1000 generations) and Discard first 500 trees sampled (burnin).
CONFLICT OF INTERESTED
The author declares that he has no conflicts of interest.
ACKNOWLEDGMENTS
I would like to thank the IBIS bioinformatics group for their assistance.