Modelling conformational state dynamics and its role on infection for SARS-CoV-2 Spike protein variants

Natália Teruel; Olivier Mailhot; Rafael Josef Najmanovich

doi:10.1101/2020.12.16.423118

Abstract

The SARS-CoV-2 Spike protein needs to be in an open-state conformation to interact with ACE2 as part of the viral entry mechanism. We utilise coarse-grained normal-mode analyses to model the dynamics of Spike and calculate transition probabilities between states for 17081 Spike variants. Our results correctly model an increase in open-state occupancy for the more infectious D614G via an increase in flexibility of the closed-state and decrease of flexibility of the open-state. We predict the same effect for several mutations on Glycine residues (404, 416, 504, 252) as well as residues K417, D467 and N501, including the N501Y mutation, explaining the higher infectivity of the B.1.1.7 and 501.V2 strains. This is, to our knowledge, the first use of normal-mode analysis to model conformational state transitions and the effect of mutations thereon. The specific mutations of Spike identified here may guide future studies to increase our understanding of SARS-CoV-2 infection mechanisms and guide public health in their surveillance efforts.

1. Introduction

The coronavirus pandemic has emerged as a major and urgent issue affecting individuals, families and societies as a whole. Among all outbreaks of aerosol-transmissible diseases in the 21st century, the COVID-19 pandemic, caused by the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) virus [1,2], has the highest cumulative numbers of infections and deaths: 83 million infections and over 1.8 million deaths, according to the World Health Organization (WHO) epidemiological report of January 5, 2021 [3]. Recent WHO reports also show significant weekly increases in the number of infections and deaths as countries start to face upcoming waves of the disease. In 2003, the SARS coronavirus (SARS-CoV) pandemic caused 8,098 infections and 774 deaths before it was brought under control [4,5]. In 2012, the Middle East respiratory syndrome-related coronavirus (MERS-CoV) outbreak caused 2,499 infections and 858 deaths, presenting the highest fatality rate [6]. SARS-CoV-2, SARS-CoV and MERS-CoV, like coronaviruses in general, present considerable mutation rates, which may contribute to future outbreaks. For instance, SARS-CoV-2 is estimated to have a mutation rate close to those of MERS-CoV [7] and SARS-CoV [8], as well as other RNA viruses, with a median of 1.12 × 10⁻³ mutations per site per year [9]. The high mutation rate may in part be responsible for the zoonotic nature of these viruses and points to a clear risk of still-undetected additional members of the Coronaviridae family of viruses making the jump from their traditional hosts to humans in the future.

The SARS-CoV-2 Spike protein (Uniprot ID P0DTC2) is responsible for anchoring the virus to the host cell. The entry receptor for SARS-CoV-2 and other lineages of human coronaviruses is the human cell-surface protein angiotensin converting enzyme 2 (ACE2) (Uniprot ID Q9BYF1) [10]. Therefore, studying the Spike protein family is essential to understand the evolution of coronaviruses.

SARS-CoV-2 Spike is a homo-trimeric glycoprotein, with each chain built by subunits S1 and S2, delimited by a furin cleavage site at residues 682-685. The S1 subunit comprises the N-terminal Domain (NTD), located in the peripheral part of the extramembrane extremity, and the Receptor Binding Domain (RBD), the most flexible site, located in the central part of this same extremity. The S2 subunit consists of the fusion peptide (FP), heptad repeat 1 (HR1), heptad repeat 2 (HR2), the transmembrane domain (TM), and the cytoplasmic tail (CT) (Figure 1). The interaction between Spike and ACE2 requires Spike to be in its open conformation, in which the RBD is extended [11]. The study of the binding properties between Spike and ACE2, although important, cannot explain all the nuances of the infection mechanism. An example of this limitation is the comparison between SARS-CoV and SARS-CoV-2, which have different rates of infection even though they share similar Spike-ACE2 affinities [12]. These facts lead us to consider the contribution of the dynamics of the Spike protein to the infection process.

Figure 1.

Domains of the Spike protein. N-Terminal Domain (NTD), Receptor Binding Domain (RBD), Subunit 1/Subunit 2 junction (S1/S2), Fusion Peptide (FP), Heptad Repeat 1 (HR1), Heptad Repeat 2 (HR2), Transmembrane Domain (TM), and the Cytoplasmic Tail (CT). Crystallography structure in the conformational state of all 3 RBD domains closed (PDB 6VXX) and of 1 RBD open (PDB 6VYB), binding to ACE2 (PDB 6M17).

Computational structural biology methods have grown in both accuracy and usability over the years and are increasingly accepted as part of an integrated approach to tackle problems in molecular biology. Such integration speeds up research, decreases the need for infrastructure, reagents, and human resources, and allows us to evaluate increasingly larger data sets. Computational approaches are being extensively used in the study of SARS-CoV-2 and its mechanisms of infection [13-15]. Among these, we highlight studies of the dynamic properties of the Spike protein, of antibody recognition, and of the search for therapeutic interventions [16-18].

Several aspects of the dynamics of the Spike protein are being currently studied, with a range of particular goals: to evaluate the docking of small molecules to the RBD domain [19], to search for alternative target binding-sites for vaccine development [20], to understand residue-residue interactions and their effects on conformational plasticity [21] and to investigate the flexibility of different domains in particular conformational states [22].

Normal Mode Analysis (NMA) methods are being employed in the study of different conformational states [23] and of different coronavirus variants [24]. Computational methods for studying dynamics, however, are limited in their ability to capture the effects of mutations, because they are either extremely taxing on computational resources (e.g., molecular dynamics) or agnostic to the nature of amino acids (e.g., traditional coarse-grained NMA methods). In the past, our group developed a coarse-grained NMA method called ENCoM (for Elastic Network Contact Model) that is more accurate than alternative coarse-grained NMA methods due to the explicit consideration of the chemical nature of amino acids and their interactions and consequently their effect on dynamics [25]. ENCoM performs better than other NMA methods on traditional applications and is the only coarse-grained NMA method capable of predicting the effect of mutations on protein stability and function as a result of dynamic properties [26-28].

In this study, we use the ENCoM method to study the dynamics of the Spike protein, considering different conformational states and several sequence variants observed during the current pandemic, as well as through large-scale analysis of in silico mutations. Experimental analysis of the effect of the SARS-CoV-2 Spike mutation D614G and the comparison between SARS-CoV and SARS-CoV-2 Spike proteins show unique dynamic characteristics that correlate with epidemiological and experimental data on infection. The present work shows that we can replicate such results computationally, suggesting that rigidity or flexibility of different Spike conformational states affects infectivity. We present a high throughput analysis of simulated single amino acid mutations on dynamic properties to seek potential hotspots and individual Spike variants that may be more infectious and therefore may guide public health decisions if such variants were to appear in the population. We also introduce a Markov model of occupancy of molecular states with transition probabilities derived from our analysis of dynamics that recapitulates experimental data on conformational state occupancies. This is the first application of an NMA method that derives transition probabilities from normal modes and employs them in a dynamic system to predict the occupancy of different conformational states. We model the occupancy of several variants and highlight those that may be useful in studying future epidemiological trends that can be responsible for new outbreaks.

2. Methods

2.1 Spike protein models

We performed our analyses using the crystallographic models of the SARS-CoV-2 Spike protein in the open (PDB ID 6VYB) and closed (PDB ID 6VXX) states. The open (prefusion) state was designed with an abrogated Furin S1/S2 cleavage site and two consecutive proline mutations that improve expression [29]. Despite the mutations, the engineered structures correctly represent the conformational states of Spike, as confirmed by independently solved structures [23,30]. The PDB structures used for the SARS-CoV comparison were 5X58 and 5X5B for the closed state and the one-RBD-open state, respectively [31].

We removed heteroatoms, water molecules, and hydrogen atoms from the PDB structures. Missing residues were reconstructed using template-based loop reconstruction and refinement with Modeller [32]. Single amino acid mutants were generated using FoldX4 [33]. ΔΔS_vib and occupancy calculations were performed with reconstructed closed and one-RBD-open structures using 6VXX and 6VYB as templates. These engineered structures contain the GSAS sequence in the Furin cleavage site as well as two prolines in positions 986 and 987. In order to minimize potential artefacts in the calculations due to modelling errors, we chose to model all mutations and perform all subsequent calculations using the above engineered structures and sequences unless otherwise noted. That is to say, when we refer to the wild type SARS-CoV-2 Spike protein in our calculations, it is the Spike protein with the above alterations in the Furin recognition site as well as the pair of prolines. The alternative would have required modelling 6 additional mutations to ‘de-engineer’ the structures of the open and closed states.

For the parameter fitting used in the calculation of occupancies, we utilized the following experimentally determined structures for which occupancy data exist (acronyms described in Results): S-GSAS/WT: 7KDG, 7KDH; S-GSAS/D614G: 7KDI, 7KDJ [30]; S-R/x2: 6ZOX; S-R/PP/x1: 6ZOY, 6ZOZ; S-R: 6ZP0; S-R/PP: 6ZP1, 6ZP2 [34].

2.2 Dynamic analyses

We analysed dynamic properties of the Spike protein with ENCoM [25]. ENCoM employs a potential energy function that includes a pairwise atom-type non-bonded interaction term and thus makes it possible to consider the effect of the specific nature of amino acids on dynamics. NMA explores protein vibrations around an equilibrium conformation by calculating the eigenvectors and eigenvalues associated with different normal modes [35-37]. Representing each protein residue as a single point, for a given conformation of a protein with N amino acids, we obtain 3N − 6 nontrivial eigenvectors. Each eigenvector represents a linear, harmonic motion of the entire protein in which each amino acid moves along a unique 3-dimensional Euclidean vector. The associated eigenvalues rank the eigenvectors in terms of energetic accessibility, lower values corresponding to global, more easily accessible motions.

NMA calculations allow us to computationally estimate b-factors associated with the protein structure, as shown in Equation 1 for the i^th residue, which in turn are related to local flexibility. Higher predicted b-factors denote more flexible positions. Individually calculated b-factors are combined in a vector for a protein sequence or part thereof and called Dynamic Signature.
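
The following is a minimal sketch of how a Dynamic Signature could be assembled from normal modes. It assumes the usual NMA estimate, in which the fluctuation of residue i is the sum over nontrivial modes of its squared displacement divided by the mode's eigenvalue; the function name, the flat coordinate layout and the toy numbers are hypothetical and are not taken from ENCoM or NRGTEN.

```python
from typing import List, Sequence


def predicted_b_factors(eigenvalues: Sequence[float],
                        eigenvectors: Sequence[Sequence[float]]) -> List[float]:
    """Return one predicted fluctuation value per residue.

    eigenvalues[j]  : eigenvalue of nontrivial mode j (must be > 0)
    eigenvectors[j] : mode j as a flat list [x1, y1, z1, x2, y2, z2, ...]
    """
    n_residues = len(eigenvectors[0]) // 3
    signature = [0.0] * n_residues
    for lam, mode in zip(eigenvalues, eigenvectors):
        for i in range(n_residues):
            x, y, z = mode[3 * i: 3 * i + 3]
            # Low eigenvalues (soft modes) contribute more to flexibility.
            signature[i] += (x * x + y * y + z * z) / lam
    return signature


# Toy input (hypothetical): two residues, two modes.
toy_eigenvalues = [0.5, 2.0]
toy_modes = [
    [1.0, 0.0, 0.0, 0.0, 0.0, 0.0],  # soft mode moving residue 0 only
    [0.0, 0.0, 0.0, 0.0, 1.0, 0.0],  # stiff mode moving residue 1 only
]
dynamic_signature = predicted_b_factors(toy_eigenvalues, toy_modes)
print(dynamic_signature)  # residue 0 is predicted to be the more flexible one
```

In this toy case the residue displaced by the soft mode receives the larger value, which is the sense in which higher predicted b-factors denote more flexible positions.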

The eigenvectors and associated eigenvalues can also be used to obtain a variable called vibrational entropy that can be used to compare the relative stability of two states. For example, by measuring the difference of vibrational entropy (ΔS_vib) between a mutant and a wild type (WT), one can calculate how much a mutation affects the overall flexibility and stability of the mutant relative to the WT. The ΔS_vib value predicted by ENCoM is positive when the mutation makes the protein more flexible and negative when the mutation makes the protein more rigid. The differences between the ΔS_vib values for closed and open states were calculated for each mutant (ΔΔS_vib = ΔS_{vib (open)} – ΔS_{vib (closed)}) in order to evaluate individual mutations according to a single score. Vibrational Entropy calculations are dependent on the thermodynamic β factor, that for pseudo-physical models such as ENCoM serves as a scaling factor. This term was optimized to fit experimental Gibbs free energy differences [38] and established as β = 1. The vibrational contribution of the entropic components of the free energy is calculated as described in Eq. 2 [39] in units of J.K^-1, where N is the total number of amino acids in the protein, v_i is the vibrational frequency and K_B is the Boltzmann constant. Equation 3 shows the association between eigenvalues and vibrational frequency.
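
As an illustration of how eigenvalues map to a vibrational entropy, the sketch below uses the standard harmonic-oscillator entropy expression. Two assumptions are made here and are not confirmed by the text: the frequency is taken as ν_i = sqrt(λ_i) / (2π), and Planck's constant is folded into the scaling factor β. The result is returned in units of the Boltzmann constant; the function name and the toy spectra are hypothetical.

```python
import math
from typing import Sequence


def vibrational_entropy(eigenvalues: Sequence[float], beta: float = 1.0) -> float:
    """Harmonic vibrational entropy in units of the Boltzmann constant.

    Multiply the result by the Boltzmann constant to express it in J.K^-1.
    """
    total = 0.0
    for lam in eigenvalues:
        nu = math.sqrt(lam) / (2.0 * math.pi)  # assumed eigenvalue-to-frequency relation
        x = beta * nu                          # assumption: Planck's constant absorbed in beta
        # x / (e^x - 1) - ln(1 - e^-x), written with expm1 for numerical stability
        total += x / math.expm1(x) - math.log(-math.expm1(-x))
    return total


# Toy spectra (hypothetical): the same protein with softer or stiffer modes.
soft_spectrum = [0.5, 1.0, 1.5]
stiff_spectrum = [2.0, 4.0, 6.0]
s_soft = vibrational_entropy(soft_spectrum)
s_stiff = vibrational_entropy(stiff_spectrum)
print(s_soft > s_stiff)
```

Under these assumptions a spectrum with lower eigenvalues yields a larger entropy, so entropy differences between two structures summarise how the whole spectrum shifts.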

The Najmanovich Research Group Toolkit for Elastic Networks (NRGTEN) [38], with the latest implementation of ENCoM, also includes a function to evaluate state occupancies by calculating transition probabilities between different states. A probability P_j of moving along each eigenvector j can be obtained using a Boltzmann distribution given its associated eigenvalue λ _j and a scaling factor γ.
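
A minimal sketch of such a Boltzmann weighting is given below, assuming P_j is proportional to exp(−λ_j / γ) and normalised over all modes. The function name is hypothetical and does not reproduce the NRGTEN API.

```python
import math
from typing import List, Sequence


def mode_probabilities(eigenvalues: Sequence[float], gamma: float) -> List[float]:
    """Boltzmann-like probability of moving along each normal mode."""
    lowest = min(eigenvalues)
    # Subtracting the lowest eigenvalue leaves the normalised result unchanged
    # but avoids numerical underflow when gamma is very small.
    weights = [math.exp(-(lam - lowest) / gamma) for lam in eigenvalues]
    total = sum(weights)
    return [w / total for w in weights]


# Toy eigenvalues (hypothetical), from softest to stiffest mode.
toy_probs = mode_probabilities([1.0, 2.0, 3.0], gamma=1.0)
print(toy_probs)
```

Smaller values of γ concentrate the probability on the lowest-eigenvalue modes, which is why γ has to be tuned for the system under study.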

Let’s consider two conformations A and B of the same protein and the vector E_A→B, which represents the conformational change going from conformation A to conformation B. The overlap between each normal mode M_j computed from conformation A and the E_A→B vector is a value between 0 and 1 describing how well that normal mode recapitulates the conformational change required to go from one state to the other [40].

We can then calculate the transition probability of going from conformation A to conformation B as the weighted sum of the Boltzmann probability P_j of each normal mode M_j times the overlap between that normal mode and the conformational change E_A→B.

The reverse probability P_B→A can be computed in the same fashion, giving an indication of which conformation is favored between the two.
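
The overlap and the weighted sum described above can be sketched as follows. The overlap is assumed here to be the absolute value of the normalised dot product between a mode and the conformational-change vector; function names and toy vectors are hypothetical.

```python
import math
from typing import Sequence


def overlap(mode: Sequence[float], change: Sequence[float]) -> float:
    """Absolute cosine between a normal mode and the conformational change (0 to 1)."""
    dot = sum(m * c for m, c in zip(mode, change))
    norm_mode = math.sqrt(sum(m * m for m in mode))
    norm_change = math.sqrt(sum(c * c for c in change))
    return abs(dot) / (norm_mode * norm_change)


def transition_probability(mode_probs: Sequence[float],
                           modes: Sequence[Sequence[float]],
                           change: Sequence[float]) -> float:
    """Sum over modes of (probability of the mode) x (overlap with the change)."""
    return sum(p * overlap(mode, change) for p, mode in zip(mode_probs, modes))


# Toy example (hypothetical): two orthogonal modes and a change vector A -> B.
toy_modes = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
toy_mode_probs = [0.8, 0.2]
toy_change = [3.0, 4.0, 0.0]
p_a_to_b = transition_probability(toy_mode_probs, toy_modes, toy_change)
print(round(p_a_to_b, 6))
```

For the reverse direction the same functions would be called with the modes computed from conformation B and the change vector pointing from B to A.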

A simple way of computing the occupancies of these conformations from the transition probabilities is to use a Markov model. Each conformation is represented by a state, and the transition probabilities between states are computed as described above. We add a constant k to all states as the probability of staying in that state. Since all states must have outgoing transition probabilities that sum to 1, we normalize these values after the addition of k. For a two-state Markov chain representing the open and closed states of the Spike protein, we obtain the diagram shown in Figure 2. All transition probabilities are computed using ENCoM and Eq. 6. The parameters k and γ need to be optimized for the system being studied as they are not directly coupled to physical quantities because of the pseudo-physical, coarse-grained nature of the ENCoM model. Once the parameters are set, there is a unique equilibrium solution that gives the occupancies of the two states. This approach could be easily generalized to a Markov model with more than two states, where the transition between any two states is computed exactly as described above if that transition is deemed possible.
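
The two-state case can be worked out in closed form. The sketch below is one reading of the procedure, assuming that k acts as an unnormalised self-transition weight and that each state's outgoing weights are then divided by their sum; the function name and the input values are hypothetical.

```python
from typing import Tuple


def two_state_occupancies(p_closed_to_open: float,
                          p_open_to_closed: float,
                          k: float) -> Tuple[float, float]:
    """Return equilibrium occupancies (closed, open) of a two-state Markov chain."""
    # Normalised probability of leaving each state once the stay weight k is added.
    leave_closed = p_closed_to_open / (p_closed_to_open + k)
    leave_open = p_open_to_closed / (p_open_to_closed + k)
    # At equilibrium the flux closed -> open equals the flux open -> closed.
    total = leave_closed + leave_open
    occ_closed = leave_open / total
    occ_open = leave_closed / total
    return occ_closed, occ_open


# Toy transition probabilities (hypothetical), with k = 0.5.
leave_c = 0.1 / (0.1 + 0.5)
leave_o = 0.3 / (0.3 + 0.5)
occ_closed, occ_open = two_state_occupancies(0.1, 0.3, k=0.5)
print(f"closed={occ_closed:.3f} open={occ_open:.3f}")
```

Because the closed-to-open probability is the smaller of the two in this toy input, the closed state ends up with the larger occupancy.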

Figure 2.

Two-state Markov chain of spike protein conformations

3. Results and Discussion

3.1. Dynamic Signature of different Spike variants

3.1.1. G614 and D614 dynamic comparison

An important event in the progression of the COVID-19 pandemic was the appearance of the D614G variant in mid-February 2020 in Europe. The fast spread of this variant raised the possibility that this mutation conferred advantages relative to other forms of the virus in circulation at the time [41,42]. Studies revealed that the variant indeed has greater infectivity, triggering higher viral loads [43,44]. Several hypotheses have emerged to explain the mechanisms behind this higher infectivity, primarily focused on possible effects on the Furin cleavage site [30,45,46], but more recently also considering possible important dynamic differences [44,47,48].

In order to test if Dynamic Signatures reveal differences between Spike variants, we analysed the 13741 sequences of the protein available on May 08 in the COVID-19 Viral Genome Analysis Pipeline, enabled by data from GISAID [49,50]. The mutant Spike proteins harboring mutations (Table S1) were modelled in the open and closed states. Dynamic Signatures were calculated for each mutant in both states and clustered (Figure 3). Mutations in positions that had no occupancy in the original templates used for the open and closed states (positions 5, 8, and 1263) were ignored.

Figure 3.

Dynamic Signature clustering for the closed (A) and open (B) state structures for WT and 22 mutants from GISAID (Table S1).

Analysis of the effect of mutations on the Dynamic Signature shows that the D614G mutation produces similar dynamic patterns largely independent of the other mutations accumulated, and dynamic patterns that are distinct from those of the wild type and other mutants in both the open and closed states. The dynamic characteristics of D614G are very specific and cannot be obtained with random mutations (Figure S1, Table S2). Performing the clustering using segments of the Dynamic Signature representing lengths of 100 amino acids identifies a section of the Spike protein, from around position 250 to around position 750, responsible for the unique characteristics that the mutation D614G confers to the dynamics of the Spike protein (data not shown). This section of Spike includes part of the N-Terminal Domain (NTD) and all of the RBD domain.

When checking the difference between the Dynamic Signatures of the wild type D614 and the mutant G614 we observe that for the closed conformation, the pattern tends towards negative values, indicating that this mutation makes the closed state more flexible, especially around the position of the mutation. On the other hand, for the open B chain conformation the pattern is positive for the open RBD, the same chain NTD and the adjacent chain NTD, indicating that this mutation makes these areas of the open conformation more rigid (Figure 4).

Figure 4.

Effects of the D614G mutation on the Dynamic Signature of the closed (purple) and open B chain RBD (blue) structures, measured by the difference between the calculated b-factors of D614 and G614. Chains are represented in different colours, matching the colours used in the structures, and the position of the mutation is marked in yellow. The coordinate axis counts residues linearly through the three Spike subunits.

This result led us to hypothesize that a more flexible closed state would favor the opening of Spike and that a more rigid open state would disfavor its closing, thus shifting the conformational equilibrium towards the open state and favouring interaction with ACE2, leading to increased cell entry. Mutating position 614 to every other amino acid, we observe a correlation in the closed state between residue size and flexibility. Namely, smaller amino acids tend to make the closed state more flexible. However, we do not observe the opposite effect on the open state. Mutation of D614 to Glutamine, which is similar to Aspartate, barely shows any effect. Nevertheless, we can see that other amino acids have a similar effect as Glycine, such as Proline and Threonine (Figure S2).

3.1.2. Comparison of the Dynamic Signatures of Spike from SARS-CoV and SARS-CoV-2

It has been previously observed that RBD flexibility in SARS-CoV influences binding to ACE2 and facilitates fusion with host cells [51]. Thus, considering the lesser infectivity of SARS-CoV relative to SARS-CoV-2 and our aforementioned results for the D614G mutation, we expected the SARS-CoV Spike to be more rigid in the closed state and more flexible in the open state relative to Spike from SARS-CoV-2. This is indeed the case (Figure 5). The dynamic signature values of SARS-CoV are smaller than those of SARS-CoV-2 in several areas throughout the closed structure, indicating that when in the closed state, the SARS-CoV Spike protein is more rigid. For the open state we can see that SARS-CoV open RBD and adjacent NTD are significantly more flexible than for SARS-CoV-2 Spike.

Figure 5.

Comparison between SARS-CoV-2 and SARS-CoV. Dynamic Signature difference of the closed (purple) and open B chain RBD (blue) between aligned residues of the Spike protein from SARS-CoV-2 and SARS-CoV, with SARS-CoV chains represented in the top bar and equivalent colors in the structures and SARS-CoV-2 chains represented in the bar just below. The coordinate axis counts residues linearly through the three Spike subunits.

3.2. Vibrational entropy

It is possible to combine the trend of a Dynamic Signature into a single value to represent the overall flexibility of any given mutation and compare it to the WT. This can be achieved with ΔS_vib, calculated with Eq. 2 for each state (see materials and methods). For any given state, positive ΔS_vib values represent mutants that relative to the wild type make the protein more rigid, whereas negative values of ΔS_vib describe mutations that cause the protein to be more flexible in the given state relative to the wild type. In the case of the mutation D614G, we obtain ΔS_{vib (open)} = 5.26×10⁻² J.K^-1 and ΔS_{vib (closed)} = −9.27×10⁻² J.K^-1 with a ΔΔS_vib (calculated as ΔS_{vib (open)} – ΔS_{vib (closed)}) of 1.45×10⁻¹ J.K^-1.
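
As a quick check of the arithmetic, the short sketch below recomputes the ΔΔS_vib of D614G from the two ΔS_vib values quoted above. The function name is hypothetical; the numbers are those given in the text.

```python
def delta_delta_s_vib(ds_open: float, ds_closed: float) -> float:
    """Difference score: dS_vib(open) minus dS_vib(closed), in J.K^-1."""
    return ds_open - ds_closed


# Values quoted in the text for D614G (J.K^-1).
d614g_open = 5.26e-2
d614g_closed = -9.27e-2
d614g_score = delta_delta_s_vib(d614g_open, d614g_closed)
print(f"{d614g_score:.2e}")
```

The result, 1.453×10⁻¹ J.K^-1, agrees with the reported value of 1.45×10⁻¹ J.K^-1 to the precision given.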

We generated in silico the 19 possible single mutations in each position from residue 14 to residue 913 and calculated ΔS_{vib (open)}, ΔS_{vib (closed)} and ΔΔS_vib. Other positions were ignored due to uncertainties in modelling or the fact that they are not expected to have a pronounced effect on dynamics [23]. It should be noted that Spike cannot accommodate the vast majority of such single mutations, particularly in its core, as these would lead to unstable or misfolded conformations. However, those that occur near the surface are more likely to represent single residue variations of the Spike protein that lead to a stable, correctly folded protein. Therefore, the stability of specific mutations highlighted in this work, unless otherwise stated (such as those already observed experimentally or within the RBD domain as stated below), needs to be validated experimentally.

The heatmap in Figure 6A shows ΔS_vib values associated with mutations on the closed conformational state (left) and open conformational state (right). Lighter colors represent high ΔS_vib values, meaning that the specific mutant is more flexible than the WT, and darker colors represent low ΔS_vib values, meaning that the specific mutant is more rigid than the WT. The second heatmap (Figure 6B) shows ΔΔS_vib values, or Difference Scores, highlighting positions and specific mutations with great contrast between their effect on the open and closed states. In this representation, blue mutants are more rigid in the closed state and more flexible in the open state, therefore candidates for less infectious mutants, and red mutants are more flexible when closed and more rigid when open, candidates for more infectious mutants.

Figure 6.

Heatmaps representing the values of ΔS_vib for the closed structure (A, left-hand section) and for the open structure (A, right-hand section), and the values of ΔΔS_vib (B) for every possible mutant in Spike from positions 14 to 913. Each column represents one of the 20 amino acids (repeated in the left heatmap). Notice that for each position (represented in a row), one particular column represents the value of the WT amino acid found at that position. Higher values of ΔS_vib are represented in yellow and lower values in dark purple. Higher values of ΔΔS_vib are represented in red and lower values in blue. The domain structure of Spike is represented in (C) for reference purposes.

In Figure 7 we map ΔΔS_vib values (Figure 6B) on the structure of Spike, colored according to the median value for each position with the same color scheme as the heatmap. From the 17081 single mutations considered, we show the top 64 mutants with ΔΔS_vib>0.3 (Tables 1 and S3) as well as the bottom 20 in terms of ΔΔS_vib values (Table S3). The mutants with predicted open state occupancy higher than that of the wild type are presented in Table 1. The Dynamic Signature comparison for 3 of the most infectious candidates (Figure 8A) and 3 of the least infectious candidates (Figure 8B) shows some of the patterns that could lead to a greater or lesser effect on infectivity. For instance, in Figure 8A we can see that high scores can come from a large flexibility of the closed state, a very large rigidity of the open state, or a contribution of both. We can also observe that these effects can be different in each chain and can affect more the NTD, the RBD, or both. Finally, these single mutants also show how a point mutation can have widespread impacts across the protein.

Table 1.

Putative mutations, their associated ΔΔS_vib (ΔS_{vib (open)} – ΔS_{vib (closed)}, in units of J.K^-1) and predicted occupancies for the open and closed states for the mutants with predicted open-state occupancy higher than that of the wild type. Predicted occupancy values are shown for the open conformation, the closed conformation, and the difference between the two (closed – open). The data for the remaining mutants with occupancy below that of the wild type but ΔΔS_vib>0.3 (red) as well as those with the lowest predicted ΔΔS_vib values (blue) are presented in Table S3.

Figure 7.

ΔΔS_vib scores represented in the structure of Spike from two angles according to the median value for each position and the same color scheme as in the Difference Score Heatmap (Fig 6B).

Figure 8.

Dynamic Signature differences for three mutations among the top ΔΔS_vib scores (A) and bottom ΔΔS_vib scores (B) for closed (purple) and open (blue) B chain RBD structures. The coordinate axis counts residues linearly through the three Spike subunits.

3.3. Conformational state occupancies

We calculated forward and reverse transition probabilities between the open and closed states (Eq. 4, 5 & 6) from the calculated normal modes and used the Markov model described in Materials and Methods to calculate the equilibrium occupancies for each state in wild type and mutant Spike proteins. It is unclear if any additional conformational states other than those with either all three RBD domains in the closed state or only one RBD open state are biologically relevant. Specifically, Yurkovetskiy et al. [44] observed an occupancy for states with two or three RBD domains in the open conformation, but these were not observed by Gobeil et al. [30] and Xiong et al. [34] or taken into consideration in several other structural studies [20-24]. As such, we employ the two-state model shown in Figure 2, with one state representing all three RBD domains closed and the second state representing one RBD open. We calculated the robustness of this Markov occupancy model utilizing 60 different reconstructed structures, varying the positions of loops and with minor differences in the core structure, representing the closed state and the open state for each chain. The results are equivalent no matter what specific structural template is used to represent each of the two states above.

The Markov model calculation of occupancies requires two parameters (see Materials and Methods) that were optimized based on experimental data for six Spike variants. These variants were: S-GSAS/D614, an engineered Spike with the sequence GSAS in the furin cleavage site and no 614 mutation; S-GSAS/G614, with the same furin site modifications and the D614G mutation [30]; S-R, the Spike protein with original furin site RRAR; S-R/x2, with added S383C, D985C mutations inducing a disulfide bond; S-R/PP, engineered with two prolines in positions 986 and 987; S-R/PP/x1, in which from the double prolines sequence the mutations G413C, V987C were performed to induce a disulfide bond [34]. It is worth stressing that all 6 variants used to calibrate the two parameters affecting the occupancy were modelled on the same open and closed state conformations. All differences in observed occupancies and the agreement with experimental occupancy data came about as a consequence of the effect of the mutations on the normal modes and derived transition probabilities and not as a result of structural differences between variants. We obtained a good fitting to the experimental results with k and γ of 0.5 and 0.001, respectively (Pearson correlation = 0.89, p-value = 1.94×10⁻²). Predicted occupancies of the open and closed states for each of the six variants above, as well as the experimental data, are presented in Table 2.

Table 2.

Experimental and predicted occupancies for the open and closed states and their difference for multiple SARS-CoV-2 variants. Experimental values obtained from Gobeil et al. [30] and Xiong et al. [34].

We utilized these data to calculate occupancy differences for each variant (Figure 9). The range of variation of our predicted occupancies is small compared to that of experimental values. We believe that given the limitations of our coarse-grained model as well as additional phenomena that ultimately affect occupancy, our predictions reflect only a fraction of the myriad of factors contributing to the occupancy. Nonetheless, our predictions correctly capture the pattern of relative variations of occupancy observed in the experimental data. To ensure that the calculated correlation is not due to chance, we simulated random sets of occupancies for the 6 sequence variants and calculated simulated correlations for the 110 different combinations of k and γ to determine if the observed correlations represent an actual signal in the data or could be randomly obtained with different values for the parameters k and γ. We observed a marked shift towards higher correlations for the data representing our predicted occupancies when compared to the Gaussian noise data (Figure S3), suggesting that the predicted occupancies are not due to chance.

Figure 9.

Difference in the occupancies for the open and the closed states (open – closed) for six variants of the Spike protein. Experimental values are represented on the Y-axis and the predicted values in the scale on the X-axis. Predicted values for the parameters k = 0.5, γ = 0.001. Represented linear fit of Experimental = 192.011*Predicted + 92.9013. Errors on the experimental measurements are not known.

The computational resources needed for the calculation of occupancies for all 8250 mutations with ΔΔS_vib>0 are beyond our current capabilities. We set a threshold of ΔΔS_vib>0.3 to select candidates for the calculation of occupancies. This threshold corresponds to 64 mutations (Table 1, in red). Using the parameters k and γ obtained above, we calculated occupancies for these 64 mutants as well as the 20 mutants with lowest ΔΔS_vib values (Table 1, in blue). In Figure 10A we show the difference in occupancy between the open and closed states using a non-linear scale adapted to better show the results around the wild type occupancy. Whereas ΔΔS_vib values for particular mutations may hint at a more flexible closed state and more rigid open state, this is a global measure that may not reflect the necessary pattern of flexibility across the structure that leads to effective transition probabilities between the open and closed states. Yet, for the most part, ΔΔS_vib can predict the shifts in occupancy, showing a clear distinction between the 64 mutants predicted using ΔΔS_vib as shifting occupancy towards the open state and the 20 mutants predicted to shift the equilibrium towards the closed state (p-value=2.04×10⁻⁶). Figure 10B shows the location in the structure of the mutants in Table 1. We can see that the least infectious candidates (blue) are positioned in the interfaces between NTD and RBD domains, while the most infectious candidates, especially the ones validated by the occupancy prediction (dark red), are more concentrated in the interfaces between different RBD domains.

Figure 10.

(A) Difference in the occupancies for the open and the closed states for the top 64 mutants with ΔΔS_vib>0.3 (red) and the 20 mutants with lower ΔΔS_vib scores (blue). Occupancy difference for the WT is represented by the dashed green line. Y-axis based on the transformation of a symmetric logarithmic scale. (B) Two visualizations of the 6VYB structure highlighting the mutations. The bottom 20 mutant positions are marked in two shades of blue, with the darker shade indicating positions in which at least one mutant had an (open – closed) occupancy value smaller than wild type. The top 64 mutant positions are marked in two shades of red, with the darker shade indicating positions in which at least one mutant had an (open – closed) occupancy value higher than wild type.
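The caption does not give the exact transformation used for the Y-axis. As an illustration only, one common form of a symmetric logarithmic transform is sketched below; the function name `symlog` and the parameter `linthresh` are assumptions introduced here, not details taken from the figure.

```python
import math


def symlog(value, linthresh=1.0):
    """Symmetric log transform (one common form, assumed here).

    Behaves almost linearly for |value| much smaller than linthresh and
    logarithmically for larger magnitudes, while preserving the sign.
    """
    return math.copysign(math.log10(1.0 + abs(value) / linthresh), value)


# Small differences stay distinguishable while large ones are compressed.
for difference in (-90.0, -9.0, -0.5, 0.0, 0.5, 9.0, 90.0):
    print(difference, symlog(difference))
```

A transform of this kind spreads out values close to zero and compresses large positive and negative differences, which is the stated purpose of the non-linear scale.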

Residue G252 stands out as capable of accommodating a large number of mutations (C, D, E, H, M, P, Q, S, T, W) that shift the occupancy in favour of the open state. The fact that variants in this position do not seem to be prevalent in outbreaks to date points to the possibility that this position may be under additional functional constraints that prevent the emergence of variants. A number of other Glycine residues could also accept mutations that we predict to increase the occupancy of the open state: G72W; G404W; G413M; G416E,W; and G404I. In fact, three of the top four mutations are mutations on Glycine. A number of other potential mutations are adjacent to the Glycine residues above, namely R403S and K417D,E,G,P. Additionally, D467P,W and I468T are also at positions adjacent to others that can accommodate mutations that may lead to a conformational shift favouring the open state. The mutation that favours the open state the most in our calculations is N501W with ΔS_{vib (open)} = 6.02×10⁻¹ J.K^-1 and ΔS_{vib (closed)} = 2.30×10⁻¹ J.K^-1 and a resulting ΔΔS_vib value of 3.72×10⁻¹ J.K^-1, leading to occupancies compared to those of the wild type (in parentheses) of 62.7% (25.8%) and 37.3% (74.2%) for the open and closed states, respectively. It is important to stress, as discussed in Methods, that the calculations are performed using structures containing a modified Furin recognition site and prolines in positions 986 and 987. Furthermore, the contribution of vibrational entropy changes is one among potentially several effects whose overall importance remains to be determined. Therefore, relative changes in occupancy are relevant whereas the specific values are less so.

The COG-UK consortium (https://www.cogconsortium.uk/about/) monitors the appearance and spread of new strains of SARS-CoV-2. COG-UK recently detected a strain containing the mutation N501Y that has been observed to be spreading rapidly at the time of writing. We believe that shifts in occupancy may be in part responsible for its emergence. According to our calculations, the N501Y mutant shows ΔS_{vib (open)} = −1.60×10⁻² J.K^-1 and ΔS_{vib (closed)} = 2.37×10⁻¹ J.K^-1, with ΔΔS_vib = 2.53×10⁻¹ J.K^-1. The predicted occupancies for the N501Y mutant compared to those of the wild type (in parentheses) are 54.3% (25.8%) and 45.7% (74.2%) for the open and closed states, respectively. Therefore, the N501Y mutant shows a marked increase of the occupancy of the open state relative to other mutations. Additionally, this mutation was shown to also increase binding affinity to the ACE2 receptor relative to the wild type with a Δlog₁₀ (K_D,app) of 0.24 [52]. Therefore, we predict that N501Y has a strong potential to contribute to increased transmission. The calculations above were performed in the context of D614. However, the double mutant representing the N501Y mutation in the context of G614 also shows an increase in the occupancy of the open state to 35.06%. The recently observed A222V mutation [53], on the other hand, shows no propensity in our analysis to alter the occupancy of states, with a negative ΔΔS_vib of −1.64×10⁻² J.K^-1. Predicted occupancies for A222 and V222 are nearly identical either in the context of D614 (WT) or the mutant containing G614.
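The N501Y figures quoted above can be checked with a short calculation. The sketch assumes that ΔΔS_vib is the closed-state entropy change minus the open-state entropy change, which is the convention that reproduces the reported N501Y value; the function name is introduced here for illustration.

```python
def delta_delta_s_vib(ds_open, ds_closed):
    """ΔΔS_vib under the assumed convention: closed-state change minus open-state change."""
    return ds_closed - ds_open


# Values reported in the text for N501Y (J.K^-1).
ds_open_n501y = -1.60e-2
ds_closed_n501y = 2.37e-1
dds_n501y = delta_delta_s_vib(ds_open_n501y, ds_closed_n501y)
print(f"{dds_n501y:.3f}")  # 0.253

# Reported open-state occupancies (%) for N501Y and the wild type.
open_n501y, open_wt = 54.3, 25.8
shift_points = open_n501y - open_wt
print(f"{shift_points:.1f}")  # 28.5 percentage points towards the open state
```

Under this convention a positive ΔΔS_vib corresponds to a closed state that gains flexibility relative to the open state, which matches the direction of the predicted occupancy shift.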

Notice that N501Y has a ΔΔS_vib value of 2.53×10⁻¹ J.K^-1 that is slightly below the 3.00×10⁻¹ J.K^-1 threshold, suggesting that there may be many other mutations with ΔΔS_vib values below our set threshold that turn out to have augmented occupancies for the open state relative to the wild type.

D614G shows that changes in the occupancy of conformational states can impact infectivity despite no changes or even weaker binding affinities [44]. A recent study [52] on binding and expression of Spike mutations within the RBD domain (positions 331 to 531) shows that several (but not all, see below) of the mutations that we predicted to have increased occupancy of the open state are associated with a decrease of binding affinity with ACE2. Incidentally, the data also shows that the mutations in Table 1 within the RBD produce stable and properly folded Spike proteins. As shown for D614G, infection does not rely on binding affinity alone, and even a strain with higher dissociation rates from ACE2 can bring about fitness advantages.

The mutation N501W is predicted to have the largest effect in augmenting the occupancy of the open state relative to the wild type. This mutation is associated with stronger binding to ACE2 (Δlog₁₀(K_D,app)=0.11) [52] relative to the wild type Spike (but weaker than that of N501Y). Furthermore, N501W appears to have increased expression relative to the wild type with a Δlog(MFI) of 0.1, compared to a decrease in relative expression of −0.14 for N501Y [52]. The authors note that changes in expression correlate with folding stability [52]. However, even with a Δlog(MFI) of −0.14, N501Y is viable and spreading. Therefore, N501W might be even more stable and infective.
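To give a sense of scale for the binding values quoted from [52], a difference in log₁₀ units can be converted into a multiplicative factor. The sketch below assumes, as the text does, that a positive Δlog₁₀(K_D,app) denotes stronger apparent binding than the wild type; the helper name is hypothetical.

```python
def fold_change_from_delta_log10(delta_log10):
    """Convert a difference expressed in log10 units into a fold change."""
    return 10.0 ** delta_log10


# Δlog10(K_D,app) values quoted in the text.
reported = {"N501Y": 0.24, "N501W": 0.11}
fold_changes = {name: fold_change_from_delta_log10(v) for name, v in reported.items()}
for name, fold in fold_changes.items():
    print(name, round(fold, 2))  # N501Y 1.74, N501W 1.29
```

Read this way, both substitutions correspond to less than a two-fold change in apparent affinity, which is consistent with the argument that occupancy shifts may matter alongside binding.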

We consider all mutations with increased predicted occupancy of the open state in Table 1 as good candidates for further experimental validation to better understand the role of binding and dynamics of Spike and their role in SARS-CoV-2 infectivity. Furthermore, we suggest that their appearance in outbreaks should be closely monitored.

3.4. SARS-CoV-2 Variants B.1.1.7 and 501.V2

The mutation N501Y above appears in both the B.1.1.7 variant first observed in the UK [54] and the 501.V2 variant first observed in South Africa [55], both of which are rapidly spreading around the globe. These two strains contain additional mutations in Spike. Namely, B.1.1.7 contains N501Y, A570D, D614G, P681H, T716I, S982A, D1118H and deletions at positions 69, 70 and 144. As the number of normal modes is related to the number of amino acids, we are unable to model deletions while still making comparisons with the wild type strain given the nature of the quantities calculated (Eq. 2 and 6). Therefore, the deletions of three residues at positions 69, 70 and 144 that are present in B.1.1.7 were not modelled here. 501.V2 includes the mutations L18F, D80A, D215G, R246I, K417N, E484K, N501Y, D614G and A701V. The dynamic signatures for both B.1.1.7 and 501.V2 show a strong rigidification of the open state and added flexibility of the closed state (Supplementary Figures S4 and S5, respectively), leading to ΔΔS_vib values of 5.30×10⁻¹ J.K^-1 and 6.45×10⁻¹ J.K^-1 and open state occupancies of 36.2% and 35.8% for B.1.1.7 and 501.V2, respectively. Both variants show an increase in occupancy of approximately 40% relative to the wild type (25.8%). Despite our preference for modelling the smaller number of mutations, and therefore using the engineered structure containing the modified Furin binding site and proline modification, we also modelled B.1.1.7 (except the deletions) and 501.V2 using the original sequence of Spike. In that case we obtain 33.0% and 33.6% occupancy for B.1.1.7 and 501.V2, respectively.
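The "approximately 40%" figure is a relative increase over the wild type occupancy, not a difference in percentage points. A short check using only the occupancies quoted above (the function name is introduced for illustration):

```python
def relative_increase(variant, wild_type):
    """Relative change of the variant value over the wild type value, in percent."""
    return 100.0 * (variant - wild_type) / wild_type


WT_OPEN = 25.8  # wild type open-state occupancy (%), from the text
variant_open = {"B.1.1.7": 36.2, "501.V2": 35.8}
increases = {name: relative_increase(occ, WT_OPEN) for name, occ in variant_open.items()}
for name, value in increases.items():
    print(name, round(value, 1))  # B.1.1.7 40.3, 501.V2 38.8
```

In absolute terms the open-state occupancy rises by about 10 percentage points for both variants.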

3.5. Polyclonal human serum antibody escape

Recently, the Bloom group utilised human serum antibodies from subjects that recovered from COVID-19 and tested mutations in the RBD for their capacity to escape recognition [56], i.e. mutations leading to weaker binding to polyclonal serum antibodies. The patterns of escape vary between subjects, but a number of positions and specific mutations at those positions are relevant to the present study. Positions Y369, N448, F456, Y473 and F486 are noteworthy as specific mutations at these locations not only allow varying levels of escape in particular subjects [56] but also lead to positive values of ΔΔS_vib above a threshold of 0.1 J.K^-1 (Table S4). Among these, the mutation N448G; the Y473 mutations to A, Q and T; and lastly F486E all show occupancy of the open state modestly higher than that of the wild type (Table S4). The mutations noted, by virtue of potentially increasing infectivity as well as displaying varying levels of escape from immune responses, may give the virus an evolutionary edge and therefore should be closely monitored.

3.6. Data Availability

Raw data and structures used to build the images presented here are available in a GitHub repository (https://github.com/nataliateruel/data_Spike). All vibrational entropy results are available for visualisation and analysis through a link to the dms-view open-access tool, available on GitHub [57] through the same URL above. On dms-view, it is possible to visualise the effects of different mutations for each residue of the Spike protein and visualise these on the 3D structure of Spike. Each site has 20 ΔΔS_vib values, one of them being zero (corresponding to the amino acid found in the wild type). The option max shows the top ΔΔS_vib score for each position, that is, the mutation at that position that represents the candidate with the highest predicted infectivity, as defined here in terms of a propensity to higher occupancy of the open state. The option min shows the lowest score for each position and the mutation associated with the least infectious predicted candidate. The option median returns the median score, presenting a general trend for any given position, and var shows the variance between the results for each position, highlighting sites in which mutations to different residues lead to a broader range of ΔΔS_vib values. Furthermore, for the mutations for which occupancy was calculated, the data can be accessed through the same menu. As new occupancy data are calculated, they will be added to this resource. Readers interested in the occupancy of particular mutations not yet available are invited to contact the authors via email or through the GitHub repository. When selecting a specific point on the first panel, it is possible to access all ΔΔS_vib values on the second panel and see the highlighted position in 3D on the structural representation.
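The four per-site summaries described above (max, min, median and var) can be reproduced from the 20 values of a site as sketched below. This is an illustration of the statistics and not the dms-view implementation: the function `summarise_site`, the dictionary layout, the example values and the choice of population variance are all assumptions.

```python
import statistics

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def summarise_site(scores):
    """Summarise one site.

    scores: dict mapping each of the 20 amino acids to its ΔΔS_vib value,
    with the wild type residue scored as 0 (hypothetical layout).
    """
    values = list(scores.values())
    return {
        "max": max(scores.items(), key=lambda kv: kv[1]),  # (residue, score)
        "min": min(scores.items(), key=lambda kv: kv[1]),  # (residue, score)
        "median": statistics.median(values),
        "var": statistics.pvariance(values),  # assumption: population variance
    }


# Hypothetical site whose wild type residue is M (score 0); values are arbitrary.
site = {aa: round(0.02 * (i - 10), 2) for i, aa in enumerate(AMINO_ACIDS)}
summary = summarise_site(site)
print(summary["max"], summary["min"], summary["median"], summary["var"])
```

Returning the residue together with the score for max and min mirrors the description in the text, where these options identify which substitution is the most or least favourable at a position.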

The Najmanovich Research Group Toolkit for Elastic Networks (NRGTEN) including the latest ENCoM implementation is freely available at (https://github.com/gregorpatof/nrgten_package).

4. Conclusions

SARS-CoV-2 mutations are still arising and spreading around the world. The A222V mutation, reportedly responsible for many infections, emerged in Spain during the summer of 2020 and has since spread to neighbouring countries [53]. In Denmark, new strains related to SARS-CoV-2 transmission in mink farms were confirmed in early October by the WHO and shown to be caused by specific mutations not previously observed, with the novelty of back-and-forth transmission between minks and humans [58]. A new strain containing N501Y first appeared in the UK and is on the rise worldwide at the time of writing. Such occurrences point to the possibility that new mutations in SARS-CoV-2 may bring about more infectious strains.

Using the methods described in this paper, it is possible to predict potential variants that might have an advantage over the wild type virus, insofar as these are the result of changes in occupancy of states and within the limitations of the simplified coarse-grained model employed here. In our analyses, flexibility properties and conformational state occupancy probabilities contribute to the infectivity of SARS-CoV-2. Our results explain the behaviour of the D614G strain and the increased infectivity of SARS-CoV-2 relative to SARS-CoV, and offer a possible explanation for the rise of new strains such as those harboring the N501Y mutation.

The results we present on SARS-CoV-2 Spike mutations have several limitations. First and foremost, some of the in silico mutations discussed may not be thermodynamically stable, or may affect expression, cleavage, or binding to ACE2. In addition, our approach does not consider that Spike is, in fact, a glycoprotein and that the sugar molecules may have an effect on dynamics. However, the remarkable agreement between our model and experimental observations shows that the simplified model of Spike and the coarse-grained methods used here allow us to calculate dynamic properties of Spike that are relevant to understand infection and epidemiological behavior. It is important to keep in mind that all of the mutations that we discuss in Table 1 that lie between positions 331 and 531 within the RBD were already experimentally validated and are viable [52]. However, we highlight the need for experimental validation of our predictions, particularly for those candidates that we believe would help elucidate the extent of the effect of the conformational dynamics of Spike on infectivity. Beyond in vitro biophysical studies, experimental alternatives exist such as using pseudo-type viruses or virus-like-particles that would not require studying gain-of-function mutations using intact viruses. Alternatively, loss-of-function mutations can be created with intact viruses and compared to the wild type SARS-CoV-2 virus to validate the role of dynamics on infectivity.

To the best of our knowledge, this is the first time that a Normal Mode Analysis method is used to model the effect of mutations on the occupancy of conformational states, opening a new opportunity in computational biophysics to create dynamic models of transitions between conformational states of proteins based on physical properties and sensitive to sequence variations. We hope that our results help public health surveillance programs decide on the risk posed by new strains, inform the research community in understanding SARS-CoV-2 infection mechanisms and open new possibilities in computational biophysics to study protein dynamics.

Acknowledgements

OM is the recipient of a PhD fellowship from the Fonds de Recherche du Québec − Nature et Technologie (FRQ-NT). RN is a Fonds de Recherche du Québec - Santé (FRQ-S) Senior Fellow, a member of the Réseau Québécois de Recherche sur les Médicaments (RQRM) and the Quebec Network for Research on Protein Function, Engineering and Applications (PROTEO). The authors would like to dedicate this work to the memory of Mordechai Najmanovich, Z”L, father of RN, who passed away from complications due to COVID-19 on November 26, 2020. RN would like to thank all healthcare workers, particularly ICU nurses and physicians at the Avista Adventist Hospital in Louisville, Colorado, for their efforts.

Footnotes

1. We updated the abstract.
2. We added redistributed text into a new section 3.4 to clarify the text.
3. We added a new section 3.5 with new data.
4. We updated the data in the introduction regarding the most recent numbers of COVID-19 infections and deaths worldwide.
5. We added two figures and a table to the supplementary data.
6. We removed the supplementary data from the main manuscript file into its own file.
https://github.com/nataliateruel/data_Spike
https://github.com/gregorpatof/nrgten_package

References

1. Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395: 565–574. doi:10.1016/S0140-6736(20)30251-8
2. Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. Nature Publishing Group; 2020;579: 270–273. doi:10.1038/s41586-020-2012-7
3. World Health Organization. Weekly epidemiological update - 5 January 2021 [Internet]. 2021 Jan. Available: https://www.who.int/publications/m/item/weekly-epidemiological-update5-january-2021
4. US Center for Disease Control. SARS Basics Fact Sheet [Internet]. [cited 4 Jan 2021]. Available: https://www.cdc.gov/sars/about/fs-sars.html
5. Wang L-F, Shi Z, Zhang S, Field H, Daszak P, Eaton BT. Review of bats and SARS. Emerging Infect Dis. 2006;12: 1834–1840. doi:10.3201/eid1212.060401
6. Memish ZA, Perlman S, Van Kerkhove MD, Zumla A. Middle East respiratory syndrome. Lancet. 2020;395: 1063–1077. doi:10.1016/S0140-6736(19)33221-0
7. Haagmans BL, Dhahiry Al SHS, Reusken CBEM, Raj VS, Galiano M, Myers R, et al. Middle East respiratory syndrome coronavirus in dromedary camels: an outbreak investigation. Lancet Infect Dis. 2014;14: 140–145. doi:10.1016/S1473-3099(13)70690-X
8. Zhao Z, Li H, Wu X, Zhong Y, Zhang K, Zhang Y-P, et al. Moderate mutation rate in the SARS coronavirus genome and its implications. BMC Evol Biol. 2004;4: 21. doi:10.1186/1471-2148-4-21
9. Koyama T, Platt D, Parida L. Variant analysis of SARS-CoV-2 genomes. Bull World Health Organ. 2020;98: 495–504. doi:10.2471/BLT.20.253591
10. Letko M, Marzi A, Munster V. Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses. Nat Microbiol. Nature Publishing Group; 2020;5: 562–569. doi:10.1038/s41564-020-0688-y
11. Yan R, Zhang Y, Li Y, Xia L, Guo Y, Zhou Q. Structural basis for the recognition of the SARS-CoV-2 by full-length human ACE2. Science. American Association for the Advancement of Science; 2020;: eabb2762. doi:10.1126/science.abb2762
12. Shang J, Wan Y, Luo C, Ye G, Geng Q, Auerbach A, et al. Cell entry mechanisms of SARS-CoV-2. Proc Natl Acad Sci U S A. National Academy of Sciences; 2020;117: 11727–11734. doi:10.1073/pnas.2003138117
13. Selvaraj C, Dinesh DC, Panwar U, Abhirami R, Boura E, Singh SK. Structure-based virtual screening and molecular dynamics simulation of SARS-CoV-2 Guanine-N7 methyltransferase (nsp14) for identifying antiviral inhibitors against COVID-19. J Biomol Struct Dyn. Taylor & Francis; 2020;57: 1–12. doi:10.1080/07391102.2020.1778535
14. Ali A, Vijayan R. Dynamics of the ACE2-SARS-CoV-2/SARS-CoV spike protein interface reveal unique mechanisms. Sci Rep. Nature Publishing Group; 2020;10: 14214–12. doi:10.1038/s41598-020-71188-3
15. Suárez D, Díaz N. SARS-CoV-2 Main Protease: A Molecular Dynamics Study. J Chem Inf Model. American Chemical Society; 2020;60: 5815–5831. doi:10.1021/acs.jcim.0c00575
16. Pinto D, Park Y-J, Beltramello M, Walls AC, Tortorici MA, Bianchi S, et al. Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. Nature. Nature Publishing Group; 2020;583: 290–295. doi:10.1038/s41586-020-2349-y
17. Rogers TF, Zhao F, Huang D, Beutler N, Burns A, He W-T, et al. Isolation of potent SARS-CoV-2 neutralizing antibodies and protection from disease in a small animal model. Science. 2020;369: 956–963. doi:10.1126/science.abc7520
18. Cao Y, Su B, Guo X, Sun W, Deng Y, Bao L, et al. Potent Neutralizing Antibodies against SARS-CoV-2 Identified by High-Throughput Single-Cell Sequencing of Convalescent Patients’ B Cells. Cell. 2020;182: 73–84.e16. doi:10.1016/j.cell.2020.05.025
19. Deganutti G, Prischi F, Reynolds CA. Supervised molecular dynamics for exploring the druggability of the SARS-CoV-2 spike protein. J Comput Aided Mol Des. Springer International Publishing; 2020;20: 1015–13. doi:10.1007/s10822-020-00356-4
20. Arantes PR, Saha A, Palermo G. Fighting COVID-19 Using Molecular Dynamics Simulations. ACS central science. American Chemical Society; 2020.: 1654–1656. doi:10.1021/acscentsci.0c01236
21. Karathanou K, Lazaratos M, Bertalan É, Siemers M, Buzar K, Schertler GFX, et al. A graph-based approach identifies dynamic H-bond communication networks in spike protein S of SARS-CoV-2. J Struct Biol. 2020;212: 107617. doi:10.1016/j.jsb.2020.107617
22. Melero R, Sorzano COS, Foster B, Vilas J-L, Martínez M, Marabini R, et al. Continuous flexibility analysis of SARS-CoV-2 spike prefusion structures. IUCrJ. 2020;7: 1059–1069. doi:10.1107/S2052252520012725
23. Verkhivker GM. Molecular Simulations and Network Modeling Reveal an Allosteric Signaling in the SARS-CoV-2 Spike Proteins. J Proteome Res. American Chemical Society; 2020;19: 4587–4608. doi:10.1021/acs.jproteome.0c00654
24. Majumder S, Chaudhuri D, Datta J, Giri K. Exploring the intrinsic dynamics of SARS-CoV-2, SARS-CoV and MERS-CoV spike glycoprotein through normal mode analysis using anisotropic network model. J Mol Graph Model. 2021;102: 107778. doi:10.1016/j.jmgm.2020.107778
25. Frappier V, Najmanovich RJ. A coarse-grained elastic network atom contact model and its use in the simulation of protein dynamics and the prediction of the effect of mutations. MacKerell AD, editor. PLoS Comput Biol. 2014;10: e1003569. doi:10.1371/journal.pcbi.1003569
26. Frappier V, Chartier M, Najmanovich RJ. ENCoM server: exploring protein conformational space and the effect of mutations on protein function and stability. Nucleic Acids Res. 2015;43: W395–400. doi:10.1093/nar/gkv343
27. Frappier V, Chartier M, Najmanovich R. Applications of Normal Mode Analysis Methods in Computational Protein Design. Methods Mol Biol. 2017;1529: 203–214. doi:10.1007/978-1-4939-6637-0_9
28. Frappier V, Najmanovich RJ. Vibrational entropy differences between mesophile and thermophile proteins and their use in protein engineering. Protein Science. 2015;24: 474–483. doi:10.1002/pro.2592
29. Walls AC, Park Y-J, Tortorici MA, Wall A, McGuire AT, Veesler D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell. 2020;181: 281–292.e6. doi:10.1016/j.cell.2020.02.058
30. Gobeil S, Janowska K, McDowell S, Mansouri K, Parks R, Manne K, et al. D614G mutation alters SARS-CoV-2 spike conformational dynamics and protease cleavage susceptibility at the S1/S2 junction. bioRxiv. Cold Spring Harbor Laboratory; 2020;74: 531. doi:10.1101/2020.10.11.335299
31. Yuan Y, Cao D, Zhang Y, Ma J, Qi J, Wang Q, et al. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nat Commun. Nature Publishing Group; 2017;8: 15092–9. doi:10.1038/ncomms15092
32. Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993;234: 779–815. doi:10.1006/jmbi.1993.1626
33. Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Res. 2005;33: W382–8. doi:10.1093/nar/gki387
34. Xiong X, Qu K, Ciazynska KA, Hosmillo M, Carter AP, Ebrahimi S, et al. A thermostable, closed SARS-CoV-2 spike protein trimer. Nat Struct Mol Biol. Nature Publishing Group; 2020;27: 934–941. doi:10.1038/s41594-020-0478-5
35. Wako H, Endo S. Normal mode analysis as a method to derive protein dynamics information from the Protein Data Bank. Biophys Rev. Springer Berlin Heidelberg; 2017;9: 877–893. doi:10.1007/s12551-017-0330-2
36. Tang Q-Y, Kaneko K. Long-range correlation in protein dynamics: Confirmation by structural data and normal mode analysis. de Groot BL, editor. PLoS Comput Biol. 2020;16: e1007670. doi:10.1371/journal.pcbi.1007670
37. Cui Q, Bahar I. Normal Mode Analysis. CRC Press; 2006.
38. Mailhot O, Najmanovich R. The NRGTEN Python package: an extensible toolkit for coarse-grained normal mode analysis of proteins, nucleic acids, small molecules and their complexes. arXiv.org. 2020.
39. Xu B, Shen H, Zhu X, Li G. Fast and accurate computation schemes for evaluating vibrational entropy of proteins. Journal of Computational Chemistry. John Wiley & Sons, Ltd; 2011;32: 3188–3193. doi:10.1002/jcc.21900
40. Marques O, Sanejouand YH. Hinge-bending motion in citrate synthase arising from normal mode calculations. PROTEINS: Structure, Function and Genetics. Wiley Subscription Services, Inc., A Wiley Company; 1995;23: 557–560. doi:10.1002/prot.340230410
41. Volz E, Hill V, McCrone JT, Price A, Jorgensen D, O’Toole Á, et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell. 2020. doi:10.1016/j.cell.2020.11.020
42. Li Q, Wu J, Nie J, Zhang L, Hao H, Liu S, et al. The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity. Cell. 2020;182: 1284–1294.e9. doi:10.1016/j.cell.2020.07.012
43. Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, et al. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell. 2020;182: 812–827.e19. doi:10.1016/j.cell.2020.06.043
44. Yurkovetskiy L, Wang X, Pascal KE, Tomkins-Tinch C, Nyalile TP, Wang Y, et al. Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant. Cell. 2020;183: 739–751.e8. doi:10.1016/j.cell.2020.09.032
45. Mohammad A, Alshawaf E, Marafie SK, Abu-Farha M, Abubaker J, Al-Mulla F. Higher binding affinity of Furin to SARS-CoV-2 spike (S) protein D614G could be associated with higher SARS-CoV-2 infectivity. Int J Infect Dis. 2020. doi:10.1016/j.ijid.2020.10.033
46. Tang L, Schulkins A, Chen C-N, Deshayes K, Kenney JS. The SARS-CoV-2 Spike Protein D614G Mutation Shows Increasing Dominance and May Confer a Structural Advantage to the Furin Cleavage Domain. Preprints; 2020;: 2020050407. doi:10.20944/preprints202005.0407.v1
47. Zhang L, Jackson CB, Mou H, Ojha A, Rangarajan ES, Izard T, et al. The D614G mutation in the SARS-CoV-2 spike protein reduces S1 shedding and increases infectivity. bioRxiv. Cold Spring Harbor Laboratory; 2020;: 2020.06.12.148726. doi:10.1101/2020.06.12.148726
48. Berger I, Schaffitzel C. The SARS-CoV-2 spike protein: balancing stability and infectivity. Cell Res. Nature Publishing Group; 2020;30: 1059–1060. doi:10.1038/s41422-020-00430-4
49. Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. European Centre for Disease Prevention and Control; 2017;22: 957. doi:10.2807/1560-7917.ES.2017.22.13.30494
50. Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall. John Wiley & Sons, Ltd; 2017;1: 33–46. doi:10.1002/gch2.1018
51. Kirchdoerfer RN, Wang N, Pallesen J, Wrapp D, Turner HL, Cottrell CA, et al. Stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis. Sci Rep. Nature Publishing Group; 2018;8: 15701–11. doi:10.1038/s41598-018-34171-7
52. Starr TN, Greaney AJ, Hilton SK, Ellis D, Crawford KHD, Dingens AS, et al. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell. 2020;182: 1295–1310.e20. doi:10.1016/j.cell.2020.08.012
53. Hodcroft EB, Zuber M, Nadeau S, Crawford KHD, Bloom JD, Veesler D, et al. Emergence and spread of a SARS-CoV-2 variant through Europe in the summer of 2020. medRxiv. Cold Spring Harbor Laboratory Press; 2020;15: e1006650. doi:10.1101/2020.10.25.20219063
54. Rambaut A, Loman N, Pybus O, Barclay W, Barrett J, Carabelli A, et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations [Internet]. Dec 2020 [cited 6 Jan 2021]. Available: https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563
55. Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Giandhari J, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. medRxiv. Cold Spring Harbor Laboratory Press; 2020;34: 2020.12.21.20248640. doi:10.1101/2020.12.21.20248640
56. Greaney AJ, Loes AN, Crawford KH, Starr TN, Malone KD, Chu HY, et al. Comprehensive mapping of mutations to the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human serum antibodies. bioRxiv. Cold Spring Harbor Laboratory; 2021;: 2020.12.31.425021. doi:10.1101/2020.12.31.425021
57. Hilton S, Huddleston J, Black A, North K, Dingens A, Bedford T, et al. dms-view: Interactive visualization tool for deep mutational scanning data. JOSS. 2020;5: 2353. doi:10.21105/joss.02353
58. Oude Munnink BB, Sikkema RS, Nieuwenhuijse DF, Molenaar RJ, Munger E, Molenkamp R, et al. Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans. Science. 2020;: eabe5901. doi:10.1126/science.abe5901

Posted January 11, 2021.

Subject Area

Biophysics


[1] 1.↵
Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395: 565–574. doi:10.1016/S0140-6736(20)30251-8
OpenUrl CrossRef PubMed

[2] 2.↵
Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. Nature Publishing Group; 2020;579: 270–273. doi:10.1038/s41586-020-2012-7
OpenUrl CrossRef PubMed

[3] 3.↵
World Health Organization. Weekly epidemiological update - 5 January 2021 [Internet]. 2021 Jan. Available: https://www.who.int/publications/m/item/weekly-epidemiological-update5-january-2021

[4] 4.↵
US Center for Disease Control. SARS Basics Fact Sheet [Internet]. [cited 4 Jan 2021]. Available: https://www.cdc.gov/sars/about/fs-sars.html

[5] 5.↵
Wang L-F, Shi Z, Zhang S, Field H, Daszak P, Eaton BT. Review of bats and SARS. Emerging Infect Dis. 2006;12: 1834–1840. doi:10.3201/eid1212.060401
OpenUrl CrossRef PubMed Web of Science

[6] Memish ZA, Perlman S, Van Kerkhove MD, Zumla A. Middle East respiratory syndrome. Lancet. 2020;395: 1063–1077. doi:10.1016/S0140-6736(19)33221-0

[7] Haagmans BL, Dhahiry Al SHS, Reusken CBEM, Raj VS, Galiano M, Myers R, et al. Middle East respiratory syndrome coronavirus in dromedary camels: an outbreak investigation. Lancet Infect Dis. 2014;14: 140–145. doi:10.1016/S1473-3099(13)70690-X

[8] Zhao Z, Li H, Wu X, Zhong Y, Zhang K, Zhang Y-P, et al. Moderate mutation rate in the SARS coronavirus genome and its implications. BMC Evol Biol. 2004;4: 21. doi:10.1186/1471-2148-4-21

[9] Koyama T, Platt D, Parida L. Variant analysis of SARS-CoV-2 genomes. Bull World Health Organ. 2020;98: 495–504. doi:10.2471/BLT.20.253591

[10] Letko M, Marzi A, Munster V. Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses. Nat Microbiol. Nature Publishing Group; 2020;5: 562–569. doi:10.1038/s41564-020-0688-y

[11] Yan R, Zhang Y, Li Y, Xia L, Guo Y, Zhou Q. Structural basis for the recognition of the SARS-CoV-2 by full-length human ACE2. Science. American Association for the Advancement of Science; 2020;: eabb2762. doi:10.1126/science.abb2762

[12] Shang J, Wan Y, Luo C, Ye G, Geng Q, Auerbach A, et al. Cell entry mechanisms of SARS-CoV-2. Proc Natl Acad Sci U S A. National Academy of Sciences; 2020;117: 11727–11734. doi:10.1073/pnas.2003138117

[13] Selvaraj C, Dinesh DC, Panwar U, Abhirami R, Boura E, Singh SK. Structure-based virtual screening and molecular dynamics simulation of SARS-CoV-2 Guanine-N7 methyltransferase (nsp14) for identifying antiviral inhibitors against COVID-19. J Biomol Struct Dyn. Taylor & Francis; 2020;57: 1–12. doi:10.1080/07391102.2020.1778535

[14] Ali A, Vijayan R. Dynamics of the ACE2-SARS-CoV-2/SARS-CoV spike protein interface reveal unique mechanisms. Sci Rep. Nature Publishing Group; 2020;10: 14214–12. doi:10.1038/s41598-020-71188-3

[15] Suárez D, Díaz N. SARS-CoV-2 Main Protease: A Molecular Dynamics Study. J Chem Inf Model. American Chemical Society; 2020;60: 5815–5831. doi:10.1021/acs.jcim.0c00575

[16] Pinto D, Park Y-J, Beltramello M, Walls AC, Tortorici MA, Bianchi S, et al. Cross-neutralization of SARS-CoV-2 by a human monoclonal SARS-CoV antibody. Nature. Nature Publishing Group; 2020;583: 290–295. doi:10.1038/s41586-020-2349-y

[17] Rogers TF, Zhao F, Huang D, Beutler N, Burns A, He W-T, et al. Isolation of potent SARS-CoV-2 neutralizing antibodies and protection from disease in a small animal model. Science. 2020;369: 956–963. doi:10.1126/science.abc7520

[18] Cao Y, Su B, Guo X, Sun W, Deng Y, Bao L, et al. Potent Neutralizing Antibodies against SARS-CoV-2 Identified by High-Throughput Single-Cell Sequencing of Convalescent Patients’ B Cells. Cell. 2020;182: 73–84.e16. doi:10.1016/j.cell.2020.05.025

[19] Deganutti G, Prischi F, Reynolds CA. Supervised molecular dynamics for exploring the druggability of the SARS-CoV-2 spike protein. J Comput Aided Mol Des. Springer International Publishing; 2020;20: 1015–13. doi:10.1007/s10822-020-00356-4

[20] Arantes PR, Saha A, Palermo G. Fighting COVID-19 Using Molecular Dynamics Simulations. ACS Cent Sci. American Chemical Society; 2020;6: 1654–1656. doi:10.1021/acscentsci.0c01236

[21] Karathanou K, Lazaratos M, Bertalan É, Siemers M, Buzar K, Schertler GFX, et al. A graph-based approach identifies dynamic H-bond communication networks in spike protein S of SARS-CoV-2. J Struct Biol. 2020;212: 107617. doi:10.1016/j.jsb.2020.107617

[22] Melero R, Sorzano COS, Foster B, Vilas J-L, Martínez M, Marabini R, et al. Continuous flexibility analysis of SARS-CoV-2 spike prefusion structures. IUCrJ. 2020;7: 1059–1069. doi:10.1107/S2052252520012725

[23] Verkhivker GM. Molecular Simulations and Network Modeling Reveal an Allosteric Signaling in the SARS-CoV-2 Spike Proteins. J Proteome Res. American Chemical Society; 2020;19: 4587–4608. doi:10.1021/acs.jproteome.0c00654

[24] Majumder S, Chaudhuri D, Datta J, Giri K. Exploring the intrinsic dynamics of SARS-CoV-2, SARS-CoV and MERS-CoV spike glycoprotein through normal mode analysis using anisotropic network model. J Mol Graph Model. 2021;102: 107778. doi:10.1016/j.jmgm.2020.107778

[25] Frappier V, Najmanovich RJ. A coarse-grained elastic network atom contact model and its use in the simulation of protein dynamics and the prediction of the effect of mutations. MacKerell AD, editor. PLoS Comput Biol. 2014;10: e1003569. doi:10.1371/journal.pcbi.1003569

[26] Frappier V, Chartier M, Najmanovich RJ. ENCoM server: exploring protein conformational space and the effect of mutations on protein function and stability. Nucleic Acids Res. 2015;43: W395–400. doi:10.1093/nar/gkv343

[27] Frappier V, Chartier M, Najmanovich R. Applications of Normal Mode Analysis Methods in Computational Protein Design. Methods Mol Biol. 2017;1529: 203–214. doi:10.1007/978-1-4939-6637-0_9

[28] Frappier V, Najmanovich RJ. Vibrational entropy differences between mesophile and thermophile proteins and their use in protein engineering. Protein Science. 2015;24: 474–483. doi:10.1002/pro.2592

[29] Walls AC, Park Y-J, Tortorici MA, Wall A, McGuire AT, Veesler D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell. 2020;181: 281–292.e6. doi:10.1016/j.cell.2020.02.058

[30] Gobeil S, Janowska K, McDowell S, Mansouri K, Parks R, Manne K, et al. D614G mutation alters SARS-CoV-2 spike conformational dynamics and protease cleavage susceptibility at the S1/S2 junction. bioRxiv. Cold Spring Harbor Laboratory; 2020;74: 531. doi:10.1101/2020.10.11.335299

[31] Yuan Y, Cao D, Zhang Y, Ma J, Qi J, Wang Q, et al. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nat Commun. Nature Publishing Group; 2017;8: 15092–9. doi:10.1038/ncomms15092

[32] Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993;234: 779–815. doi:10.1006/jmbi.1993.1626

[33] Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Res. 2005;33: W382–8. doi:10.1093/nar/gki387

[34] Xiong X, Qu K, Ciazynska KA, Hosmillo M, Carter AP, Ebrahimi S, et al. A thermostable, closed SARS-CoV-2 spike protein trimer. Nat Struct Mol Biol. Nature Publishing Group; 2020;27: 934–941. doi:10.1038/s41594-020-0478-5

[35] Wako H, Endo S. Normal mode analysis as a method to derive protein dynamics information from the Protein Data Bank. Biophys Rev. Springer Berlin Heidelberg; 2017;9: 877–893. doi:10.1007/s12551-017-0330-2

[36] Tang Q-Y, Kaneko K. Long-range correlation in protein dynamics: Confirmation by structural data and normal mode analysis. de Groot BL, editor. PLoS Comput Biol. 2020;16: e1007670. doi:10.1371/journal.pcbi.1007670

[37] Cui Q, Bahar I. Normal Mode Analysis. CRC Press; 2006.

[38] Mailhot O, Najmanovich R. The NRGTEN Python package: an extensible toolkit for coarse-grained normal mode analysis of proteins, nucleic acids, small molecules and their complexes. arXiv.org. 2020.

[39] Xu B, Shen H, Zhu X, Li G. Fast and accurate computation schemes for evaluating vibrational entropy of proteins. Journal of Computational Chemistry. John Wiley & Sons, Ltd; 2011;32: 3188–3193. doi:10.1002/jcc.21900

[40] Marques O, Sanejouand YH. Hinge-bending motion in citrate synthase arising from normal mode calculations. PROTEINS: Structure, Function and Genetics. Wiley Subscription Services, Inc., A Wiley Company; 1995;23: 557–560. doi:10.1002/prot.340230410

[41] Volz E, Hill V, McCrone JT, Price A, Jorgensen D, O’Toole Á, et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell. 2020. doi:10.1016/j.cell.2020.11.020

[42] Li Q, Wu J, Nie J, Zhang L, Hao H, Liu S, et al. The Impact of Mutations in SARS-CoV-2 Spike on Viral Infectivity and Antigenicity. Cell. 2020;182: 1284–1294.e9. doi:10.1016/j.cell.2020.07.012

[43] Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, et al. Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus. Cell. 2020;182: 812–827.e19. doi:10.1016/j.cell.2020.06.043

[44] Yurkovetskiy L, Wang X, Pascal KE, Tomkins-Tinch C, Nyalile TP, Wang Y, et al. Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant. Cell. 2020;183: 739–751.e8. doi:10.1016/j.cell.2020.09.032

[45] Mohammad A, Alshawaf E, Marafie SK, Abu-Farha M, Abubaker J, Al-Mulla F. Higher binding affinity of Furin to SARS-CoV-2 spike (S) protein D614G could be associated with higher SARS-CoV-2 infectivity. Int J Infect Dis. 2020. doi:10.1016/j.ijid.2020.10.033

[46] Tang L, Schulkins A, Chen C-N, Deshayes K, Kenney JS. The SARS-CoV-2 Spike Protein D614G Mutation Shows Increasing Dominance and May Confer a Structural Advantage to the Furin Cleavage Domain. Preprints; 2020;: 2020050407. doi:10.20944/preprints202005.0407.v1

[47] Zhang L, Jackson CB, Mou H, Ojha A, Rangarajan ES, Izard T, et al. The D614G mutation in the SARS-CoV-2 spike protein reduces S1 shedding and increases infectivity. bioRxiv. Cold Spring Harbor Laboratory; 2020;: 2020.06.12.148726. doi:10.1101/2020.06.12.148726

[48] Berger I, Schaffitzel C. The SARS-CoV-2 spike protein: balancing stability and infectivity. Cell Res. Nature Publishing Group; 2020;30: 1059–1060. doi:10.1038/s41422-020-00430-4

[49] Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. European Centre for Disease Prevention and Control; 2017;22: 957. doi:10.2807/1560-7917.ES.2017.22.13.30494

[50] Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall. John Wiley & Sons, Ltd; 2017;1: 33–46. doi:10.1002/gch2.1018

[51] Kirchdoerfer RN, Wang N, Pallesen J, Wrapp D, Turner HL, Cottrell CA, et al. Stabilized coronavirus spikes are resistant to conformational changes induced by receptor recognition or proteolysis. Sci Rep. Nature Publishing Group; 2018;8: 15701–11. doi:10.1038/s41598-018-34171-7

[52] Starr TN, Greaney AJ, Hilton SK, Ellis D, Crawford KHD, Dingens AS, et al. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell. 2020;182: 1295–1310.e20. doi:10.1016/j.cell.2020.08.012

[53] Hodcroft EB, Zuber M, Nadeau S, Crawford KHD, Bloom JD, Veesler D, et al. Emergence and spread of a SARS-CoV-2 variant through Europe in the summer of 2020. medRxiv. Cold Spring Harbor Laboratory Press; 2020;15: e1006650. doi:10.1101/2020.10.25.20219063

[54] Rambaut A, Loman N, Pybus O, Barclay W, Barrett J, Carabelli A, et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations [Internet]. Dec 2020 [cited 6 Jan 2021]. Available: https://virological.org/t/preliminary-genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-the-uk-defined-by-a-novel-set-of-spike-mutations/563

[55] Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Giandhari J, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. medRxiv. Cold Spring Harbor Laboratory Press; 2020;34: 2020.12.21.20248640. doi:10.1101/2020.12.21.20248640

[56] Greaney AJ, Loes AN, Crawford KH, Starr TN, Malone KD, Chu HY, et al. Comprehensive mapping of mutations to the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human serum antibodies. bioRxiv. Cold Spring Harbor Laboratory; 2021;: 2020.12.31.425021. doi:10.1101/2020.12.31.425021

[57] Hilton S, Huddleston J, Black A, North K, Dingens A, Bedford T, et al. dms-view: Interactive visualization tool for deep mutational scanning data. JOSS. 2020;5: 2353. doi:10.21105/joss.02353

[58] Oude Munnink BB, Sikkema RS, Nieuwenhuijse DF, Molenaar RJ, Munger E, Molenkamp R, et al. Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans. Science. 2020;: eabe5901. doi:10.1126/science.abe5901