Abstract
Continuous evolution of Omicron has led to numerous subvariants that exhibit growth advantage over BA.5. Such rapid and simultaneous emergence of variants with enormous advantages is unprecedented. Despite their rapidly divergent evolutionary courses, mutations on their receptor-binding domain (RBD) converge on several hotspots, including R346, K356, K444, L452, N460K and F486. The driving force and destination of such convergent evolution and its impact on humoral immunity established by vaccination and infection remain unclear. Here, we demonstrate that these convergent mutations can cause striking evasion of convalescent plasma, including those from BA.5 breakthrough infection, and existing antibody drugs, including Evusheld and Bebtelovimab. BR.2, CA.1, BQ.1.1, BM.1.1.1, and especially XBB, are the most antibody-evasive strain tested, far exceeding BA.5 and approaching SARS-CoV-1 level. To delineate the origin of the convergent evolution, we determined the escape mutation profiles and neutralization activity of monoclonal antibodies (mAbs) isolated from BA.2 and BA.5 breakthrough-infection convalescents. Importantly, due to humoral immune imprinting, BA.2 and especially BA.5 breakthrough infection caused significant reductions in the epitope diversity of neutralizing antibodies and increased proportion of non-neutralizing mAbs, which in turn concentrated humoral immune pressure and promoted the convergent RBD evolution. Additionally, the precise convergent RBD mutations and evolution trends of BA.2.75/BA.5 subvariants could be inferred by integrating the neutralization-weighted DMS profiles of mAbs from various immune histories (3051 mAbs in total). Moreover, we demonstrated that as few as five additional convergent mutations based on BA.5 or BA.2.75 could completely evade most plasma samples, including those from BA.5 breakthrough infection, while retaining sufficient hACE2-binding affinity. These results suggest that current herd immunity and BA.5 vaccine boosters may not provide sufficiently broad protection against infection. Broad-spectrum SARS-CoV-2 vaccines and NAb drugs development should be of high priority, and the constructed convergent mutants could serve to examine their effectiveness in advance.
Main
SARS-CoV-2 Omicron BA.1, BA.2, and BA.5 have demonstrated strong neutralization evasion capability, posing severe challenges to the efficacy of existing humoral immunity established through vaccination and infection 1-15. Nevertheless, Omicron is continuously evolving, leading to various new subvariants, including BA.2.75 and BA.4.6 16-22. Importantly, a high proportion of these emerging variants display significant growth advantages over BA.5, such as BA.2.3.20, BA.2.75.2, CA.1, BR.2, BN.1, BM.1.1.1, BU.1, BQ.1.1, and XBB (Fig. 1a) 23. Such rapid and simultaneous emergence of multiple variants with enormous growth advantages is unprecedented. Notably, although these derivative subvariants appear to diverge along the evolutionary course, the mutations they carry on the receptor-binding domain (RBD) converge on the same sites, including R346, K444, V445, G446, N450, L452, N460, F486, F490, and R493 (Fig. 1b, Extended Data Fig. 1). Most mutations on these residues are known to be antibody-evasive, as revealed by deep mutational scanning (DMS) 1,2,24-26. It’s crucial to examine the impact of these convergent mutations on antibody-escaping capability, receptor binding affinity, and the efficacy of vaccines and antibody therapeutics. It’s also important to investigate the driving force behind this accelerated emergence of RBD mutations, what such mutational convergence would lead to, and how we can prepare for such convergent RBD evolution.
Convergent RBD evolution causes extensive immune evasion
First, we tested the antibody evasion capability of these convergent variants. We constructed the VSV-based spike-pseudotyped virus of Omicron BA.2, BA.2.75, and BA.4/5 sublineages carrying those convergent mutations and examined the neutralizing activities of therapeutic neutralizing antibodies (NAbs) against them (Fig. 2a and Extended Data Fig. 2a) 13,27-32. The COV2-2196+COV2-2130 (Evusheld) 28,33 is vulnerable to F486, R346, and K444-G446 mutations, evaded or highly impaired by BJ.1 (R346T), XBB (R346T+V445P+F486S), BA.2.75.2/CA.1/BM.1.1/BM.1.1.1 (R346T+F486S), BR.2 (R346T+F486I), BA.4.6.1 (R346T+F486V), BA.5.6.2/BQ.1 (K444T+F486V), BU.1 (K444M+F486V), and BQ.1.1 (R346T+K444T+F486V). LY-CoV1404 (Bebtelovimab) remains potent against BF.16 (K444R) and BA.5.1.12 (V445A) but shows reduced potency against BA.5.5.1 (N450D) 31 (Extended Data Fig. 2a). Importantly, LY-CoV1404 was escaped by BJ.1, XBB, BR.1, and BQ.1.1 while exhibiting strongly reduced activity against BA.2.38.1, BA.5.6.2, and BQ.1 due to K444N/T mutations and the combination of K444M/G446S or V445P/G446S 31. SA55+SA58 is a pair of broad NAbs isolated from vaccinated SARS convalescents that target non-competing conserved epitopes 2,32. SA58 is weak to G339H and R346T mutations and showed reduced neutralization efficacy against BJ.1/XBB and BA.2.75 sublineages. SA55 is the only NAb demonstrating high potency against all tested Omicron subvariants. Among the tested variants, XBB and BQ.1.1 exhibited the strongest resistance to therapeutic mAbs and cocktails (Fig. 2a). Since the SA55+SA58 cocktail is still in preclinical development, the efficacy of available antibody drugs, including the BA.2.75/BA.5-effective Evusheld and Bebtelovimab, are extensively affected by the emerging subvariants with convergent mutations.
Sufficient ACE2-binding affinity is essential for SARS-CoV-2 variants to gain advantages in transmission. Thus, we examined the relative hACE2 binding affinity of these variants by evaluating hACE2 neutralizing potency against the pseudoviruses. The higher neutralizing efficacy of soluble hACE2 indicates a higher ACE2-binding affinity 18. Overall, these convergent variants all demonstrate sufficient ACE2-binding affinity, at least higher than D614G, allowing their prevalence, including the most evasive XBB, BM.1.1.1, BR.2, CA.1, and BQ.1.1 (Fig. 2b and Extended Data Fig. 2b). Specifically, R493Q reversion increases the binding affinity to hACE2, which is consistent with previous reports 4,18,20. K417T shows a moderate increase in binding affinity to hACE2. In contrast, F486S, K444M, and K444N have a clear negative impact on binding affinity, while K444T does not cause significant impairment of ACE2 binding. These observations are also in line with previous DMS results 34.
Most importantly, we investigated how these variants escape the neutralization of plasma samples from individuals with various immune histories. We recruited cohorts of individuals who received 3 doses of CoronaVac 35 with or without breakthrough infection by BA.1, BA.2, or BA.5. Convalescent plasma samples were collected around 4 weeks after hospital discharge (Supplementary Table 1). Plasma from CoronaVac vaccinees was obtained two weeks after the third dose. There was a significant reduction in the NT50 against most tested BA.2, BA.2.75, or BA.5 subvariants, compared to that against corresponding ancestral BA.2, BA.2.75, or BA.5, respectively (Fig. 2c-f and Extended Data Fig. 2c-f). BJ.1, BA.2.10.4, and BA.2.3.20 are highly immune evasive. Plasma of vaccinees and BA.1/BA.2/BA.5 convalescents neutralize these three subvariants with similar NT50 (Fig. 2c-e). Newly emerging BA.2.75 and BA.5 subvariants also demonstrated strong antibody-evading capability. BA.5 subvariants with R346T, K444T/R, and V445A exhibit a similar reduction in NT50, while N450D is slightly less evasive than others (Extended Data Fig. 2c-f). BA.2.75 subvariants with L452R, R346T, K356T, K444M, and F490S exhibit further immune evasion as well. Importantly, BA.2.75.2 (BA.2.75+R346T+F486S), BM.1.1 (BA.2.75+R346T+F486S), BM.1.1.1 (BM.1.1+F490S), CA.1 (BA.2.75.2+L452R+T604I) in BA.2.75 sublineages, and BU.1 (BA.5+Y144del+K444M+N460K), BQ.1.1 (BA.5+R346T+K444T+N460K) in BA.5 sublineages, are among the most humoral immune evasive strain tested. CA.1 causes a reduction in NT50 of plasma from 3-dose vaccinees, BA.1 breakthrough infection, BA.2 breakthrough infection, and BA.5 breakthrough infection by 4.3, 7.6, 8.8, and 3.4 folds compared to BA.2.75, respectively. BQ.1.1 reduces the NT50 of the plasma from the four cohorts by 3.0, 3.9, 4.4 and 6.4 folds compared to BA.5, respectively. NT50 of BA.5 convalescents against BQ.1.1 is higher than that against CA.1, which is probably contributed by the strong capability of BA.2.75 subvariants to evade NTD-targeting antibodies (Fig. 2f). Strikingly, the BJ.1/BM.1.1.1 recombinant strain XBB showed the most extreme immune evasion ability, displaying comparable NT50 to SARS-CoV-1. These observations demonstrate that convergent RBD evolution could cause significant immune escape at a scale never seen before. Such rapid and convergent emergence of antibody-escaping variants, especially when they can evade convalescent plasma from BA.5 breakthrough infection, suggests that vaccine boosters designed based on BA.5 may not achieve broad-spectrum protection against infection.
Immune imprinting induces convergent RBD evolution
It is crucial to investigate the origin of such accelerated RBD convergent evolution. Therefore, we characterized the antibody repertoires induced by Omicron BA.2 and BA.5 breakthrough infection, which is the dominant immune background of current global herd immunity. Following the strategy described in our previous report using PBMC from BA.1 breakthrough infection 2, we enriched antigen-specific memory B cells by fluorescence-activated cell sorting (FACS) for individuals who had recovered from BA.2 and BA.5 breakthrough infection (Supplementary Table 1). RBD-binding CD27+/IgM-/IgD- cells were subjected to single-cell V(D)J sequencing (scVDJ-seq) to determine the BCR sequences (Extended Data Figure 3a-c). Similar to that reported in BA.1 breakthrough infection, immune imprinting, or so-called “original antigenic sin”, is also observed in BA.2 and BA.5 breakthrough infection 2,36-39. Post-vaccination infection with BA.2 and BA.5 mainly recalls cross-reactive memory B cells elicited by wildtype-based vaccine, but rarely produces BA.2/BA.5 specific B cells, similar to BA.1 breakthrough infection (Fig. 3a-b). This is in marked contrast to Omicron infection without previous vaccination (Fig. 3c). The RBD-targeting antibody sequences determined by scVDJ-seq are then expressed in vitro as human IgG1 monoclonal antibodies (mAbs). As expected, only a small proportion of the expressed mAbs specifically bind to BA.2/BA.5 RBD and are not cross-reactive to WT RBD, determined by enzyme-linked immunosorbent assay (ELISA), concordant with the FACS results (Fig. 3d). Importantly, cross-reactive mAbs exhibit significantly higher somatic hypermutation (SHM), indicating that these antibodies are more affinity-matured and are most likely recalled from previously vaccination-induced memory (Fig. 3d).
Next, we determined the escape mutation profiles of these antibodies by high-throughput DMS and measured their neutralizing activities against SARS-CoV-2 D614G, BA.2, BA.5, and BA.2.75 (Fig. 3e, Extended Data Fig. 4a-b). Previously, we reported the DMS profiles and the epitope distribution of antibodies isolated from WT vaccinated/infected individuals, SARS-CoV-2 vaccinated SARS convalescents, and BA.1 convalescents, which could be classified into 12 epitope groups 2. Among them, mAbs in groups A, B, C, D1, D2, F2, and F3 compete with ACE2 and exhibit neutralizing activity (Extended Data Fig. 5a-d, 6a-d); while mAbs in groups E1, E2.1, E2.2, E3, and F1 do not compete with ACE2 (Extended Data Fig. 7a-c). Antibodies in groups E2.2, E3, and F1 exhibit low or no neutralizing capability (Extended Data Fig. 4b, 7d). To integrate the previous dataset with DMS profiles of the new mAbs isolated from BA.2 and BA.5 convalescents, we co-embedded all antibodies using multidimensional scaling (MDS) based on their DMS profiles, followed by t-distributed stochastic neighbor embedding (t-SNE) for visualization, and used KNN-based classification to determine the epitope groups of new mAbs (Fig. 3e). This results in a dataset containing the DMS profiles of 3051 SARS-CoV-2 WT RBD-targeting mAbs in total (Supplementary Table 2). The epitope distribution of mAbs from BA.2 breakthrough infection is generally similar to those elicited by BA.1, except for the increased proportion of mAbs in group C (Fig. 3f). However, BA.5-elicited mAbs showed a more distinct distribution compared to BA.1, with a significantly increased proportion of mAbs in group D2 and E2.2, and decreased ratio of antibodies in groups B and E2.1. The main reason is that the F486 and L452 mutations carried by BA.5 make these cross-reactive memory B cells unable to be activated and recalled (Fig. 3f, Extended Data Fig. 5b, 6a and 7a). Remarkably, antibody repertoires induced by all Omicron breakthrough infections are distinct from those stimulated by WT. Compared to WT infection or vaccination, BA.1, BA.2, and BA.5 breakthrough infection mainly elicit mAbs of group E2.2, E3, and F1, which do not compete with ACE2 and demonstrate weak neutralizing activity, while WT-elicited antibodies enrich mAbs of groups A, B, and C which compete with ACE2 and exhibit strong neutralization potency (Fig. 3g-h). Strikingly, the combined proportion of E2.2, E3, and F1 antibodies rose from 29% in WT convalescents/vaccinees, 53% in BA.1 convalescents, 51% in BA.2 convalescents, to 63% in BA.5 convalescents (Fig. 3f). Overall, the proportion and diversity of neutralizing antibody epitopes are reduced in Omicron breakthrough infection, especially in BA.5 breakthrough infection.
To better delineate the impact of immune imprinting and consequent reduction of NAb epitope diversity on the RBD evolutionary pressure, we aggregated the DMS profiles of large collections of mAbs to estimate the impact of mutations on the efficacy of humoral immunity, as inspired by previous works (Supplementary Table 2) 40. It is essential to incorporate the effects of ACE2 binding, RBD expression, neutralizing activity of mAbs, and codon usage constraint to estimate the SARS-CoV-2 evolution trend on the RBD. In brief, each mutation on the RBD would have an impact on each mAb in the set, which is quantified by the escape scores determined by DMS and weighted by its IC50 against the evolving strain. For each residue, only those amino acids that are accessible by one nucleotide mutation are included. Impacts on ACE2-binding affinity and RBD expression of each mutation are also considered in the analyses, using data determined by DMS in previous reports 34,41,42. Finally, the estimated relative preference of each mutation is calculated using the sum of weighted escape scores of all mAbs in the specific set.
The reduced NAb epitope diversity caused by imprinted humoral response could be strikingly shown by the estimated mutation preference spectrum (Fig. 4a). Diversified escaping-score peaks, which also represent immune pressure, could be observed when using BA.2-elicited antibodies, while only two major peaks could be identified, R346T/S and K444E/Q/N/T/M, when using BA.5-elicited antibodies (Fig. 4a). Interestingly, these two hotspots are the most frequently mutated sites in continuously evolving BA.4/5 subvariants, and convergently occurred in multiple lineages (Fig. 1a-b and Extended Data Fig. 1). Similar analysis for WT and BA.1 also demonstrated diversified peaks; thus, the concentrated immune pressure strikingly reflects the reduced diversity of NAbs elicited by BA.5 breakthrough infection due to immune imprinting, and these concentrated preferred mutations highly overlapped with convergent hotspots observed in the real world (Extended Data Fig. 8a-b). Together, our results indicate that due to immune imprinting, BA.5 breakthrough infection caused significant reductions of NAb epitope diversity and increased proportion of non-neutralizing mAbs, which in turn concentrated immune pressure and promoted the convergent RBD evolution.
Accurate inference of RBD evolution hotspots
Moreover, we wonder if the real-world evolutionary trends of SARS-CoV-2 RBD could be rationalized and even predicted by aggregating this large DMS dataset containing mAbs from various immune histories. Using the mAbs elicited from WT vaccinees or convalescents weighted by IC50 against the D614G strain, we identified mutation hotspots including K417N/T, K444-G446, N450, L452R, and especially E484K (Extended Data Fig. 8a). Most of these residues were mutated in previous VOCs, such as K417N/E484K in Beta, K417T/E484K in Gamma, L452R in Delta, and G446S/E484A in Omicron BA.1, confirming our estimation and inference. Evidence of the emergence of BA.2.75 and BA.5 could also be found using WT, BA.1, and BA.2-elicited mAbs with IC50 against BA.2, where peaks on 444-446, 452, 460, and 486 could be identified (Extended Data Fig. 8c). To better investigate the evolution trends of BA.2.75 and BA.5, the two major lineages circulating currently, we then included antibodies elicited by various immune background, including WT/BA.1/BA.2/BA.5 convalescents, which we believe is the best way to represent the current heterogeneous global humoral immunity (Fig. 4b and Extended Data Fig. 8d). For BA.2.75, the most significant sites are R346T/S, K356T, N417Y/H/I/T, K444E/Q/N/T/M, V445D/G/A, N450T/D/K/S, L452R, I468N, A484P, F486S/V, and F490S/Y. We noticed that these identified residues, even most specific mutations, highly overlapped with recent mutation hotspots of BA.2.75 (Fig. 1b). Two exceptions are A484 and F490, which are feature residues of Group C and could be replaced by L452R and F486S/V (Extended Data Fig. 5c). I468N mutation is also highly associated with K356 mutations, and its function could be covered by K356T (Extended Data Fig. 7a-b). Due to stronger antibody evasion, the preference spectrum of BA.5 is much more concentrated compared to BA.2.75, but the remaining sites are highly overlapped and complementary with BA.2.75. The most striking residues are R346, K444-G446, and N450, followed by K356, N417, L455, N460, and A484. As expected, L452R/F486V does not stand out in BA.5 preference spectrum, while N460K harbored by BA.2.75 appears. These sites and mutations are also popular in emerging BA.4/5 subvariants, proving that our RBD evolution inference system works accurately.
Convergent evolution could eventually nullify plasma neutralization
It is important to examine where this convergence evolution would lead to. Based on the observed and predicted convergent hotpots on RBD of BA.2.75 and BA.5, we wonder if we could construct the convergent variants in advance and investigate to what extent they will evade the humoral immune response. To do this, we must first evaluate the antibody evasion capability of the convergent mutations and their combinations. Thus, we selected a panel of 178 NAbs from 8 epitope groups that could potently neutralize BA.2 and determined their neutralizing activity against constructed mutants carrying single or multiple convergent mutations (Fig. 4c and Extended Data Fig. 9a). NAbs from F1-F3 epitope groups were not included since they are either completely escaped by BA.2 or too rare in Omicron infected convalescents (Fig. 3f). As expected, R493Q and N417T are not major contributors to antibody evasion, but R493Q significantly benefits ACE2 binding. Most group A NAbs are sensitive to N460K, and BA.5+N460K escapes the majority of NAbs in group A due to the combination of F486V and N460K. All NAbs in group B are escaped by F486S/V. Group C NAbs are heavily escaped by F490S and are strongly affected by L452R and F486S/V. A part of group C NAbs is also slightly affected by K444N/T and N450D. G446S affects a part of the D1/D2 NAbs, as previously reported (Extended Data Fig. 6a-b) 20. D1/D2 NAbs are more susceptible to K444N/T, V445A and N450D, and some D1 NAbs could also be escaped by L452R (Extended Data Fig. 6a-b). E1 is mainly affected by R346T, D339H and K356T (Extended Data Fig. 6c). E2.1 and E2.2 exhibit similar properties, evaded by K356T, R346T and L452R (Extended Data Fig. 7a-b). E3 antibodies seem not significantly affected by any of the constructed mutants, as expected (Extended Data Fig. 7c), but they generally exhibit very low neutralization (Extended Data Fig. 7d). BA.5+R346T escapes most antibodies in D1, E1, and E2.1/E2.2, and an additional K444N further escapes most mAbs in D2, demonstrating the feasibility and effectiveness of combining convergent mutations to achieve further evasion. Importantly, adding six mutations to BA.5 could achieve the evasion of the vast majority of RBD NAbs, while maintaining high hACE2-binding affinity, despite the reduction caused by K444N/T and F486V. Together, these findings indicate the feasibility of generating a heavy-antibody-escaping mutant harboring accumulated convergent escape mutations while maintaining sufficient hACE2-binding affinity (Fig. 4c and Extended Data Fig. 9a).
Although the proportion of Omicron-specific mAbs is low due to immune imprinting, it is still necessary to evaluate their neutralization potency and breadth, especially against the convergent mutants. We tested the neutralizing activity of a panel of Omicron-specific RBD-targeting mAbs against D614G, BA.1, BA.2, BA.5, BA.2.75, BA.2.75.2, BR.1, BR.2, CA.1, BQ.1.1 and XBB. These mAbs were isolated from convalescent plasma with Omicron breakthrough infection (Fig. 4d). They could bind RBD of the corresponding exposed Omicron variant but do not cross-bind WT RBD, as confirmed by ELISA. We found these mAbs could effectively neutralize against the exposed strain, as expected, but exhibited poor neutralizing breadth, which means their potency would be largely impaired by other Omicron subvariants, consistent with our previous discovery 2. Notably, BQ.1.1 and XBB could escape most of these Omicron-specific NAbs. Thus, these Omicron-specific antibodies would not effectively expand the breadth of the antibody repertoire of Omicron breakthrough infection when facing convergent variants.
We then evaluated the potency of NTD-targeting NAbs against BA.2, BA.4/5, BA.2.75 and their sublineages and constructed mutants with selected NTD mutations using a panel of 14 NTD-targeting NAbs, as it is reported that NTD-targeting antibodies are abundant in plasma from BA.2 breakthrough infection and contribute to cross-reactivity 43. Most selected mutations are from recently designated Omicron subvariants, except for R237T, which was near V83A, designed to escape mAbs targeting an epitope reported recently 20. None of the NTD-targeting NAbs exhibit strong neutralizing potency, and the IC50 values are all over 0.2 μg/mL 44,45 (Fig. 4e and Extended Data Fig. 9b). We found the tested BA.2-effective NTD-targeting NAbs could be separated into two clusters, named group α and δ in our previous report, respectively 20 (Extended Data Fig. 9c). NAbs in group α target the well-known antigenic supersite on NTD 46, which is sensitive to K147E and W152R harbored by BA.2.75*, and Y144del harbored by BJ.1/XBB; while the other group δ is affected by V83A (XBB) and R237T. The other three NTD mutations harbored by BA.2.75, F157L, I210V and G257S did not significantly affect the tested mAbs, consistent with previous sera neutralization data 18. Two of the NTD mutations harbored by BJ.1 or XBB, Y144del and V83A, each escapes a cluster of them and together would enable XBB to exhibit extremely strong capability of escaping NTD-targeting NAbs.
Based on the above results, we designed multiple VSV-based pseudoviruses that gradually gain convergent mutations that could induce RBD/NTD-targeting NAb resistance (Fig. 5a). The constructed final mutant contains 11 additional mutations on the NTD and RBD compared to BA.5, or 9 mutations compared to BA.2.75. The neutralizing activities of Omicron-effective NAb drugs were first evaluated. As expected, the majority of existing effective NAb drugs, except SA55, are escaped by these mutants (Fig. 5b). Similarly, we also determined the ACE2-binding capability of these mutants by neutralization assays using hACE2 (Fig. 5c). Although some of the designed pseudoviruses, especially those with K444N and F486V, exhibit reduced activity to hACE2 compared to original BA.2.75 or BA.5 variants, their binding affinities are still higher than that of D614G (Fig. 2b). Importantly, our designed pseudoviruses could largely evade the plasma of vaccinees and convalescents after BA.1 breakthrough infection, BA.2 breakthrough infection, and even BA.5 breakthrough infection (Fig. 5d-g). Among the derivative mutants of BA.2.75, L452R, K444M, R346T, and F486V contribute mainly to the significant reduction in neutralization (Fig. 5d-g). Adding more NTD mutations does not contribute to stronger evasion in BA.2.75-based mutants, but we observed a significant reduction in NT50 of BA.2/BA.5 convalescents against BA.5-based mutants with K147E+W152R, suggesting BA.2/BA.5 convalescent plasma contains a large proportion of NTD-targeting antibodies 43. As the NTD of BA.1 differs from that of BA.2 and BA.5, we did not observe significant effects of NTD mutations on the efficacy of BA.1 convalescent plasma. Plasma neutralization titers of most vaccinees and convalescents decreased to the lower detection limit against BA.2.75 with 5 extra RBD mutations L452R, K444M, R346T, F486V, and K356T. The same applies to vaccinees or BA.1 convalescents against BA.5 with 4 extra RBD mutations K444N, R346T, N460K, and K356T. The plasma from BA.2/BA.5 convalescents can tolerate more mutations based on BA.5, and extra NTD mutations such as K147E and W152R are needed to completely eliminate their neutralization. Together, we demonstrate that as few as five additional mutations on BA.5 or BA.2.75 could completely evade most plasma samples, including those from BA.5 breakthrough infection, while maintaining high hACE2-binding affinity. Similar efforts have been made in a recent report despite different construction strategies 47. The constructed evasive mutants, such as BA.2.75-S5/6/7/8 and BA.5-S7/8, could serve to examine the effectiveness of broad-spectrum vaccines and NAbs in advance.
Discussion
In this work, we showed that convergent RBD evolution can cause severe immune evasion and could be rationalized by integrating DMS profiles. Given the existence of immune imprinting, the humoral immune repertoire is not effectively diversified by infection with new Omicron variants, while the immune pressure on the RBD becomes increasingly concentrated and promotes convergent evolution. The interaction between convergent evolution of escaping variants and less diversified antibody repertoire would ultimately lead to a highly evasive variant, posing a great challenge to current vaccines and antibody drugs.
Notably, the antibody evasion capability of CA.1, BQ.1.1, XBB, and the constructed convergent mutants have already reached or even exceeded SARS-CoV-1, indicating extensive antigenicity drift (Fig. 5d-g). Indeed, by constructing an antigenic map of the tested SARS-CoV-2/SARS-CoV-1 variants using the plasma NT50 data, we found that the antigenicity distances of SARS-CoV-2 ancestral strain to CA.1, XBB and BQ.1.1 are already comparable to that of SARS-CoV-1 (Extended Data Fig. 10a). Given that there are ∼50 different amino acids between SARS-CoV-1 and SARS-CoV-2 RBD, but only 21 mutations on BQ.1.1 RBD compared to the ancestral strain, these results indicate that the global pandemic indeed has greatly promoted the efficiency of the virus to evolve immune escape mutations.
Additionally, since these convergent variants could escape the binding of the majority of NAbs, it is possible that infections by those mutants may hardly revoke pre-existing memory B cells that encode NAbs, but only recall those memory B cells that encode non-neutralizing antibodies. This may cause inabilities to induce rapid increase of plasma neutralization after infection and may correlate with escalated percentages of severe symptoms. Therefore, the disease severity caused by new convergent variants needs to be closely monitored.
Finally, our prediction demonstrated a remarkable consistency with real-world observations. Some variants close to the predicted and constructed variants have already emerged while we performed the experiments, validating our prediction model. For example, BQ.1.1, BA.5+R346T+K444T+N460K, is highly similar to BA.5-S3, BA.5+R346T+N417T+K444N+N460K, given that K444N/T and K417N/T exhibit similar evasion capabilities (Fig. 4c). If we have this prediction model at the beginning of the pandemic, the development of NAb drugs and vaccines may not be so frustrated against the continuously emerging SARS-CoV-2 variants. Broad-spectrum SARS-CoV-2 vaccines and NAb drugs development should be of high priority, and the constructed convergent mutants could serve to examine their effectiveness in advance.
Data and Code Availability
Processed mutation escape scores and custom scripts to analyze the data can be downloaded at https://github.com/jianfcpku/convergent_RBD_evolution. Sequences and neutralization of the antibodies are included in Supplementary Table 2. Any other raw data of specific antibodies are available from the corresponding authors upon request. We used vdj_GRCh38_alts_ensembl-5.0.0 as the reference of V(D)J alignment, which can be obtained from https://support.10xgenomics.com/single-cell-vdj/software/downloads/latest.IMGT/DomainGapAlign is based on the built-in lastest IMGT antibody database, and we let the “Species” parameter as “Homo sapiens” while keeping the others as default.
Author contributions
Y.C. designed the study. X.S.X supervised the study. Y.C, F.J., A.Y, Q.G. and X.S.X. wrote the manuscript with inputs from all authors. A.Y., W.S., R.A., Yao.W., and X.N. performed B cell sorting, single-cell VDJ sequencing, and antibody sequence analyses. J.W. (BIOPIC), H.S. and F.J. performed and analyzed the DMS data. Y.Y. and Youchun.W. constructed the pseudotyped virus. N.Z., P.W., L.Y., T.X. and F.S. performed the pseudotyped virus neutralization assays. W.S. and Y.C. analyzed the neutralization data. X.H., Y.X., and R.J. recruited the SARS-CoV-2 vaccinees and convalescents. J.W. (Changping Laboratory), L.Y. and F.S. performed the antibody expression.
Conflicts of interest
X.S.X and Y.C. are co-founders of Singlomics Biopharmaceuticals and listed as inventors on patents related to DXP-604, SA55, and SA58. The remaining authors declare no competing interests.
Extended Data Figures
Methods
Isolation of peripheral blood mononuclear cells and plasma
Samples from vaccinees and individuals who had recovered from BA.1, BA.2, or BA.5 infection were obtained under study protocols approved by Beijing Ditan Hospital, Capital Medical University (Ethics committee archiving No. LL-2021-024-02) and the Tianjin Municipal Health Commission, and the Ethics Committee of Tianjin First Central Hospital (Ethics committee archiving No. 2022N045KY). All donors provided written informed consent for the collection of information, the use of blood and blood components, and the publication of data generated from this study. Whole blood samples were diluted 1:1 with PBS+2% FBS (Gibco) and subjected to Ficoll (Cytiva) gradient centrifugation. Plasma was collected from the upper layer. Cells were collected at the interface and further prepared by centrifugation, red blood cell lysis (Invitrogen eBioscience) and washing steps. The date of vaccination, hospitalization and sampling can be found in Supplementary Table 1.
BCR sequencing, analysis, and antibody production
CD19+ B cells were isolated from PBMCs with EasySep Human CD19 Positive Selection Kit II (STEMCELL, 17854). Every 106 B cells in 100 μl solution were stained with 3 μl FITC anti-human CD20 antibody (BioLegend, 302304, clone: 2H7), 3.5 μl Brilliant Violet 421 anti-human CD27 antibody (BioLegend, 302824, clone: O323), 2 μl PE/Cyanine7 anti-human IgM antibody (BioLegend, 314532, clone: MHM-88), 2 μl PE/Cyanine7 anti-human IgD antibody (BioLegend, 348210, clone: IA6-2), 0.13 μg biotinylated SARS-CoV-2 BA.2 RBD protein (customized from Sino Biological) or 0.13 μg biotinylated SARS-CoV-2 BA.5 RBD protein (customized from Sino Biological) conjugated with PE-streptavidin (BioLegend, 405204) or APC-streptavidin (BioLegend, 405207), 0.13 μg SARS-CoV-2 WT biotinylated RBD protein (Sino Biological, 40592-V27H-B) conjugated with Brilliant Violet 605 Streptavidin (BioLegend, 405229). Cells are also labeled with biotinylated RBD conjugated to DNA-oligo-streptavidin. Cells were washed twice after 30 minutes of incubation on ice. 7-AAD (Invitrogen, 00-6993-50) was used to label dead cells. 7-AAD−CD20+CD27+IgM−IgD− SARS-CoV-2 BA.2 RBD+ or BA.5+ cells were sorted with a MoFlo Astrios EQ Cell Sorter. FACS data were analyzed using FlowJo v10.8 (BD Biosciences).
Sorted B cells were resuspended in appropriate volume and then processed with Chromium Next GEM Single Cell V(D)J Reagent Kits v1.1 following the manufacturer’s user guide (10x Genomics, CG000208). Gel beads-in-emulsion (GEMs) were obtained with a 10X Chromium controller. GEMs were subjected to reverse transcription and purification. Reverse transcription products were subject to preamplification and purification with SPRIselect Reagent Kit (Beckman Coulter, B23318). BCR sequences (paired V(D)J) were enriched with 10X BCR primers. After library preparation, libraries were sequenced with Illumina sequencing platform. 10X Genomics V(D)J sequencing data were assembled as BCR contigs and aligned using Cell Ranger (v6.1.1) pipeline according to the GRCh38 BCR reference. Only the productive contigs and the B cells with one heavy chain and one light chain were kept for quality control. The germline V(D)J gene identification and annotation were performed by IgBlast (v1.17.1)48. Somatic hypermutation sites in the antibody variable domain were detected using Change-O toolkit (v1.2.0)49.
Antibody heavy and light chain genes were optimized for human cell expression and synthesized by GenScript. VH and VL were inserted separately into plasmids (pCMV3-CH, pCMV3-CL or pCMV3-CK) through infusion (Vazyme, C112). Plasmids encoding the heavy chain and light chain of antibodies were co-transfected by polyethylenimine-transfection to Expi293F™ cell (ThermoFisher, A14527). Cells were cultured at 36.5°C, 5% CO2, 175 rpm for 6-10 days. Supernatants containing mAbs were collected, and the supernatants were further purified with protein A magnetic beads (Genscript, L00695).
High-throughput mutation escape profiling
High-throughput mutation escape profiling platform has been described previously1,2. Briefly, deep mutation scanning libraries were constructed by mutagenesis PCR based on the Wuhan-Hu-1 RBD sequence (GenBank: MN908947, residues N331-T531). A unique 26-nucleotide (N26) barcode was appended to each RBD variant in mutant libraries by PCR, and the correspondence between the N26 barcode and mutations in RBD variants was acquired by PacBio sequencing. RBD mutant libraries were first transformed in the EBY100 strain of Saccharomyces cerevisiae and then enriched for properly folded ACE2 binders, which were used for subsequent mutation escape profiling. The above ACE2 binders were grown in SG-CAA media (2% w/v d-galactose, 0.1% w/v dextrose (d-glucose), 0.67% w/v yeast nitrogen base, 0.5% w/v casamino acids (−ade, −ura, −trp), 100 mM phosphate buffer, pH 6.0) at room temperature for 16-18h with agitation. Then these yeast cells were washed twice and proceeded to three rounds of magnetic beads-based selection. Obtained yeast cells after sequential sorting were recovered overnight in SD-CAA media (2% w/v dextrose (d-glucose), 0.67% w/v yeast nitrogen base, 0.5% w/v casamino acids (−ade, −ura, −trp), 70 mM citrate buffer, pH 4.5). Pre- and post-sort yeast populations were submitted to plasmid extraction by 96 Well Plate Yeast Plasmid Preps Kit (Coolaber, PE053). N26 barcode sequences were amplified with the extracted plasmid templates, and PCR products were purified and submitted to Illumina Nextseq 550 sequencing.
Antibody clustering and embedding based on DMS profiles
Data analysis of DMS was performed as described in previous reports 1,2. In brief, the detectedbarcode sequences of both the antibody-screened and reference library were aligned to the barcode-variant lookup table generated using dms_variants (v0.8.9). The escape scores of each variant X in the library were defined as F×(nX,ab / Nab) / (nX,ref / Nref), where F is a scale factor to normalize the scores to the 0-1 range, while n and N are the number of detected barcodes for variant X and total barcodes in post-selected (ab) or reference (ref) samples, respectively. The escape scores of each mutation were calculated by fitting an epistasis model as described previously 41,50.
Epitope groups of new antibodies not included in our previous report are determined by k-nearest neighbors (KNN)-based classification. In brief, site escape scores of each antibody are first normalized and considered as a distribution across RBD residues, and only residues whose standard derivation is among the highest 50% of all residues are retained for further analysis. Then the dissimilarity or distance of two antibodies is defined by the Jessen-Shannon divergence of the normalized escape scores. Pair-wise dissimilarities of all antibodies in the dataset are calculated using the scipy package (scipy.spatial.distance.jensenshannon, v1.7.0). For each antibody, 15 nearest neighbors whose epitope groups have been determined by unsupervised clustering in our previous paper are identified and simply voted to determine the group of the selected antibody. To project the dataset onto a 2D space for visualization, we performed MDS to represent each antibody in a 32-dimensional space, and then t-SNE to get the 2D representation, using sklearn.manifold.MDS and sklearn.manifold.TSNE (v0.24.2).
Calculation of the estimated preference of RBD mutations
Four different weights are included in the calculation, including the weight for ACE2-binding affinity, RBD expression, codon constraint, and neutralizing activity. Impact on ACE2-binding affinity and RBD expression of each mutation based on WT, BA.1 and BA.2 are obtained from public DMS results. And for BA.5 (BA.2+L452R+F486V+R493Q) and BA.2.75 (BA.2+D339H+G446S+N460K+R493Q), BA.2 results are used except for these mutated residues, whose scores for each mutant are subtracted by the score for the mutation in BA.5 or BA.2.75. As the reported values are log fold changes, the weight is simply defined by the exponential of reported values, i.e., exp [Sbind] or exp[Sexpr], respectively. For codonconstraint, the weight is 1.0 for mutants that could be accessed by one nucleotide mutation, and 0.0 for others. We used the following RBD nucleotide sequences for determination of accessible mutants, WT/D614G (Wuhan-Hu-1 reference genome), BA.1 (EPI_ISL_10000028), BA.2 (EPI_ISL_10000005), BA.4/5 (EPI_ISL_11207535), BA.2.75 (EPI_ISL_13302209). For neutralizing activity, the weight is -log10(IC50). The IC50 values (μg/mL), which are smaller than 0.0005 or larger than 1.0 are considered as 0.0005 or 1.0, respectively. The raw escape scores for each antibody are first normalized by the max score among all mutants, and the final weighted score for each antibody and each mutation is the production of the normalized scores and four corresponding weights. The final mutation-specific weighted score is the summation of scores of all antibodies in the designated antibody set, and then normalized again to make it a value between 0 and 1. Logo plots for visualization of escape maps were generated by the Python package logomaker (v0.8).
Pseudotyped-virus assay
The Spike gene (GenBank: MN908947) was mammalian condon-optimized and inserted into the pcDNA3.1 vector. Site-directed mutagenesis PCR was performed as described previously51. The sequence of mutants is shown in Fig. S1. Pseudotyped viruses were generated by transfection 293T cells (ATCC, CRL-3216) with pcDNA3.1-Spike with Lipofectamine 3000 (Invitrogen). The cells were subsequently infected with G*ΔG-VSV (Kerafast) that packages expression cassettes for firefly luciferase instead of VSV-G in the VSV genome. The cell supernatants were discarded after 6-8h harvest and replaced with complete culture media. The cell was cultured one day and then the cell supernatant containing pseudotyped virus was harvested, filtered (0.45-μm pore size, Millipore), aliquoted, and stored at -80 °C. Viruses of multiple variants were diluted to the same number of copies before use. mAbs or plasma was serially diluted and incubated with the pseudotyped virus in 96-well plates for 1 h at 37°C. Trypsin-treated Huh-7 cells (Japanese Collection of Research Bioresources, Cat0403) were added to the plate. The cells were cultured for 20-28 h in 5% CO2, 37°C incubators. The supernatants were removed and left 100 μL in each well, and 100 μL luciferase substrate (Perkinelmer, 6066769) was added and incubated in the dark for 2 min. The cell lysate was removed, and the chemiluminescence signals were collected by PerkinElmer Ensight. Each experiment was repeated at least twice.
Dulbecco’s modified Eagle medium (DMEM, high glucose; HyClone) with 100 U/mL of penicillin-streptomycin solution (Gibco), 20 mM N-2-hydroxyethylpiperazine-N-2-ethane sulfonic acid (HEPES, Gibco) and 10% fetal bovine serum (FBS, Gibco) were used in cell culture. Trypsin-EDTA (0.25%, Gibco) was used to detach cells before seeding to the plate.
Acknowledgments
We thank J. Bloom for his gift of the yeast SARS-CoV-2 RBD libraries. We thank all volunteers for providing the blood samples. This project is financially supported by the Ministry of Science and Technology of China and Changping laboratory under the project number (CPL-1233).
Footnotes
Update information on BU.1, BR.2, BM.1.1.1, CA.1, and XBB