ABSTRACT
Spillover of sarbecoviruses from animals to humans resulted in outbreaks of severe acute respiratory syndrome SARS-CoVs and the ongoing COVID-19 pandemic. Efforts to identify the origins of SARS-CoV-1 and -2 has resulted in the discovery of numerous animal sarbecoviruses – the majority of which are only distantly related to known human pathogens and do not infect human cells. The receptor binding domain (RBD) on sarbecoviruses engages receptor molecules on the host cell and mediates cell invasion. Here, we tested the receptor tropism for RBDs from two sarbecoviruses found in Russian horseshoe bats to screen cell entry. While these two viruses are in a viral lineage distinct from SARS-CoV-1 and -2, one virus was capable of using human ACE2 to facilitate cell entry. Our findings demonstrate that sarbecoviruses circulating in wildlife outside of Asia also exhibit compatibility with human ACE2 and should be taken into consideration for future universal sarbecovirus vaccine candidates.
Introduction
Zoonotic spillover of sarbecoviruses from animals to humans has led to the emergence of highly pathogenic human viruses, SARS-CoV-1 and −2, with the later leading to the largest global pandemic in modern history. Researchers around the world are ramping up the pace of viral discovery efforts, expanding the sequence databases with new animal sarbecoviruses in circulation. While some laboratory experiments have been performed with these new viruses, demonstrating a range of host tropisms, several viruses remain untested, and thus their ability to transmit to humans is unknown.
Coronaviruses are covered with a “spike” glycoprotein that engages with receptor molecules on the surface of host cells and mediates viral infection of the cell. A small region within the spike proteins of sarbecoviruses, known as the receptor binding domain (RBD), contains all of the structural information necessary to engage with the host receptor. We and others have experimentally classified the majority of published sarbecovirus RBDs into different clades based on sequence and functional data: clade 1, identified in Asian bats, contains no deletions and binds to host receptor, Angiotensin-Converting Enzyme 2 (ACE2), whereas clade 2, also identified in Asian bats, contains 2 deletions and does not use ACE2 and clade 3 viruses, found more widely in Africa and Europe, contain 1 deletion and also infect using an unknown receptor1–3. Recently, several viruses were identified in China that comprise a fourth clade that also interact with human ACE24.
In late 2020, two clade 3 sarbecoviruses were identified in Rhinolophus bats in Russia: Khosta-1 was found in Rhinolophus ferrumequinum and Khosta-2 in R. hipposideros5. Similar to other European and African clade 3 viruses, the Khosta viruses are divergent from the RBD found in SARS-CoV-1 and −2, and were thus presumed not to possess any tropism for human cells. However, a recent study demonstrated that a new clade (clade 4) of sarbecoviruses found in China containing a deletion similar to clade 3 sarbecoviruses can bind to human ACE24. Here, we tested the RBDs from Russian sarbecoviruses, along with other clade 3 viruses and report that the virus, Khosta-2, is capable of infecting cells using the human orthologue of ACE2.
Results
Khosta virus receptor binding domains are distinct from human viruses
Khosta-1 and -−2 were identified by Alkhovsky and colleagues in bat samples collected between March-October 2020 near Sochi National Park5. Phylogenetic analysis of the conserved viral gene, Orf1ab, revealed these viruses were most closely related to another sarbecovirus found in Bulgaria in 2008 (known as BM48-31 or Bg08), and form a lineage sarbecoviruses distinct from human pathogens, SARS-CoV-1 and −25. A list of viruses and accession numbers used in this study can be found in table 1. Phylogenetic analysis of the spike RBD further reflected the close relatedness between Khosta −1 and −2 with BM48-31 and other clade 3 RBD viruses we have previously tested from Uganda and Rwanda1,6 (Fig. 1a). Clade 3 RBDs, including the Khosta viruses, contain a truncated surface-exposed loop, as compared to the ACE2-dependent, clade 1 viruses such as SARS-CoV and additionally vary in most of the residues known for clade 1 viruses to interact with ACE21,2,6,7.
(a) Sarbecovirus Receptor Binding Domain Cladogram based on amino acid sequenced. Countries of origin and known host receptors are indicated to the right. Clade 1 viruses are shown in red and orange, clade 2 in grey, clade 3 in blue and clade 4 in purple. (b) Diagram of spike constructs used for this study. The SARS-CoV-1 RBD was replaced with RBDs from other sarbecoviruses. (c) Expression and incorporation of viral pseudotypes by westernblot. (d) Huh-7 cells were infected with pseudotypes in the presence of absence of trypsin. Cells were infected in triplicate. (e) BHK cells were transfected with receptors and infected in the absence of trypsin. Cells were infected in quadruplicate.
RBD from Khosta viruses mediate entry into human cells
Using our scalable Sarbecovirus RBD entry platform, we replaced the RBD from SARS-CoV-1 spike with the Khosta RBDs and generated chimeric spike expression plasmids (Fig. 1b)1. For comparison, we also included chimeric RBD spikes for other clade 3 RBDs we have previously tested (BM48-31, Uganda, Rwanda) as well as SARS-CoV-2 and related RatG13 viruses. These chimeric spike expression constructs were used to produce BSL2-compatible viral reporter pseudotypes with Vesicular Stomatitis Virus expressing a dual GFP-luciferase reporter1. All of the chimeric spike proteins expressed to similar levels in mammalian cells and incorporated in Vesicular Stomatitis Virus (VSV). Chimeric spike with the RBD from BM48-31 and RatG13 showed reduced incorporation but this did not correlate with viral entry phenotypes observed in later experiments (Fig. 1c, e).
To test human cell compatibility, we first infected the human liver cell line, Huh-7, with pseudotypes bearing the chimeric Khosta RBD spikes. In the absence of the addition of an exogenous protease, trypsin, the pseudotypes exhibited almost no entry in these cells, which has been observed for other sarbecoviruses and is attributed to low endogenous expression of ACE2. However, when trypsin was included during the infection, entry signal strongly increased for SARS-CoV-1 and −2 RBDs as well as the Khosta RBDs (Fig. 1d). As we and others have shown previously, trypsin enhancement of sarbecovirus entry is still receptor dependent, suggesting that the Khosta virus RBDs were using a receptor present in human cells to mediate infection1,8.
The RBD from Khosta-2 infects cells using human ACE2
To characterize potential receptors that the Khosta viruses may be using, we performed a classic receptor tropism test, where we transfected Baby Hamster Kidney (BHK) cells with human orthologues of known coronavirus receptors and then infected with our pseudotype panel. Unlike 293T cells, which express low levels of human ACE2 and potentially other coronavirus receptors and have been shown to have low but measurable permissivity to SARS-CoV infection, BHK cells are generally considered completely non-permissive for sarbecoviruses unless a suitable receptor is supplemented9. The Khosta-1 RBD failed to infect cells expressing any of the receptors, while Khosta 2 RBD clearly infected cells expressing human ACE2 (figure 1e). The level of cell entry mediated by the Khosta 2 RBD was similar to RatG13, a bat sarbecovirus with high similarity to SARS-CoV-2 in the RBD that also binds human ACE2, albeit with lower efficiency than the human pathogen10,11 (Fig. 1e). In contrast to the ACE2 results, only HCoV-229E could infect cells expressing Aminopeptidase N (APN), and MERS-CoV spike could only infect cells expressing dipeptidyl peptidase IV (DPP4) – the known receptors for these viruses, demonstrating these receptors were expressed to functional levels (Fig. 1e).
Discussion
To date, all animal derived sarbecoviruses capable of infecting cells through human ACE2 have been found in Asia2,12,15. Here, we provide evidence that animal-derived sarbcoviruses circulating in wildlife outside of Asia can also possess the capacity to infect human cells and use human ACE2 (Fig. 1). Khosta 1 and 2 viruses are most closely related to other clade 3 RBD viruses, which have been found across a much wider geographic range than the clade 1 viruses1,6,16,17. Thus, our findings show that sarbecovirus host receptor compatibility is not geographically locked and that other sarbecoviruses, which may only be distantly related to human SARS-CoVs, may also stochastically possess the capacity to infect humans. Importantly, as the researchers who initially discovered the Khosta viruses note with their findings: the Khosta bat sarbecoviruses are genetically distinct from human SARS-CoVs in that they lack genetic information encoding for some of the genes thought to antagonize the immune system and contribute to pathogenicity, such as Orf85. Unfortunately, because coronaviruses are known to recombine in co-infected hosts, the recent identification of SARS-CoV-2 spillover from humans back in wildlife populations opens the possibility of new human-compatible sarbecoviruses18–23.
In the presence of trypsin, both Khosta-1 and −2 RBDs were capable of infecting human cells, with Khosta-1 performing notably stronger than Khosta-2, however in our receptor-specific assay, only Khosta-2 could infect cells expressing human ACE2 without exogenous protease (Fig. 1d, e). We have previously shown that a small number of clade 2 RBDs, such as As6526, also exhibit protease-mediated, ACE2-independent entry, and similar phenotypes have been described for other bat coronaviruses1,24. Because not all of the clade 2 and 3 viruses exhibit this phenotype, these findings collectively suggest that other coronaviruses can infect human cells through a presently unknown receptor. Sarbecoviruses have been shown to co-circulate in their reservoirs, so this variation in receptor usage among closely related viruses may even represent an evolutionary strategy for viral persistence2.
Although the Khosta-2 RBD exhibited clear compatibility with human ACE2, pseudotype entry was approximately 10-fold less than for SARS-CoV-1 and −2, and was to levels closer to RatG13 RBD (Fig. 1e). The RBD for RatG13, while also compatible with human ACE2, has been shown by other groups to exhibit poor binding to human ACE225–27 A single point mutation, T403R/K, has been shown to improve RatG13-RBD binding and entry with human ACE2, although this mutation is already present in Khosta-2 (K403)25. We have previously demonstrated that MERS-CoV can rapidly adapt to overcome a semi-permissive receptor, suggesting that limited entry phenotypes are easily overcome by betacoronaviruses28. As new sarbecoviruses are discovered in wildlife, there is a propensity in the research community to compare single amino acids based on known human SARS-CoVs as an indicator for compatibility with humans. Our work underscores the challenges in predicting viral function from sequence data alone and sheds light on another group of sarbecoviruses with zoonotic potential. Current universal sarbecovirus vaccines in development include mostly clade 1 viruses and one of the clade 2 viruses but do not include any members from clade 329,30. Taken together, our findings with the Khosta viruses show that the sarbecoviruses may pose an even broader geographic threat than previously appreciated.
Methods
Phylogenetic analysis
Genbank accession numbers for all sarbecovirus spike sequences used in this study are available in table 1. Amino acid sequences for the receptor binding domain of the spike glycoprotein were aligned using ClustalW multiple sequence alignment with default parameters. A maximum likelihood phylogenetic tree was inferred with PhyML v. 3.031 using the ‘WAG’ matrix +G model of amino acid substitution as selected by Smart Model Selection method with 1000 bootstrap replicates32. The final tree was then visualized as a cladogram with FigTree v1.4.4 (https://github.com/rambaut/figtree).
Plasmids and sequences
Untagged human orthologues of ACE2 (Q9BYF1.2), APN (NP_001141.2), and DPP4 (XM_005246371.3) were described previously1. Spike sequences from HCoV-229E (AB691763.1), MERS-CoV (JX869059.2), and SARS-CoV-1 (AY278741) were codon-optimized, appended with a carboxy-terminal FLAG tag sequence separated by a flexible poly-glycine linker and cloned into pcDNA3.1+ as previously described1. The SARS-CoV-1 RBD was removed with KpnI and XhoI and replaced with codon-optimized, synthesized RBD fragments as previously described1.
Cells and pseudotype assay
293T, Huh-7 (human liver cells), and BHKs were maintained under standard cell culture conditions in DMEM with L-glutamine, antibiotics, and 10% Fetal Bovine Serum. Single-cycle, Vesicular Stomatitis Virus (VSV) pseudotype assays were performed as previously described1. Briefly, 293T “producer cells” were transfected with spike plasmids or empty vector as a “no spike” control and infected 24-hours later with VSV-ΔG-luc/GFP particles, generating chimeric-spike pseudotyped particles that were harvested 72 hours post-transfection and stored at −80°C. Target cells were plated in 96-well format, and spin-infected in quadruplicate with equivalent volumes of viral pseudotypes at 1200xG for 1 hour at 4°C. Infected cells were incubated for approximately 18-20 hours and luciferase was measured using the Promega BrightGlo luciferase kit following manufacturers’ instructions (Promega). Entry signal was normalized to the average signal for the “no spike” control. Plates were measured and analyzed in triplicate. Data are representative of four complete biological replicates.
Westernblot
Viral pseudotypes were concentrated and 293T producer cells were lysed in 1% SDS and clarified as described previously1. Lysates were analyzed on 10% Bis-Tris PAGE-gel (Thermo Fisher Scientific) and probed for FLAG (Sigma-Aldrich; A8592; 1:10000); GAPDH (Sigma-Aldrich, G8795, 1:10000); and/or VSV-M (Kerafast, 23H12, 1:5000). Signal was detected using SuperSignal West substrate (Thermo-Fisher).
AUTHOR CONTRIBUTIONS STATEMENT
S.N.S. and M.L. performed experiments and wrote the manuscript. M.L. designed the study, analyzed the data, and assembled the figures.
ADDITIONAL INFORMATION
The authors declare no competing interests.