ABSTRACT
The global spread of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and the associated disease COVID-19, requires therapeutic interventions that can be rapidly translated to clinical care. Unfortunately, traditional drug discovery methods have a >90% failure rate and can take 10-15 years from target identification to clinical use. In contrast, drug repurposing can significantly accelerate translation. We developed a quantitative high-throughput screen to identify efficacious single agents and combination therapies against SARS-CoV-2. Quantitative high-content morphological profiling was coupled with an AI-based machine learning strategy to classify features of cells for infection and stress. This assay detected multiple antiviral mechanisms of action (MOA), including inhibition of viral entry, propagation, and modulation of host cellular responses. From a library of 1,441 FDA-approved compounds and clinical candidates, we identified 15 dose-responsive compounds with antiviral effects. In particular, we discovered that lactoferrin is an effective inhibitor of SARS-CoV-2 infection with an IC50 of 308 nM and that it potentiates the efficacy of both remdesivir and hydroxychloroquine. Lactoferrin also stimulates an antiviral host cell response and retains inhibitory activity in iPSC-derived alveolar epithelial cells. Given its safety profile in humans, these data suggest that lactoferrin is a readily translatable therapeutic adjunct for COVID-19. Additionally, several commonly prescribed drugs were found to exacerbate viral infection and warrant clinical investigation. We conclude that morphological profiling for drug repurposing is an effective strategy for the selection and optimization of drugs and drug combinations as viable therapeutic options for COVID-19 pandemic and other emerging infectious diseases.
MAIN
SARS-CoV-2 is an enveloped, positive-sense, single-stranded RNA beta-coronavirus that emerged in Wuhan, China in November 2019 and rapidly developed into a global pandemic. The associated disease, COVID-19, has an array of symptoms, ranging from flu-like illness and gastrointestinal distress1,2 to acute respiratory distress syndrome, heart arrhythmias, strokes, and death3,4. Drug repurposing has played an important role in the search for COVID-19 therapies. Recently, the FDA issued emergency approval of remdesivir (GS-5734), a prodrug of a nucleoside inhibitor developed for Ebola virus treatment5, and hydroxychloroquine, an aminoquinoline derivative first developed in the 1940s for the treatment of malaria, for patients with severe COVID-19. However, there are no established prophylactic strategies or direct antiviral treatments available to limit SARS-CoV-2 infections and to prevent/cure the associated disease COVID-19.
Repurposing of FDA-approved drugs is a promising strategy for identifying rapidly deployable treatments for COVID-19. Benefits of repurposing include known safety profiles, robust supply chains, and a short time-frame necessary for development6. Additionally, approved drugs serve as chemical probes to understand the biology of viral infection and can make new associations between COVID-19 and molecular targets/pathways that influence pathogenesis of the disease. A complementary approach to standard in vitro antiviral assays is high-content imaging-based morphological cell profiling. Using morphological cell profiling, it is possible to identify pathways and novel biology underlying infection, thus allowing for targeted screening around a particular biological process or targeting of host processes that limit viral infection. This enables the identification of multiple anti-viral mechanisms, allowing for the rational design of drug combinations or, conversely, revealing drugs that exacerbate infectivity or are associated with cytotoxicity.
Here, we developed a pipeline for quantitative high-throughput image-based screening of SARS-CoV-2 infection. We leveraged machine learning approaches to create an assay metric that accurately and robustly identifies features that predict antiviral efficacy and mechanism of action (MOA). We identified several FDA-approved drugs and clinical candidates with unique antiviral activity. We further demonstrated that lactoferrin inhibits viral entry and replication, enhances antiviral host cell response, and potentiates the effects of remdesivir and hydroxychloroquine. Furthermore, we identified currently prescribed drugs that exacerbate viral infectivity. Collectively, we present evidence that morphological profiling can robustly identify new potential therapeutics against SARS-CoV-2 infection as well as drugs that potentially worsen COVID-19 outcomes.
Morphological profiling reveals unique features associated with SARS-CoV-2 infection
To determine the optimal cell line and appropriate endpoint for antiviral drug screening, we assessed SARS-CoV-2 infectivity in previously reported permissive cell lines: Vero-E6, Caco-2, and Huh-77. Viral growth kinetics at a multiplicity of infection (MOI) of 0.2 revealed that Vero-E6, Caco2, and Huh-7 cells supported viral infection, with peak viral titers at 48 hours post infection (hrs p.i.) (Supplementary Figure 1A/B). Although the viral load was higher in Vero-E6 cells, Huh-7 were selected for our morphological drug screen as a human cell line that expresses both ACE2 and TMPRSS2, which are the primary entry factors for SARS-CoV-28. Infection was detectable in Huh-7 cells at an MOI as low as 0.004 at 48 hrs p.i. (Supplementary Figure 1C), which highlights the high sensitivity of image-based screening. To identify compounds that inhibit or exacerbate infection, we selected an MOI of 0.2, leading to a baseline infectivity rate of 20%.
Morphological cell profiling was enabled through multiplexed staining and automated high-content fluorescence microscopy. Our multiplexed dye set included markers for SARS-CoV-2 nucleocapsid protein, nuclei (Hoechst 33342), neutral lipids (HCS LipidTox Green), and cell boundaries (HCS CellMask Orange). These fluorescent probes were chosen to capture a wide variety of cellular features relevant to viral infectivity, including nuclear morphology, nuclear texture, cytoplasmic and cytoskeletal features, and indicators of cell health. From our initial profiling we observed three prominent morphological features associated with SARS-CoV-2 infection: the formation of syncytia, increased nucleoli count (Supplementary Figure 1D), and cytoplasmic protrusions (Figure 1). These features, which are key indicators of SARS-CoV-2 infection in Huh-7, were used to generate our machine learning pipeline for antiviral drug discovery.
Machine learning identifies FDA-approved molecules with antiviral activity against SARS-CoV-2
To identify compounds with antiviral activity against SARS-CoV-2, we screened a custom library of 1,441 FDA-approved compounds and rationally included clinical candidates in Huh-7 cells (Supplementary File 1). Compounds were assessed for their antiviral activity using a CellProfiler-based image analysis pipeline and a random forest classification algorithm to identify infected cells and quantify their morphological characteristics (Figure 2A). The random forest classifier leveraged 660 unique cellular features including measurements of intensity, texture and radial distribution for each fluorescent channel (nuclei, cytoplasm, lipid, virus). From the primary quantitative high-throughput screening (qHTS), hits were defined as compounds with consistent decreases in viral infectivity in at least three of the tested concentrations (50 nM, 250 nM, 500 nM, 1000 nM and 2000 nM) as well as minimal cytotoxicity. This approach for hit identification was intentionally designed to be broad to minimize false negatives and maximize our list of efficacious compounds which would later be refined. Our first-pass qHTS approach identified 132 compounds with moderate dose-responsive antiviral behavior between 50 nM and 2 μM, and successfully eliminated compounds without any significant activity or obvious cytotoxicity.
In confirmatory screening, 10-point, two-fold dilution dose-response experiments were performed in triplicate on the 132 qHTS hits, with validation of dose-responsive efficacy for 15 compounds below 1 μM potency (Supplementary Table 1 and Figure 2B). These hits include nine that are novel in vitro observations (lactoferrin, S1RA, entecavir, lomitapide, metoclopramide, bosutinib, thioguanine, fedratinib, and Z-FA-FMK), and six that have been previously identified to have antiviral activity (amiodarone, verapamil, gilteritinib, clofazimine9,10, niclosamide11, and remdesivir). In addition to antiviral drug hits, we also identified several compounds that appear to exacerbate SARS-CoV-2 infection, including trametinib, binimetinib and cobimetinib-potent MEK inhibitors used to treat metastatic melanoma- and the Parkinson’s disease drugs carbidopa, methyldopa and levodopa (Supplementary Figure 2).
Cell level feature clustering reveals potential mechanisms of action for lead compounds
In contrast to standard single-endpoint in vitro assays, morphological cell profiling allows for the efficient visualization and quantitation of biological characteristics of viral infection and cytotoxicity. To assist with mechanistic determination, we used dimensionality reduction via the non-linear uniform manifold approximation and projection (UMAP) embedding, which projects the cellular feature vector into a 2-dimensional plot to observe clusters of cells based on their distinct morphological features12. Millions of cells were observed in the UMAP embedding through a range of drug concentrations alongside negative and positive controls to observe perturbations in specific morphological clusters (Supplementary Figure 3B).
In the UMAP embedding, we identified 15 regions of interest (ROI) with high cell density (Figure 3B). A broad density region (ROIs 10,15) contained uninfected cells with satellite populations having characteristic morphologies including cell division (ROI 6) and accumulation of lipids towards the periphery of the cell (ROI 12). A large disconnected region (ROIs 1-4) contained individually infected cells (ROI 4), infected cells in syncytia (ROI 3), and cells adjacent to infected cells (ROIs 1,2) (Figure 3C). Pseudotime of the viral infection progression can be inferred through inspection of cluster populations where cells begin in the main cluster body (ROIs 10,15) and traverse to the infected cluster (ROIs 1-4) where there is punctate viral signal (ROI 1) which progress to isolated infected cells characterized by homogenous nucleocapsid staining throughout the cytoplasm (ROI 2), and ends with infection of surrounding cells and the formation of syncytia (ROIs 3,4). All efficacious compounds deplete ROIs 1-4 and thioguanine, clofazimine, S1RA and gilteritinib show differences in the UMAP cluster dynamics (Supplementary Figure 3B) suggestive of differing MOAs.
Lactoferrin blocks SARS-CoV-2 replication at different stages of the viral cycle
One of the most efficacious hits identified from our screen was lactoferrin, a protein found in milk and other secretory fluids13. We determined that lactoferrin has dose-dependent antiviral activity through a range of MOIs (Figure 4A and B). Previous work on lactoferrin in the context of infection with SARS-CoV-1 suggests that it blocks viral entry by binding heparan sulfate proteoglycans that are important for early viral attachment13. Our studies showed that lactoferrin blocks SARS-CoV-2 infection through entry inhibition and is also capable of rescuing infection when added 1 or 24 hrs p.i. (Figure 4B). Lactoferrin has been proposed to enhance innate interferon responses to limit viral replication within host cells14. Upon treatment, we observed a dose-dependent reduction of viral replication (Figure 4C), which was consistent with elevated mRNA levels of IFNβ and interferon-stimulated genes (ISG15, MX1, Viperin and IFITM3) in lactoferrin-treated Huh-7 cells (Figure 4D). Interestingly, we detected a robust antiviral effect by both holo and apolactoferrin, the latter being the component of widely available dietary supplements. To rule out a mode of action that involved a general iron depletion mechanism, we tested the protein transferrin and found that it was devoid of any anti-SARS-CoV-2 activity at the highest concentration of 2.3 μM (Figure 4E). Lastly, we tested the efficacy of lactoferrin in physiologically relevant iPSC-derived alveolar epithelial type 2 cells (iAEC2s)15,16. Consistent with our findings in Huh-7 cells, pretreatment of iAEC2s with lactoferrin resulted in a significant reduction in the proportion of SARS-CoV-2 infected cells at MOI 10 (Figure 4F).
A clinically effective strategy for antiviral therapies uses a combinatorial (or “drug cocktail”) approach, where compounds with varying MOAs are concomitantly used to target different stages in the viral life cycle and to minimize the risk of drug resistance from single-agent selective pressure. This is especially true for RNA viruses, which are highly variable and can develop drugresistance17. Given the pronounced single-agent efficacy of lactoferrin, we tested whether combinations with remdesivir or hydroxycholoroquine could improve the overall antiviral activity. We found that lactoferrin potentiates the efficacy of both remdesivir (Figure 4G and Supplementary Figure 4A) and hydroxychloroquine (Figure 4H and Supplementary Figure 4B), which are currently explored treatments for SARS-CoV-2 infection. Therefore, combination therapy with lactoferrin could be beneficial in the management of the COVID-19 pandemic by reducing toxicity (e.g., hydroxycholorquine) or consumption (e.g., remdesivir).
DISCUSSION
In this study, we developed an experimental workflow based on high-content imaging and morphological profiling that allows for rapid screening of FDA-approved compounds, leveraging machine learning to determine potential MOA. We identified 15 FDA-approved compounds that limit SARS-CoV-2 infection in vitro. Of these, six were previously reported and serve as a benchmark validation of our endpoints and experimental approach, and nine were hitherto unknown. We demonstrate that this approach is versatile (i.e., it can be applied to both transformed and more physiologically-relevant non-transformed cell lines) and can identify the emergent properties of the infection as well as novel phenotypes that can be perturbed through chemical inhibition.
A high-content morphological cell profiling approach is superior to image cytometry (tabulating percent positive) and plate reader assays for selecting and prioritizing drugs for repurposing. Here, viral staining is not merely an absolute measure for viral infection (or inhibition) but the starting point for a detailed investigation of infection trajectories and observations of numerous phenotypic targets, including inhibition of syncytia formation, viral entry, or viral replication, and modulation of the host cell. We report compounds with strong antiviral activity against SARS-CoV-2 and also their putative MOA.
The UMAP embedding was highly effective at visualizing the virally infected cell population and the progression of the viral infection trajectory was clearly visible. We gained insight into the putative antiviral MOAs via inspection of the cluster populations. For thioguanine and clofazimine, increasing concentrations appear to suppress isolated single infected cells (ROI 4) while a small number of syncytia (ROI 3) are still observable (Supplementary Figure 3B), suggesting a replication inhibition MOA. This observation is consistent with the established MOA of both drugs as inhibitors of nucleic acid synthesis18,19. S1RA reduces ROIs 1-4 evenly, but ROI 13 increases (Supplementary Figure 3B: S1RA) and is characterized by increased cytoplasmic nucleic acid staining (Supplementary Figure 3A: ROI 13), suggesting a host-modulation MOA. Lastly, gilteritinib demonstrates an increase in ROI 12 with treatment (Supplementary Figure 3B: gilteritinib), a cell cluster defined by large and distributed lipid accumulation (Supplementary Figure 3A: ROI 12). Lipid accumulation, in liver-derived Huh-7 cells, is associated with cytotoxicity and is consistent with reduction in viability with escalating dose (Figure 2B)20.
Importantly, our study identified drugs that implicate new molecular targets/pathways in the pathogenesis of SARS-CoV-2 and produce clinically testable and readily translatable hypotheses. As an example, we observed a dose dependent antiviral activity of metoclopramide, a potent Dopamine Receptor D2 antagonist used to treat gastroesophageal reflux disease and prevent other gastrointestinal symptoms, including nausea and vomiting21. Gastrointestinal symptoms have been increasingly reported in more than half of the patients infected by SARS-CoV-22. Notably, investigational drugs like hydroxychloroquine, lopinavir-ritonavir, tocilizumab and others can be associated with gastrointestinal and hepatic adverse events and hence are not ideal for patients already experiencing severe GI symptoms22. Metoclopramide therefore represents an interesting dual-target therapeutic option for COVID-19 patients. Interestingly, the pro-dopaminergic drugs carbidopa, levodopa, and methyldopa promote infection (Supplementary Figure 2), suggesting a dopamine-mediated contribution to viral entry or replication. Additionally, all MEK inhibitors exacerbated viral infection up to 300% indicating a causal role of MEK in SARS-CoV-2 pathogenesis. These in vitro observations should be validated through clinical research that examines whether concomitant presence of drug and SARS-CoV-2 exacerbates COVID-19.
As most FDA-approved drugs target human molecular targets, our screen helped identify crucial host factors involved in SARS-CoV-2 infection. Z-FA-FMK, an irreversible inhibitor of cysteine proteases, including cathepsins B, L, and S23, exhibited potent antiviral activity. A recent report using a pseudovirus indicated cathepsin L is an entry factor of SARS-CoV-224. The antiviral effect of Z-FA-FMK suggests that cathepsin L is a requirement also in the context of SARS-CoV-2 infection and suggests that this molecule could be a useful investigational tool to study virus entry. Similarly, fedratinib is an orally bioavailable semi-selective JAK2 inhibitor, approved by the FDA in 2019 for myeloproliferative neoplasm, a rare blood cancer that causes clotting and fibrosis25. JAK-inhibitors have been proposed for COVID-19 to specifically inhibit TH17-mediated inflammatory responses26,27. In addition, JAK-inhibitors have been proposed to block numb-associated kinase responsible for clathrin-mediated viral endocytosis28. Several JAK-inhibitors are currently evaluated in clinical trials for COVID-19 management, including with baricitinib29, jakotinib (ChiCTR2000030170), and ruxolitinib (ChiCTR2000029580). Further studies examining these Jak-inhibitors could serve as useful tools in the future to elucidate the involvement of the innate immune response in SARS-CoV-2 infection.
The sigma receptors (SigmaR1/R2) are permissive chaperones that mediate endoplasmic reticulum stress response and lipid homeostasis30, processes that have been implicated in early stages of hepatitis C viral infection in Huh-7 cells31 and coronavirus pathogenesis32. We identified two sigma receptor modulators amiodarone33 (SigmaR1 IC50: 1.4 nM, SigmaR2 IC50: 1 nM), and S1RA34 (E-52862; SigmaR1 IC50: 17 nM antagonist, SigmaR2 IC50: > 1 μM) with potent antiviral activity, demonstrating IC50 values of 52 nM and 222 nM, respectively, with limited cell toxicity. Amiodarone is approved for treatment of arrhythmias but, like hydroxychloroquine, has potent cardiotoxic side effects through inhibition of the hERG ion channel35 that limit therapeutic potential. S1RA has completed phase II clinical trials for the treatment of neuropathic pain36,37. Although Gordon et al. identified several other sigmaR1/R2 modulators that inhibited SARS-CoV-2 infection in Vero-E6 cells, antiviral activity for S1RA was not observed38. This suggests that the activity of S1RA is dependent on host cell factors specific to each cell line and, promisingly, that human cells may be more responsive to this compound.
Most noteworthy, our screen demonstrates lactoferrin as a SARS-CoV-2 inhibitor in vitro with multimodal efficacy. We showed efficacy in multiple cell types, including a non-transformed and clinically relevant iPSC-derived model of alveolar epithelium. Lactoferrin gene expression has been shown previously to be highly upregulated in response to SARS-CoV-1 infection39 and, in addition to enhancing natural killer cell and neutrophil activity, lactoferrin blocks viral entry through binding to heparan sulfate proteoglycans. Interestingly, lactoferrin retains anti-SARS-CoV-2 activity 24 hrs p.i., which suggests additional MOA other than simple entry inhibition. Although we cannot conclude a definitive and complete MOA, we show significant host cell modulation through increased expression of several interferon-stimulated genes upon treatment with lactoferrin. Additionally, lactoferrin has been previously shown to decrease the production of IL-640, which is one of the key players of the “cytokine storm” produced by SARS-CoV-2 infection41,42. Importantly, we found that lactoferrin retains activity in both the holo and apo forms, the latter being the component of orally available lactoferrin supplements. Orally available lactoferrin could be especially effective in mitigating the gastrointestinal symptoms that are present in COVID-19 patients43. The mechanisms may be similar to how lactoferrin reduces human norovirus infection through induction of innate immune responses44, especially as lactoferrin gene polymorphisms are associated with increased susceptibility to infectious diarrhea45. If lactoferrin reduces viral load in the GI tract, it could reduce fecal-oral transmission of COVID-1946.
Combination therapies are likely to be required for effectively treating SARS-CoV-2 infection, and this approach has already shown some promise. For example, combination therapy with interferon beta-1b, lopinavir-ritonavir, and ribavirin showed efficacy against SARS-CoV-2 in a prospective, open-label, randomized, phase 2 trial47. We demonstrated in our study that lactoferrin potentiates the antiviral activity of both remdesivir and hydroxychloroquine and could be used as a combination therapy with these drugs, which are currently being used or studied for the treatment of COVID-19. Due to its wide availability, limited cost and lack of adverse effects, lactoferrin could be a rapidly deployable option for both prophylaxis and the management of COVID-19.
Although our findings are promising, further studies are needed to confirm their efficacy in other representative in vitro cell lines and/or clinical studies. UMAP analysis provides limited insight to MOA and serve as a basis for future pharmacological studies specific to our compounds in treatment of SARS-CoV-2. These studies are currently ongoing.
In conclusion, high-content morphological cell profiling for drug repurposing screening enabled the identification of both novel antivirals efficacious against SARS-CoV-2 and compounds that possibly exacerbate SARS-CoV-2 infection. Furthermore, in contrast to other drug repurposing studies, the assay reported here allowed for the identification of potential MOA, including host cell responses. This approach to preclinical testing has promise for identifying other anti-SARS-CoV-2 drugs, rationally designing therapeutic combinations with multiple MOAs, and deployment of these new combinations in a rapid and systemic fashion.
Supplementary Information is available for this paper.
METHODS
Cells and Virus
Vero E6, Caco2 and Huh7 cells were maintained at 37°C with 5% CO2 in Dulbecco’s Modified Eagle’s Medium (DMEM; Welgene), supplemented with 10% heat-inactivated fetal bovine serum (FBS), HEPES, non-essential amino-acids, L-glutamine and 1X Antibiotic-Antimycotic solution (Gibco). iPSC (SPC2 iPSC line, clone SPC2-ST-B2, Boston University) derived alveolar epithelial type 2 cells (iAEC2s) were differentiated as previously described and maintained as alveolospheres embedded in 3D Matrigel in “CK+DCI” media, as previously described (Jacob et al. 2019). iAEC2s were passaged approximately every two weeks by dissociation into single cells via the sequential application of dispase (2mg/ml, Thermo Fisher Scientific, 17105-04) and 0.05% trypsin (Invitrogen, 25300054) and re-plated at a density of 400 cells/μl of Matrigel (Corning, 356231), as previously described (Jacob et al. 2019). SARS-CoV-2 WA1 strain was obtained by BEI resources and was propagated in Vero E6 cells. Viral titers were determined by TCID50 assays in Vero E6 cells (Reed and Muench method) by microscopic scoring. All experiments using SARS-CoV-2 were performed at the University of Michigan under Biosafety Level 3 (BSL3) protocols in compliance with containment procedures in laboratories approved for use by the University of Michigan Institutional Biosafety Committee (IBC) and Environment, Health and Safety (EHS).
Viral Quantification
Vero E6, Caco2 and Huh7 cells were seeded in a 48-well plate at 2×10^4 cells/well incubated overnight at 37°C with 5% CO2. Cells were then infected with SARS-CoV-2 WA1 at a multiplicity of infection (MOI) of 0.2. One hour after infection, cells were harvested (day 0 of infection) or kept at 37°C for 1, 2 and 3 days p.i. Viral titer determination was performed by TCID50 assay on Vero E6 cells of the total virus (supernatant and intracellular fraction). Alternatively, cells were harvested with Trizol and total cellular and viral RNA was extracted with the ZymoGen Direct-zol RNA extraction kit. Viral RNA was quantified by RT-qPCR using the 2019-nCoV CDC qPCR Probe Assay and the probe set N1 (IDT technologies). IFNβ, viperin, MX1, ISG15, IFITM3 and the housekeeping gene GAPDH mRNA levels were quantified by qPCR with SsoAdvanced™ Universal SYBR® Green Supermix (Bio-Rad) with specific primers (IFNβ: F-TTGACATCCCTGAGGAGATTAAGC, R-TCCCACGTACTCCAACTTCCA; MX1: F-CCAGCTGCTGCATCCCACCC, R-AGGGGCGCACCTT CTCCTCA; ISG15:F-TGGCGGGCAACGAATT, R-GGGTGATCTGCGCCTTCA; IFITM3: F-TCCCAC GTACTCCAACTTCCA, R-AGCACCAGAAACACGTGCACT; GAPDH: F-CTCTGCTCCTCCTGTTCGAC, R-GCGCCCCACCAAGCTCAAGA). Fold increase was calculated by using the ΔΔCt method over non-infected untreated Huh-7.
Viral Infectivity Assay
384-well plates (Perkin Elmer, 6057300) were seeded with Huh-7 cells at 3000 cells/well and allowed to adhere overnight. Compounds were then added to the cells and incubated for 4 hours. The plates were then transferred to BSL3 containment and infected with SARS-CoV-2 WA1 at a multiplicity of infection (MOI) of 0.2 in a 10 μL addition with shaking to distribute virus. After one hour of absorption, the virus inoculum was removed and media replaced with fresh compound. Uninfected cells and vehicle-treated cells were included as positive and negative control, respectively. Two days post-infection, cells were fixed with 4% PFA for 30 minutes at room temperature, permeabilized with 0.3% Triton X-100 and blocked with antibody buffer (1.5% BSA, 1% goat serum and 0.0025% Tween 20). The plates were then sealed, surface decontaminated, and transferred to BSL2 for staining with the optimized fluorescent dye-set: anti-nucleocapsid SARS-CoV-2 antibody (Antibodies Online, Cat# ABIN6952432) overnight treatment at 4C followed by staining with secondary antibody Alexa-647 (goat anti-mouse, Thermo Fisher, A21235), Hoechst-33342 Pentahydrate (bis-Benzimide) for nuclei staining (Thermo FIsher, H1398), HCS LipidTOX™ Green Neutral Lipid Stain (Thermo Fisher, H34475), and HCS CellMask™ Orange for cell delineation (Thermo Fisher H32713). iAEC2 maintained in 3D culture were dissociated to single cells and seeded in collagen coated 384-well plates at a seeding density of 8000 cells/well in the presence of 10 μM Y-27632 for the first 48 hours after plating (APExBIO, A3008), and grown to confluence over 72 hours. The following infection, compound treatment, and fixing was identical to that of Huh-7. Staining protocol for the iAEC2s differed slightly with the addition of an anti-acetylated tubulin primary antibody (Cell Signaling, 5335), instead of HCS CellMask Orange, and the use of an additional secondary Alexa 488 antibody (donkey anti-rabbit, Jackson ImmunoResearch, 711-545-152).
Compound Library
The compound library deployed for drug screening was created using the FDA-Approved Drugs Screening Library (Item No. 23538) from Cayman Chemical Company. This library of 875 compounds was supplemented with additional FDA approved drugs and rationally included clinical candidates from other vendors including MedChemExpress, Sigma Aldrich, and Tocris. Our library was formatted in five 384-well compound plates and was dissolved in DMSO at 10 mM. Hololactoferrin (Sigma Aldrich, L4765), apolactoferrin (Jarrow Formulas, 121011) and transferrin (Sigma Aldrich, T2036) were handled separately and added manually in cell culture media. Dilution plates were generated for qHTS at concentrations of 2 mM, 1 mM, 500 μM, 250 μM and 50 μM.
qHTS Primary Screen and Dose Response Confirmation
For the qHTS screen, compounds were added to cells using a 50 nL pin tool Caliper Life Sciences Sciclone ALH 3000 Advanced Liquid Handling system at the University of Michigan Center for Chemical Genomics (CCG). Concentrations of 2 μM, 1 μM, 500 nM, 250 nM and 50 nM were included for the primary screen. Post qHTS screen, all compounds were dispensed using an HP D300e Digital Compound Dispenser and normalized to a final DMSO concentration of 0.1% DMSO. Confirmation dose response was performed in triplicate and in 10-point:2-fold dilution.
Imaging
Stained cell plates were imaged on both Yokogawa CQ1 and Thermo Fisher CX5 high content microscopes with a 20X/0.45NA LUCPlan FLN objective. Yokogawa CQ1 imaging was performed with four excitation laser lines (405nm/488nm/561nm/640nm) with spinning disc confocal and 100ms exposure times. Laser power was adjusted to yield optimal signal to noise ratio for each channel. Maximum intensity projection images were collected from 5 confocal planes with a 3 micron step size. Laser autofocus was performed and nine fields per well were imaged covering approximately 80% of the well area. The Thermofisher CX5 with LED excitation (386/23nm, 485/20nm, 560/25nm, 650/13nm) was also used and exposure times were optimized to maximize signal/background. Nine fields were collected at a single Z-plane as determined by image-based autofocus on the Hoechst channel. The primary qHTS screen was performed using CX5 images and all dose-response plates were imaged using the CQ1.
Image Segmentation and feature extraction
The open source CellProfiler software was used in an Ubuntu Linux-based distributed Amazon AWS cloud implementation for segmentation, feature extraction and results were written to an Amazon RDS relational database using MySQL. A pipeline was developed to automatically identify the nuclei, cell, cytoplasm, nucleoli, neutral lipid droplets and syncytia for feature extraction. Multiple intensity features and radial distributions were measured for each object in each channel and cell size and shape features were measured. Nuclei were segmented using the Hoechst-33342 image and the whole cell mask was generated by expanding the nuclear mask to the edge of the Cell Mask Orange image.
Data Pre-Processing
Cell level data were pre-processed and analyzed in the open source Knime analytics platform48. Cell-level data was imported into Knime from MySQL, drug treatment metadata was joined and features were centered and scaled. Features were pruned for low variance (<5%) and high correlation (>95%) and resulted in 660 features per cell.
Statistical methods and hypothesis testing
Dose-response curves were fit and pairwise differences between experimental conditions were tested using Prism (Graphpad Software, San Diego, CA, USA). Other statistical tests were performed in the statistical programming language and environment R.
Machine Learning - Infectivity score and field-level scoring
Multiple logistic regression as implemented in the statistical language and environment R was used to identify features characteristic of cells within infected wells. Models were fit to cells from infected and uninfected control wells in the first five plate-series of the quantitative high throughput screen. As an independent benchmark, these logistic regression models were validated against a manually selected set of individual infected and uninfected cells; features which degraded performance on the benchmark were excluded from the model. The final model included only virus channel intensity features in the cell and cytoplasm ROIs. As a threshold for initial classification, the minimum value from virus-infected cells in the benchmark was used; the final decision rule is given in Eq. 1.
Then, individual field images from the infected control were categorized as confirmed-infected when the mean feature values, across all cells in the field, were above the threshold in Eq. 1. Using mean values for all 660 cell-profiler features in each field, a random forest classifier was trained to predict a probability of membership in the category of uninfected control fields vs confirmed-infected fields. The output of this random forest classifier is reported as “Probpos” (for the positive, uninfected control), throughout. Field level mean/median feature values were computed and a random forest model was fit between the positive control (32 uninfected wells) and the negative control (32 infected wells, 0.1% DMSO vehicle treated) with 80/20 cross validation. The compound treated wells were scored with the RF model and the efficacy score was normalized to the individual plate.
UMAP embedding
The embed_umap application of MPLearn (v0.1.0, https://github.com/momeara/MPLearn) was used to generate UMAP embeddings. Briefly, each for a set of cells, each feature was per-plate standardized and jointly orthogonalized using sklearn.IncrementalPCA(n_components=379, batch_size=1000). Then features were embedded into 2-dimensions using umap-learn (v0.4.1) (McInnes et al. 2018) with umap.UMAP(n_components=2, n_neighbors=15, min_dist=0, init=‘spectral’, low_memory=True, verbose=True). Embeddings were visualized using Holovies Datashader (v1.12.7)49, using histogram equalization and the viridis color map. Visualizing subsets was done in JMP Pro 14.
Data analytics
HC Stratominer (Core Life Analytics, Utrecht NL) was used as an independent method for hit-calling and performs fully automated/streamlined cell-level data pre-processing and score generation. IC Stratominer was also used to fit dose response curves for qHTS. Compound registration and assay data registration were performed using the open source ACAS platform (Refactor BioSciences github https://github.com/RefactorBio/acas).
Dose-response analysis and compound selection
In qHTS screening, a compound was selected to be carried forward into full dose response confirmation when meeting one of the following criteria: 1) Probpos greater than 0.75 for the median field in at least three concentrations, with per-field cell counts at least 60% of the positive control, and without an observed standard deviation in Probpos across-fields-in-the-well of 0.4 or greater, 2) a dose-response relationship with Probpos was observed (by inspection) across the five concentrations tested, including compounds with Propbos greater than 0.90 at the two highest concentrations, or 3) compounds of interest not meeting this criteria were carried forward if reported positive in the literature or were being evaluated in clinical trials for COVID-19.
Dose response analysis in the confirmation and combinatorial screening
Due to the spatial inhomogeneity of infected cells across a single well, approximately half of the fields were undersaturated, leading to a consistent distribution in Probpos that saturates in the top third of 27 rank-ordered fields (from 9 fields and triplicate wells) for each concentration tested. The Probpos effect for a compound concentration was tabulated by averaging the top third of rank ordered fields. Outlier fields with high Probpos values were visually inspected and eliminated if artifacts (segmentation errors or debris) were observed. Cells treated with known fluorescence drugs including Clofazimine, were confirmed to not have spectral interference. Dose response curves were fit with Graphpad Prism using a semilog 4-parameter variable slope model.
SUPPLEMENTARY INFORMATION
Supplementary Figure 1: Screening assay optimization
Supplementary Figure 2: Compounds exacerbating SARS-CoV2 infection
Supplementary Figure 3: Features of UMAP regions of interest (ROI)
Supplementary Figure 4: Combinatory effects of remdesivir and hydroxycholoroquine with lactoferrin
Supplementary Table 1: Compound Deep Dives
Supplementary File 1: Compound library details
Supplementary File 2: 3D reconstruction video of infected cells
ACKNOWLEDGEMENTS
Funding: University of Michigan Institute for Clinical and Health Research (MICHR) (NCATS- UL1TR002240) and its Center for Drug Repurposing. JZS is supported by the National Institute of Diabetes and Digestive and Kidney Diseases (R01DK120623). JWW is supported by the pharmacological sciences training program (PSTP) T32 training grant. CM is supported by Marie-Slodowska Curie individual fellowship (GA – 841247) and MICHR Postdoctoral Translational Scholars Program. KDA is supported by the I.M. Rosenzweig Junior Investigator Award from the Pulmonary Fibrosis Foundation. JRS is supported by the National Heart, Lung, and Blood Institute (NHLBI – R01HL119215), by the NIAID Novel Alternative Model Systems for Enteric Diseases (NAMSED) consortium (U19AI116482) and by grant number CZF2019-002440 from the Chan Zuckerberg Initiative DAF, an advised fund of Silicon Valley Community Foundation.
The authors would like to thank Matthew Chess for Amazon AWS support, Kevin Yan and Peyton Uhl at Yokogawa for imaging support, Nick Santoro at the University of Michigan Center for Chemical Genomics. We thank David Egan and Wienand Omta from Core Life Analytics for assisting high content data analytics as well as Philip Cheung and Brian Bolt at ReFactor Biosciences for assistance with HTS data registration. Finally, we thank Tracey Schultz and Dianne Jazdzyk for project management.
Footnotes
Conflicts of interest, Jonathan Sexton is the co-founder of Verge Therapeutics, Inc. and owner of Curl Bio, LLC.
Abbreviations
- MOI
- multiplicity of infection
- UMAP
- uniform manifold approximation and projection
- COVID-19
- Coronavirus Disease-2019
- MOA
- mechanism of action
- ROI
- region of interest
- iAEC2
- induced pluripotent stem cell (iPSC)-derived alveolar epithelial type 2 cells
- HCQ
- hydroxychloroquine