Abstract
We report the improved production of recombinant proteins in E. coli, reliant on tightly controlled autoinduction, triggered by phosphate depletion in stationary phase. The method, reliant on engineered strains and plasmids, enables improved protein expression across scales. Expression levels using this approach have reached as high as 55% of total cellular protein. Initial use of the method in instrumented fed batch fermentations enables cell densities of ∼30 grams dry cell weight (gCDW) per liter and protein titers up to 8.1+/−0.7 g/L (∼270 mg/gCDW). The process has also been adapted to an optimized autoinduction media, enabling routine batch production at culture volumes of 20 μL (384 well plates), 100 μL (96 well plates), 20 mL and 100 mL. In batch cultures, cells densities routinely reach ∼ 5-7 gCDW per liter, offering protein titers above 2 g/L. The methodology has been validated with a set of diverse heterologous proteins and is of general use for the facile optimization of routine protein expression from high throughput screens to fed-batch fermentation.
Highlights
Stationary phase protein expression results in high titers.
Autoinduction by phosphate depletion enables protein titers from 2-8 g/L.
Autoinduction has been validated from 384 well plates to instrumented bioreactors.
Introduction
Heterologous protein expression is a standard workflow common in numerous fields of biology and E. coli is the workhorse microbe for routine protein production in academia and industry.1–4 E. coli based processes are used for the production of over 30% of protein based drugs that are on the market today,5–7 and pET based expression in E. coli strain BL21(DE3) and its derivatives is a mainstay of heterologous expression in many labs.
Standard protocols rely on easily prepared media (LB and or TB) but require culture monitoring to optimize induction in exponential phase.8,9 Auto-induction protocols removing the need for manual additions have been developed, most notably by Studier, and require the use of multiple carbon substrates, such as glucose and lactose. After glucose depletion the consumption of lactose induces heterologous expression.10,11 Significant recent work has been done in developing new protocols enabling auto-induction systems focused on using novel auto inducing promoters that respond to a variety of signals from cell density to oxygen limitation.12–16 Despite simplifying expression protocols, many of these approaches still result in relatively low biomass and protein levels and have not been validated in multiple culture systems including instrumented bioreactors. The use of BL21 and its derivatives can be further complicated by heterogenous induction, resulting from lactose based inducers17–19, as well as the accumulation of acetic acid in fermentations with excess carbon source, which can have toxic effects on both cell growth and protein expression.20–24
There remains a need for auto-inducible protein expression methods with tightly controlled expression, minimal overflow metabolism, and a high level of protein expression. Ideally new methods will be adaptable to numerous workflows and culture volumes, from high throughput screening approaches in microtiter plates to larger scale production in instrumented bioreactors, in commercially relevant media.
We report the development of a facile protocol for the routine high level expression of proteins. The method relies on a promoter that is induced by phosphate depletion, where protein expression is induced at the entry into stationary phase. While the expression of heterologous proteins during stationary phase may seem counterintuitive and at odds with maximal production, stationary phase cells can maintain significant metabolic activity and produce high levels of protein.25–29 Specifically, phosphate depletion has been used routinely for heterologous protein expression.30–32 In addition, it has been shown that phosphate depletion can be used to amplify the expression of heterologous proteins using the pET based T7 promoters in E. coli.33 Phosphate dependent promoters are used in an engineered strain of E. coli with minimal acetate production, and near optimal growth rates and yields, offering tightly controlled expression. These strains and plasmids can be used in minimal media in instrumented bioreactors as well as with an optimized autoinduction broth enabling high level batch expression, in cultures as small as 20 μL in 384 well plates, to 100 mL in larger shake flasks.
Results
Initial Characterization of Phosphate Induction with the yibDp gene promoter
To move from IPTG based induction to autoinduction via phosphate depletion, we leveraged a previously reported phoB regulated promoter, specifically a modified promoter of the E. coli yibD (waaH) gene, referred to herein as yibDp, and constructed a plasmid enabling the induction of mCherry upon phosphate depletion (pHCKan-yibDp-mCherry, Table 1, Refer to Supplemental Materials Section 2 for the promoter sequence).26,34–38 We initially evaluated the expression of this construct in BL21(DE3), BL21(DE3) with pLysS and a well characterized E. coli K12 derivative: BW25113.39,40 The accessory plasmid (pLysS) expressing T7 lysozyme, is routinely used to reduce leaky induction in pET based systems.41 Unexpectedly, significant basal expression was observed in BL21(DE3) (Supplemental Figure S1). In contrast, no significant basal expression was observed in BW25113. These results indicate the potential ability of T7 RNA polymerase to recognize this promoter, differential phoB regulation between these strains, or yet additional unknown regulatory differences.
Host Strain Engineering
With BL21(DE3) demonstrating baseline heterogeneous leaky expression with the yibDp promoter, and in light of other routine issues encountered in using BL21 and its derivatives, such as accumulation of acetic acid, we turned to engineering a BW25113 derivative for optimal growth and minimal byproduct formation. We began with a previously reported derivative, strain BWapldf, with deletions in genes leading to common mixed acid fermentation products, such as lactic and acetic acid.43 BWapldf has deletions in the following genes: ackA-pta, pflB, adhE, ldhA, and poxB reducing the rates of production of acetate, formate, lactate and ethanol from overflow metabolites. Deletions of the two global regulators iclR and arcA were next incorporated into this strain. These mutations have been shown to improve biomass yield and reduce overflow metabolism in K12 derivatives. Together these mutations increase flux through the citric acid cycle and glyoxylate bypass and reduce overflow metabolism by increasing i) the rate of oxidation of excess carbon to carbon dioxide and increasing ATP supply.44,45
These strains, as well as a BL21(DE3) pLysS control were initially evaluated in controlled fed batch fermentations, using a defined minimal media (FGM10 media, refer to Materials and Methods) wherein phosphate concentrations limit biomass levels. Growth rates, biomass and byproducts, including acetic acid, were measured. Results are given in Figure 1. In these studies organic acid byproducts, other than acetic acid were not observed. As expected, BL21(DE3) produced acetic acid during growth (Figure 1e). Interestingly, strain BWapldf, despite having numerous deletions had a significantly decreased biomass yield and increased acetic acid production compared to BW25113 (Figure 1f and 1g). The deletion of the two global regulators, arcA and iclR, (strain DLF_R002) recovered biomass yield and virtually eliminated acetic acid production in this host (Figure 1h). Refer to Supplemental Materials Figure S2 for maximal growth rates.
Growth and byproduct formation of E. coli strains in minimal media fermentations. Biomass levels as a function of time for (a) BL21(DE3)pLys (b) BW25113, (c) BWapldf and (d) DLF_R002 respectively. (e-h) Distribution of glucose utilized during growth in minimal medium fermentations: (e) BL21(DE3)pLys, (f) BW25113, (g) BWapldf and (h) DLF_R002 respectively. Results are averages of duplicate fermentations. CO2 was explicitly measured via off-gas analysis for strain BW25113, BWapldf and DLF_R002. In the case of BL21(DE3) pLys, CO2 is included in unknown products required to account for glucose consumption.
Using strain DLF_R002 we next turned to evaluate protein expression in bioreactors using FGM10 media. As mentioned biomass levels supported by FGM10 media are limited by phosphate, and phosphate depletion occurs when biomass levels reach an optical density of ∼ 30-35 or ∼ 10 gCDW/L. In this case we constructed an additional plasmid with GFPuv driven by the yibDp promoter (pHCKan-yibDp-GFPuv, Table 1).46 Results are given in Figure 2a. Biomass levels reached ∼10 gCDW/L producing final GFPuv titers of ∼ 2.7 g/L or 270 mg/gCDW. With the success in low cell density we turned to develop a process with higher cell density utilizing FGM30 media with enough phosphate to support three times the biomass (∼ 30 gCDW/L). Results are given in Figure 2b in this case using DLF_R003. DLF_R003 is a derivative of DLF_R002 with a deletion in the outer membrane ompT protease, which has proteolytic activity even under denaturing conditions, creating issues with purification of recombinant proteins. 62, 63 Specific expression levels were maintained at higher biomass levels resulting in a three fold improvement in GFPuv titer reaching ∼ 8.1g/L.
Autoinduction of GPFuv expression in bioreactors. Triplicate 1L bioreactors, with a) FGM10 minimal media and host strain DLF_R002 bearing plasmid pHCKan-yibDp-GFPuv. b) FGM30 minimal media and host strain DLF_R003 (DLF_R003, ΔompT) bearing plasmid pHCKan-yibDp-GFPuv. Optical density (black lines) and GFPuv were measured over time. Shaded area is standard error of triplicate growth profiles. X’s, triangles and circles are normalized GFPuv fluorescence units from each of the three fermentations. Green line is the best fit of these three expression profiles.
Development of Phosphate Limited Media for Auto-induction
We next turned to the optimization of media formulations for more routine autoinduction via phosphate depletion. Importantly, the fermentations discussed above (Figure 1) were performed with defined minimal media, which while lower in cost in larger scale production, can lead to significant lags when cells transition from a richer cloning and propagation media such as LB. In order to overcome this, seed cultures are often used to adapt the cells to a more minimal media (as they were in this case, refer to Methods) prior to inoculation of bioreactors. For routine lab scale protein expression, media adaptation is not desirable, and rather protocols enabling direct inoculation of production flasks from overnight LB cultures is preferred. As a result, we developed batch autoinduction broth with more complex nutrient sources including yeast extract and casamino acids. Media formulations were developed using standard Design of Experiments methodology (DoE) and evaluated in 96 well plates. These experiments were performed using strain DLF_R002 bearing plasmid pHCKan-yibDp-GFPuv, described above. Briefly, overnight LB cultures were used to inoculate various media in 96 well plates. Biomass and GFPuv levels were measured after 24 hours. Importantly, no phosphate was added to these media, as adequate batch phosphate is supplied in the complex nutrient sources (yeast extract and casamino acids). Results are given in Figure 3. Models built based on these results did not predict significant improvements in expression over the best performing experimentally tested formulations Figure 3a. The media formulation producing the most GFPuv (as measured by relative fluorescence), was renamed AB (autoinduction broth) and used in subsequent studies. To evaluate the time course of growth, phosphate depletion and autoinduction in AB, DLF_R002 pHCKan-yibDp-GFPuv, was grown in AB in the Biolector™ Microreactor. Results are given in Figure 3b.
Media Development using Design of Experiment Methodology. 212 media formulations were evaluated for autoinduction based on phosphate depletion, each comprising different “levels” of casamino acids, yeast extract, trace metals (TM Mix), calcium sulfate (CaSO4), magnesium sulfate (MgSO4), iron(II) sulfate (FeSO4), ammonium sulfate ((NH4)2SO4) and citric acid. a) Upper panel: GFP (green bars) and OD600nm (gray bars) rank ordered plot for all media formulations. Standard deviations are from triplicate experiments. Lower panel: Nutrient concentration levels for all media (Refer to Supplemental Materials Section 4, for specific concentrations for each level). Strain DLF_R002 with plasmid pHCKan-yibDp-GFPuv was used for all experiments. b). GFP fluorescence (green line), phosphate levels (black circles) and OD600nm (black line) for strain DLF_R002 with plasmid pHCKan-yibDp-GFPuv in media #36 (Autoinduction Broth, AB) media. Standard deviations (shaded regions) are from triplicate experiments.
Comparison with current approaches
With the successful development of an optimal autoinduction broth, we turned to a head to head comparison of this approach with the traditional protocols based in LB media as well as the lactose based autoinduction system as developed by Studier.10,11 Due to the availability of a pET-mCherry plasmid (Table 1) mCherry was used as the reporter for this comparison. Specifically, induction of mCherry in BL21(DE3) with pLysS and pETM6-mCherry, using either IPTG based induction in LB media, or lactose autoinduction media was compared to strain DLF_R002 with plasmid pHCKan-yibDp-mCherry in AB. To monitor not only endpoint expression but the dynamics of growth and auto-induction, these studies were performed in the Biolector™. Results are shown in Figure 4. As expected, using E. coli BL21(DE3) and pET based expression, lactose based autoinduction media enabled higher cell densities and higher expression levels of mCherry than induction with IPTG (Figure 4a). Phosphate based autoinduction using strain DLF_R002 enabled a further 40% increase in final mCherry levels at 24 hrs over BL21(DE3) (Figure 4b). Cytometry was used to further characterize these two expression systems (Figure 4c). Phosphate based autoinduction not only had more homogeneous induction but also more expression per cell. Additionally, one of the major potential reported advantages of BL21(DE3) and related strains is reduction in Lon protease activity.47 To investigate the impact of Lon activity in these strains, a previously reported fluorescent Lon substrate was used to monitor the impact of this protease. Specifically, a circular permutation variant of GFP with a Lon degradation tag (GFP-β20-cp6) was used.48 Results are given in Figure 4d. The expression level of the Lon substrate was significantly reduced compared to a non-Lon substrate for both strains, but at least with this specific reporter, no significant difference in cell specific Lon activity was observed between BL21(DE3) and DLF_R002.
Head to head comparison of autoinduction via phosphate depletion with pET based expression in BL21(DE3). a) pET based mCherry expression in BL21(DE3) with pLysS. mCherry (red lines) and biomass levels (OD600nm, black lines) over time. Solid lines- lactose based autoinduction. Dashed lines- IPTG induction in LB media. b) yibDp based mCherry expression in DLF_R002 in AB media mCherry (red lines) and biomass levels (OD600nm, black lines). c) Cytometry of induced populations (gray- empty vector control, red- pET-mCherry in BL21(DE3) + pLysS, green - yibDp-mCherry in DLF_R002). d) expression of the Lon substrate (GFP-β20-cp6) in BL21(DE3) and DLF_R002. Normalized fluorescence is relative fluorescence normalized to optical density. Black line- GFP control (non Lon substrate) in DLF_R002. Red line - BL21(DE3) expressing GFP-β20-cp6. Green line - DLF_R002 expressing GFP-β20-cp6. Shaded areas are standard deviations of at least three replicates.
Optimization of High Throughput Expression Protocols
The DoE results discussed above in Figure 2, were generated in 96 well plates using high shaking speeds in combination with the Duetz system, which utilizes a series of specialty plate covers to minimize evaporative volume loss, while enabling adequate aeration.49–51 As rapid growth and expression are not only a function of media, but culture aeration, we sought to evaluate the optimal aeration conditions for microtiter plate based expression (96 and 384 well plates). In addition to orbital shaking speed and orbit diameter, culture volume (impacting the surface area to volume ratio) can have a significant impact on oxygen transfer and in this case protein expression as shown in Figure 5. In standard 96 well plates, volumes less than 100 μL gave optimal expression. As 384 well plates have a very small area, the surface tension at the culture meniscus can limit mixing. As a result, small amounts of surfactant (commercial antifoam) were added to improve aeration in 384 well plates, Supplemental Figure S4. In 384 well cultures volumes less than 20 μL gave optimal expression with AB media. As can be seen in Figure 5, expression levels, using AB in 384 well plates, did not reach levels observed in the 96 well plates or other culture systems. We hypothesized this was due to remaining mass transfer limitations. We tested this hypothesis by evaluating an autoinduction media identified in the DoE results (AB-C7) that yielded reduced biomass and expression levels, but as a result would have a lower maximal aeration requirement. With lowered biomass levels, and aeration demands, expression levels in 384 well plates reached that of other culture systems using this media (Supplemental Figures S5 and S6). Although total protein levels are higher in AB media, the use of AB-C7 media may be preferred when using 384 well plates in order to minimize oxygen limitations.
Optimization of autoinduction in batch cultures at various scales. Impact of various fill volumes on expression in AB. Varying fill volumes in 384 and 96 well plates as well as 250 mL baffled Erlenmeyer and 2.8 L Fernbach flasks. When using 384 well plates, 0.05% polypropylene glycol (2000MW) was added to the media. DLF_R002 with plasmid pHCKan-yibDp-GFPuv was used for all experiments.
Development of Shake Flask Protocols
For any expression protocol to be widely applicable, it cannot rely on controlled bioreactors and/or specialty plate systems, but be accessible to the average laboratory. Toward this goal, we turned to the optimization of the protocol in shake flask cultures. As mentioned above, one primary difference between bioreactor experiments and shake flask cultivation is oxygen transfer. While instrumented bioreactors and micro-reactors such as the Biolector™ can easily meet these mass transfer targets, standard shake flask have reported oxygen transfer rates anywhere from 20 mmoles/L-hr (for unbaffled flasks) to 120 mmoles/L-hr for baffled glassware.52 A key potential consequence of shake flasks is oxygen limitation and reduced growth rates and expression. As a consequence we sought to evaluate the optimal culture conditions to achieve maximal expression in shake flask cultures with a focus on baffled 250 mL Erlenmeyer flasks and 2.8 L Fernbach flasks. As seen in Figure 5, again culture volume plays a key role in optimal protein expression, with 20mL or lower being optimal in baffled 250 mL Erlenmeyer flasks and 100 mL or lower being optimal in 2.8 L Fernbach flasks. These results were obtained in shakers where an adhesive mat is used to hold flasks and shaking speeds are limited to 150 rpm. Using clamps, higher shaking speeds may enable optimal expression using larger shake flask fill volumes.
Utility with a diverse group of recombinant proteins
All results discussed to this point relied on easily quantified reporter proteins (GFPuv and mCherry), which are easily expressed to high levels in most expression hosts. In order to evaluate the broader applicability of the approach, the expression of a group of other diverse proteins was evaluated in several vector backbone contexts in the phosphate autoinduction protocol. These included: a borneol diphosphate synthase, a terpene synthase with a C-terminal mCherry tag53,54, a mutant alanine dehydrogenase55, a malonyl-CoA synthetase 56, a benzoylformate decarboxylase57, glutathione S-transferase58, HIV-1 nef protein59, a mutant citramalate synthase 60, and a human glycine acyltransferase with an N-terminal chitin binding tag61. (Refer to Table 1 for construct details.) As can be seen in Figure 6, expression levels ranged from ∼ 10 % of total protein for a large terpene synthase to 55% in the case of alanine dehydrogenase, achieving maximal protein concentrations of 275 mg/gCDW in the best case.
Autoinduction in AB in 96 well plates for a diverse set of recombinant proteins including: GFPuv, mCherry, AlaDh* (a mutant alanine dehydrogenase), Nef (HIV-1 Nef protein), hGLYAT2 (human glycine acyltransferase-2 an N-terminal chitin binding tag), cimA3.7 (a mutant citramalate synthase), GST, mdlC (benzylformate decarboxylase), matB (malonyl-CoA synthetase), and SBS (bornyl-diphosphate synthase with a C-terminal mCherry tag). Percent of total expression is given for three replicates. Refer to Supplemental Figure S7 for an example SDS-PAGE result.
Discussion
Two-stage expression induced upon phosphate depletion enables a facile and versatile approach to routine high level recombinant protein production. In the case of GFPuv, protein titers approaching 2 g/L in batch microtiter plates and shake flasks. These titers correspond to protein yields of 20 μg of protein per well in 384 well plates, 170 μg per well in 96 well plates, and 40 mg and 180 mg of protein in 250 mL Erlenmeyer and Fernbach flasks, respectively. Importantly, current results also support homogenous expression using phosphate depletion. Expression levels will of course vary as a function of the protein and expression construct, but initial testing with additional proteins supports expression levels from 10 to 55 percent of total cellular protein, which at the high end is ∼ 275 mg/gCDW of recombinant protein and represents significant improvements in heterologous protein expression in E. coli64,65. More work is needed to better understand the mechanisms unexpectedly high expression levels observed in this system. Initial adaptation to instrumented bioreactors, enabled GFP titers as high as 2.7 g/L, 270 mg/gCDW and 55% expression, with 10 gCDW/L. Process intensification to increase biomass levels to ∼ 30 gCDW/L, while maintaining specific expression levels, resulting in a ∼3 fold improvement in GFP titers reaching 8.1 g/L. Further optimization of bioreactor protocols may enable much higher cell density cultures. If truly high cell density fermentations (from 50-100 gCDW/L of biomass) can be developed with equivalent expression levels, protein titers in the range of 15-30 g/L or higher in some cases can be expected.
There are however several challenges with the existing protocol. Firstly, proteins of interest must be cloned into a plasmid with the yibDp promoter. Screening of additional phosphate (phoB) regulated promoters may yield improved or varied expression. Adaptation of the system for use with existing pET based plasmids would also be of utility for proteins that are already cloned into these standard vectors. Secondly, preparation of AB media is more complicated than making routine LB media.
Despite these potential limitations, the development of strains, plasmids and protocols for autoinduction based on phosphate depletion not only enables improved expression, with impressive protein titers, but also a scalable methodology. A single host and plasmid can be used in high throughput screening of initial expression constructs or mutant variants all the way through to instrumented bioreactors. These results support the biosynthetic potential of phosphate depleted stationary phase cultures of E. coli. Decoupling growth from production also has the potential to enable future studies focused on key remaining limitations in protein biosynthesis in this well characterized host.
Materials & Methods
Reagents and Media
Unless otherwise stated, all materials and reagents were of the highest grade possible and purchased from Sigma (St. Louis, MO). Luria Broth, lennox formulation with lower salt was used for routine strain and plasmid propagation and construction and is referred to as LB below. All media formulations including stock solutions are described in Supplemental Materials, Section 4. Working antibiotic concentrations were as follows: kanamycin (35 μg/mL), chloramphenicol (35 μg/mL), ampicillin (100 μg/mL), tetracycline (5 μg/mL), apramycin (100 μg/mL). Polypropylene glycol (MW of 2000) and casamino acids were obtained from VWR international (Suwanee, GA), product numbers E278-500G and 90001-740, respectively. Yeast extract and MOPS (3-(N-morpholino)propanesulfonic acid) were obtained from Biobasic (Amherst, NY)., product numbers G0961 and MB0360, respectively.
Strains and Strain Construction
E. coli strains BL21(DE3) (Catalogue # C2527) and BL21(DE3) pLysS (Catalogue # C3010) were obtained from New England BioLabs, Ipswich, MA. Strain BW25113 was obtained from the Yale E. coli Genetic Stock Center (https://cgsc.biology.yale.edu/,40,66. Strain BWapldf was a gift from George Chen (Tsinghua University).43 Chromosomal modifications were made using standard recombineering methodologies67 through scarless tet-sacB selection and counterselection, strictly following the protocols of Li et al.68 The recombineering plasmid pSIM5 and the tet-sacB selection/counterselection marker cassette were kind gifts from Donald Court (NCI, https://redrecombineering.ncifcrf.gov/court-lab.html). Briefly, the tet-sacB selection/counterselection cassette was amplified using the appropriate oligos supplying ∼50 bp flanking homology sequences using Econotaq (Lucigen Middleton, WI) according to manufacturer’s instructions, with an initial 10 minutes denaturation at 94 °C, followed by 35 cycles of 94 °C, for 15 seconds, 52 °C for 15 seconds, and 72 °C for 5 minutes. Cassettes used for “curing” of the tet-sacB cassette were obtained as gBlocks from (Integrated DNA Technologies, Coralville, Iowa, USA). The ompT protease gene was deleted using standard recombineering methods by selection for an apramycin selectable marker obtained from the pMDIA plasmid.69 pMDIAI was a gift from Sheng Yang (Addgene plasmid # 51655; http://n2t.net/addgene:51655; RRID:Addgene_51655). Primers and DNA sequences are given in Supplemental Materials Section 3. Chromosomal modifications were confirmed by PCR amplification and sequencing (Genewiz, NC) using paired oligonucleotides, either flanking the entire region.
Plasmids
pETM6, and pETM6-mCherry was a gift from Mattheos Koffas (Addgene plasmids ## 49795 and # 66534). pLysS was obtained from New England Biolabs (NEB, Ipswich, MA). Plasmids made in this study were constructed using G-blocks™ and/or PCR products and assembled using NEBuilder® HiFi DNA Assembly Master Mix following manufacturer’s protocol (NEB, Ipswich, MA). Polymerase chain reactions were performed with Q5 DNA Polymerase (NEB, Ipswich, MA). pSMART-HC-Kan (Lucigen, WI), pTWIST-Chlor-Medium Copy (Twist Biosciences San Francisco, CA), pTWIST-Kan-High Copy (Twist Biosciences San Francisco, CA) and pCDF (derived from pCDF-1b, EMD Millipore, Burlington, MA) were used as a backbone vectors in these studies. Sequences of all oligos and synthetic DNA are given in Supplemental Materials Section 3. All plasmid sequences were confirmed by DNA sequencing (Genewiz, NC). Sequences and maps are available with Addgene. Refer to Table 1 for Addgene numbers. pCDF was constructed from pCDF-1b by first amplifying the vector with primers pCDF-1b-ampl1 and pCDF-1b-ampl2, to remove the lacI gene, followed by DNA assembly with pCDF-MCS. All genes were codon optimized for expression in E. coli using IDT’s codon optimization tool (https://www.idtdna.com/CodonOpt). pHCKan-yibDp-GFPuv, pHCKan-yibDp-mCherry, pSMART-Ala1, pHCKan-yibDp-GFP and pHCKan-yibDp-GST were constructed by DNA assembly with linearized pSMART-HC-Kan obtained from lucigen with yibDp-GFPuv, yibDp-mCherry, yibDp-ald*, yibDp-GFP, yibDp-GST and yibDp-GFP-β20cp6 G-blocks™ respectively. pCDF-yibDp-matB was obtained by DNA assembly of a G-block™ (yibDp-matB) with pCDF-ev which was amplified by PCR with SR2_rc and SL1_rc. pCDF-yibDp-mdlC-his was constructed from the assembly of 2 synthetic G-blocks™ yibDp-mdlC-his1 and yibDp-mdlC-his2. pTCmc-yibDp-SBS-mCherry, pTKhcan-yibDp-cimA3.7, pHCKan-yibDp-GFP-β20cp6 and pHCKan-yibDp-Nef were constructed at TWIST Biosciences (San Francisco, CA) using the pTWIST-Chlor-Medium-Copy, pSMART-HC-Kan, pSMART-HC-Kan and pTWIST-Kan-High-Copy vectors respectively. pHCKan-yibDp-GFP-cp6 was constructed by Q5 mutagenesis or “Around the World” PCR of plasmid pHCKan-yibDp-GFP-β20cp6, to remove the Lon degron tag, with primers GFP_cp6_F and GFP_cp6_R followed by DpnI treatment phosphorylation and self ligation, using KLD reaction mix obtained from NEB. (Ipswich, MA). pHCKan-yibDp-CBD-hGLY was constructed by DNA assembly with 2 PCR products amplified from (i) a plasmid coding hGLY under a T7 promoter (pHCKan-T7-CBD-hGLY, Addgene #134940, constructed at TWIST Biosciences) using primers pS-yibD-hGLY_F and pS-yibD-hGLY_R; and (ii) a plasmid containing the yibDp promoter, (pHCKan-yibDp-ald*-alaE, Addgene # 134939) using primers pS-yibDp-FOR and pS-yibDp-REV).
BioLector™ Experiments
Growth and fluorescence measurements were obtained in a Biolector (m2p labs, 11 Baesweiler, Germany) using a high mass transfer FlowerPlate (CAT#: MTP-48-B, m2p-labs, Biolector settings were as follows: RFP gain=40, GFP gain=20, Biomass gain 20, shaking speed 1300 rpm, temperature 37 °C, humidity 85%. Single colonies of each strain were inoculated into 5 mL LB with appropriate antibiotics and cultured at 37 °C, 150 rpm overnight. Overnight cultures OD600nm was measured and normalized to OD600nm = 25.8 μL of normalized overnight culture was inoculated into 792 μL of the appropriate medium with appropriate antibiotics and transferred into wells of the FlowerPlate. Every strain was analyzed in triplicate.
Microtiter Plate Based Growth and Expression
Plasmids were transformed into host strains using standard protocols. Glycerol stocks were prepared for each strain plate by adding equal volume of overnight LB culture with sterile 20% glycerol. 3 μL of glycerol stocks were used to inoculate overnight culture in 150 μL LB medium with appropriate antibiotics. 96 well and 384 well plates used in these studies were obtained from Genesee Scientific (San Diego, CA, Cat #: 25-104) and VWR ((Suwanee, GA, Cat #: 10814-224). Plates were covered with sandwich covers (Model # CR1596, 96 well plates) (Model # CR1384, 384 well plates) obtained from EnzyScreen, Haarlam, The Netherlands). These covers ensured minimal evaporative loss during incubation. Microtiter plates were cultured at 37 °C, 300 rpm for 16 hours, shaker orbit is 50 mm. This combination of orbit and minimal shaking speed is required to obtain needed mass transfer coefficient and enable adequate culture oxygenation. After 16 hours of growth, a 1% volume of overnight culture was inoculated into autoinduction media plus the appropriate antibiotics. Plates were again covered with sandwich covers and grown at 37 °C, 300 rpm for 24 hours at which point samples were harvested for analysis, ie SDS-PAGE, fluorescence and optical density readings. To test the expression level of the protein panel, a volume of 100 μL of AB media per well was used.
Autoinduction Media Development
The autoinduction media was developed using DoE definitive screening designs and JMP software (SAS, Cary, NC). 1X trace metal mix contains 0.01 mL/L of concentrated H2SO4, 0.0012 g/L CoSO4*7H2O, 0.001 g/L CuSO4*5H2O, 0.0012 g/L ZnSO4*7H2O, 0.0004 g/L Na2MoO4*2H2O, 0.0002 g/L H3BO3, and 0.0006 g/L MnSO4*H2O. 0.25 g/L citric acid, 68 mM (NH4)2SO4, 0.16 mM FeSO4, 10 mM MgSO4, 0.0625 mM CaSO4, 2X trace metal mix, 2.5 g/L yeast extract, and 2.5 g/L casamino acid were used as the starting center point, 4X and 1/4X of the center point values were used as the upper and lower concentration ranges. Definitive screening design was performed in 5 iterations. Center point, upper and lower concentration ranges for future iterations were determined based on DoE results from the previous iteration. For testing all 212 media from the DOE, one mL of each media was prepared in deep well 96-well plates. Media were prepared from sterilized liquid stocks: (NH4)2SO4 (3 M), Citric Acid (25 g/L), FeSO4 (20 mM), MgSO4 (1 M), CaSO4 (5 mM), Trace Metals (250 X), Yeast Extract (100 g/L), Casamino Acids (100 g/L), Thiamine HCl (50 g/L), MOPS (1 M), and glucose (500 g/L). As all media contained equal amounts of Thiamine HCl, MOPS, glucose, and Kanamycin, these were added to each media first, followed by water. Then worklists were prepared to add the remaining media components using Tecan Evo for liquid handling. In between addition of media components, plates were shaken in a Benchmark Incu-Mixer™ MP at 1500 rpm to ensure proper mixing and prevent media precipitation. Once completed, 148.5 uL of media was distributed to triplicate 96 well plates and each well was inoculated with 1.5uL of overnight LB culture. The plates were covered with EnzyScreen covers and shaken at 300 rpm at 37°C. After 24 hours, OD and fluorescence were measured.
Shake Flask Growth and Expression
Glycerol stocks were used to inoculate overnight cultures in 5 mL of LB media, with appropriate antibiotics. After 16 hours of growth, a 1% volume of overnight culture was inoculated into autoinduction media plus the appropriate antibiotics. Flasks cultures were grown at 37 °C, 150 rpm in baffled 250 ml Erlenmeyer flasks for 24 hours at which point samples were harvested for analysis.
Fermentation Seeds
Single colony from transformation plate was inoculated into 5 mL LB with appropriate antibiotics and cultured at 37°C, 150 rpm for 16 hours. 200 μL of the LB culture was inoculated into 20 mL SM10+ media with appropriate antibiotics in 250 ml shaker flasks. The culture was incubated at 37°C with a shaking speed of 150 rpm for 16 hours, at which time OD600nm is usually between 6 and 10. The culture was harvested by centrifugation at 4000 rpm for 15 min, the supernatant was discarded and the cell culture was normalized to OD600nm = 10 using FGM10 media. Seed vials were prepared by adding 1.5 mL of 50% glycerol to 6.5 mL of normalized OD600nm = 10 culture in cryovials, and stored at −60 °C.
1L Fermentations
An Infors-HT Multifors (Laurel, MD, USA) parallel bioreactor system was used to perform 1L fermentations. Vessels used had a total volume of 1400 mL and a working volume of up to 1 L. Online pH and pO2 monitoring and control were accomplished with Hamilton probes. Offgas analysis was accomplished with a multiplexed Blue-in-One BlueSens gas analyzer (BlueSens. Northbrook, IL, USA). Culture densities were continually monitored using Optek 225 mm OD probes, (Optek, Germantown, WI, USA). The system used was running IrisV6.0 command and control software and integrated with a Seg-flow automated sampling system (Flownamics, Rodeo, CA, USA), including FISP cell free sampling probes, a Segmod 4800 and FlowFraction 96 well plate fraction collector. Tanks were filled with 800 mL of either FGM10 or FGM30 medium, which has enough phosphate to target a final E. coli biomass concentration ∼ 10 gCDW/L or 30 gCDW/L respectively. Antibiotics were added as appropriate. Phosphate, glucose, thiamine and antibiotics were added after cooling the tank vessel containing the rest of the media components. Frozen seed vials were thawed on ice and 8 mL of seed culture was used to inoculate the tanks. After inoculation, tanks were controlled at 37 °C and pH 6.8 using 14.5 M ammonium hydroxide and 1 M hydrochloric acid as titrants. The following oxygen control scheme was used to maintain the desired dissolved oxygen set point. Initial air flow rate was set to 0.3 vvm and initial agitation of 300 rpm. In order to maintain a dissolved oxygen concentration of 25%, airflow was increased to 1 vvm, then agitation was increased to a maximum of 1200 rpm, and finally oxygen was increased in the gas mixture from 0% additional to a maximum of 80% additional.
Glucose feeding was as follows. For 10 gCDW/L fermentations, starting batch glucose concentration was 25 g/L. A constant concentrated sterile filtered glucose feed (500 g/L) was added to the tanks at 0.5 g/h once dissolved oxygen concentration dropped from 100% to 25% and ramped up to 1 g/h, once increased agitation was required to maintain the dissolved oxygen setpoint. For 30 gCDW/L fermentations, starting batch glucose concentration was 25 g/L. Concentrated sterile filtered glucose feed (500 g/L) was added to the tanks at an initial rate of 3.5 g/L-h when the rate of agitation increased above 1000 rpm. This rate was then increased exponentially, doubling every 1.083 hours (65 min) until 40 g total glucose had been added.
Organic Acid Quantification
Two orthogonal methods were used to quantify organic acids including lactate, acetate, succinate, fumarate, pyruvate, malate and others. The first method was a reverse phase UPLC method. Chromatographic separation was performed using a Restek Ultra AQ C18 column (150 mm × 2.1 i.d., 3 μm; CAT#: 9178362, Restek Corporation, Bellefonte, PA) at 30 °C. 20 mM phosphoric acid was used as the eluent. The isocratic elution rate was at 0.8 mL/min, run time was 1.25 min. Sample injection volume was 10 μL. Absorbance was monitored at 210 nm. The second method relied on ion exchange chromatography and refractive index detection. A Phenomenex Rezex™ ROA-Organic Acid H+ (8%) (30 x 4.6 mm; CAT#: 00A-0138-E0, Phenomenex, Torrance, CA) was used for a 30 minute isocratic separation using a mobile phase of 5 mM H2SO4, at a flow rate of 0.5 mL/min. Again sample injections were 10 μL. Organic acid elution times were as follows: Pyruvate 13.3 min, Citramalate 13.75 min, Citrate 10.9 min, Lactate 17.5 min and Acetate 20.3 min.
Glucose Quantification
Similarly two methods were used to quantify glucose. The first was identical to the second organic acid method, utilizing the Resex column for ion exchange linked to refractive discussed above, wherein glucose eluted at 12.5 minutes. The second method was a similar UPLC method also relying on ion exchange and refractive index detection. Chromatographic separation was performed using a Bio-Rad Fast Acid Analysis HPLC Column (100 x 7.8 mm, 9 μm particle size; CAT#: #1250100, Bio-Rad Laboratories, Inc., Hercules, CA) at 65 °C. 5 mM sulfuric acid was used as the eluent. The isocratic elution was as follows: 0–0.1 min, flow rate increased from 0.4 mL/min to 0.42 mL/min, 0.1–12 min flow rate at 0.48 mL/min. Sample injection volume was 10 μL.
Determination of Strain Dry Weight
Culture samples (5 ml, n=3) were taken and washed 2X with deionized water via centrifugation and resuspension. After wash steps the OD of the samples were determined at 600 nm. Subsequently, samples were filtered over pre-weighed nitrocellulose filters (pore size, 0.45 μm). Filters were washed extensively with demineralized water and dried in a microwave oven for 2 min and weighed to determine correlation of OD600nm and gDCW, which was 0.35.
Phosphate Quantification
Phosphate concentrations were determined using the BioMOL Green colorimetric assay from Enzo Life Sciences (Farmingdale, NY) according to manufacturer’s instructions.
Fluorescence measurements
Optical densities and fluorescent were measured using a Tecan Infinite 200 plate reader. Measurements were performed using 200 uL in black 96 well plates (Greiner Bio-One, Reference Number: 655087). Optical density was read at 600 nm (Filter from Omega Optical, Part Number: 3019445) and adjusted by subtracting a blank, followed by correction for pathlength and dilutions. For GFP fluorescence, samples were excited at 412 nm (Omega Optical, Part Number: 3024970) and emission was read at 530 nm (Omega Optical, Part Number: 3032166) using a gain of 60. Fluorescence readings were then adjusted for dilution.
SDS-PAGE and GFP quantification
The OD600nm of culture samples to be analyzed was measured before harvesting the cells by centrifugation, which was done at 4000 rpm for 15 minutes. The cells were resuspended in 50 μl of phosphate-buffered saline with protease inhibitors (ThermoFisher Scientific, MA, product number A32965) and 5 mM EDTA. 25 μl of the resuspended cells were mixed with 25 μl of 2x Laemmli sample buffer (Biorad, CA) and boiled for 5 minutes at 95 T. The boiled samples were centrifuged at 14,000 rpm for 10 minutes and 20 μg of total protein per sample was then loaded into a 4-15% gradient Mini-Protean TGX precast protein gel (Biorad, CA) and ran at 140 V. The volume loaded per sample was calculated as volume=100/OD600nm The gels were stained using Coomassie Brilliant Blue R-250. Gels were imaged using a UVP PhotoDoc-It™ Imaging System (Analytik Jena, CA) and expression levels were quantified using ImageJ (NIH, MD). To correlate GFPuv fluorescence with grams of GFPuv, samples were taken wherein both (i) fluorescence was measured as described above and (ii) expression level was calculated as described above. Total cellular protein was estimated at 500 mg/gDCW or 50% of dry cell weight70. In these comparison, 3.24 e 9 relative fluorescent units corresponded to 1 gram of GFPuv. This correlation was also used to calculate GFPuv titers across all experiments.
Cytometry
BL21(DE3) pLys bearing pETM6 (negative control) or pETM6-mCherry as well as DLF_R002 bearing pSMART-HC-Kan (negative control) or pHCKan-yibDp-mCherry were grown in 5 ml of LB overnight at 37°C, 150 rpm. After 16 hours, 1% volume of overnight BL21(DE3)pLys or DLF_R002 cultures were used to inoculate 20 ml of LB or AB media in 250 baffled Erlenmeyer and incubated at 37 °C, 150 rpm. BL21(DE3) cultures were induced at OD600nm ∼0.3 with 1 M IPTG solution to a final 1 mM IPTG concentration. Samples for BL21(DE3) were collected 20 hours after induction with IPTG. Samples for DLF_R002 were collected after 24 hours of inoculation. Samples were serially diluted 1000-fold with sterile DI water before analyzing them in a Thermo Attune NXT flow cytometer (ThermoFisher Scientific, MA). Samples were run at a 12.5 μl/min flow rate. Fluorescence measurements were taken from the 620/15 band pass filter after exciting the cells with a yellow laser at 561 nm. Forward scatter vs time plot was monitored during the run to ensure no clogging occurred. The forward scatter height vs forward scatter area plot was also monitored to ensure no cell clumping. Forward scatter height vs side scatter height plots were analyzed using DLF_R002 bearing pSMART-HC-Kan to determine the appropriate gating to exclude small particles from being counted as events. A forward scatter height of 10,000 and a side scatter of 2,500 were used for gating for all samples. Data was analyzed using FlowJo v10.6.1 (BD, NJ).
Author contributions
Z. Ye and R. Menacho-Melgar contributed equally to this work. R. Menacho-Melgar and Z. Ye constructed plasmids and strains, performed biolector, media development and fermentation experiments. E. Moreb performed micro-fermentations. E. Moreb and T. Yang performed shake flask studies. J. Efromson performed fermentations. J.S. Decker assisted in DoE studies. Z. Ye, R. Menacho-Melgar, E. Moreb and M. Lynch designed experiments and analyzed results. All authors wrote revised and edited the manuscript.
Conflicts of Interest
M.D. Lynch and Z. Ye have a financial interest in DMC Biotechnologies, Inc. R. Menacho-Melgar, Z. Ye and M.D. Lynch have filed patent applications on strains and methods discussed in this manuscript.
Acknowledgements
We would like to acknowledge the following support: the NSF EAGER: #1445726 DARPA# HR0011-14-C-0075, ONR YIP #12043956, and DOE EERE grant #EE0007563, as well as the North Carolina Biotechnology Center 2018-BIG-6503. J.S. Decker was supported in part by the NIH Biotechnology Training Grant (T32GM008555). We would like to thank Professor George Chen (Tsinghua University), for the kind gift of E. coli strain BWapldf, and M. Munson for help with cytometry experiments.
Footnotes
Results for higher cell density fermentations