Strategies for optimization of heterologous protein expression in E. coli: Roadblocks and reinforcements

https://doi.org/10.1016/j.ijbiomac.2017.08.080Get rights and content

Abstract

E. coli is most preferred system used for the production of recombinant proteins in bacteria and the availability of improved genetic tools/methods are making it more valuable than ever. Major challenges faced by this expression system are the expression of unusually difficult/complex proteins with rare codons or membrane and toxic proteins. The proteins expressed either in large amount or hydrophobic in nature tend to form insoluble mass. Despite the appropriate expression system, some proteins express at very low level or not at all. Choosing the correct expression system/protocols are obligatory for the substantial expression of protein in the native form. A number of vectors, their compatible hosts and culture conditions can be used to express recombinant proteins in large amounts and in native form. Also, vectors with the fusion tags/chaperons facilitate protein expression in soluble fraction and assist in proper protein folding besides restoring the native structure of protein. The recovery of native proteins from insoluble inclusion bodies can be achieved by optimization of refolding conditions. In the present review, we discussed recent updates on prokaryotic expression system for successful heterologous gene expression in E. coli and focused on strategies to maximize the yields of native recombinant proteins.

Introduction

Genetic engineering and recombinant nucleic acid (DNA or RNA) technologies are exploited to obtain new combinations of genetic material from different organisms. Genetic engineering was first accomplished by Herbert Boyer and Stanley Cohen in 1973, to create the functional organism that combined the genetic information from different species and replicated them very nicely [1]. They introduced genes from toad Xenopus laevis into bacteria, E. coli and demonstrated that these genes were actively expressed generation after generation [2].

There are many hosts used for the production of recombinant protein such as bacteria, yeast, insect, plants and animals. E. coli has been the most popular means of producing recombinant proteins for over two decades from not only prokaryotic but also eukaryotic origins, as it can grow on inexpensive media under well defined laboratory conditions and have very short doubling time i.e., 20 min. Due to its rapid growth, selection of mutants is easy and convenient. Moreover, E. coli cells are highly efficient to receive (incorporate) foreign DNA and express recombinant proteins at very high rate [3]. Various expression systems are available for various applications. Approximately, 80% proteins with solved three-dimensional structures submitted to the protein data bank (PDB) in 2003 were expressed in an E. coli expression system [4]. Genentech, was the first company to commercialize the human protein somatostatin produced by genetically modified E. coli [5] followed by human insulin in 1978 [6]. Presently, a lot of life-saving drugs are produced in E. coli. But till now, heterologous gene expression in E. coli is not that easy task as it looks.

Extracellular production of recombinant protein is highly desirable in order to reduce the complexity of a bioprocess as well as to improve the product quality. Use of E. coli expression system is limited due to the lack of secretion systems for efficient release of proteins into the growth medium, limited ability to facilitate extensive disulfide-bond formation, inefficient cleavage of the amino terminal methionine, protein inactivity, its toxicity and inability to confer posttranslational modifications which can result in lowered protein stability and increased immunogenicity [7], [8]. Leaky expression is another major problem associated with various expression systems in this organism. With advancement in rDNA technology, numerous engineered plasmid vectors and hosts have been developed to express these difficult proteins [9], [10]. But still, it completely relies on appropriate selection and availability of strains. Researcher have used various molecular strategies to express difficult proteins in available expression system by optimizing various parameters such as, cultivation temperature, inducer concentrations, cell growth and co-expression with chaperons at the time of induction and use of additives in culture medium [11], [12], [13].

Though in the last 25 years, several advanced systems were developed for heterogenous proteins and various reviews were published in this area [7], [14], [15], [16], [17], no comprehensive detail is available at one place. Therefore, an attempt has been made to summarize the available information in this area till date. This review describes various vectors/hosts/methods/strategies used for the high level expression of recombinant protein in E. coli.

Section snippets

Basic elements for the expression of recombinant protein

The recombinant protein expression is achieved through a number of steps and the overall steps are shown in Fig. 1. The two main components required for recombinant protein expression are expression vector and expression host/compatible host

Factors affecting expression of heterologous proteins

In different hosts, expression of protein is controlled by several internal factors. These factors are somewhat different than wild type as discussed below.

Protein expression in inclusion bodies

In E. coli, the high-level recombinant protein expression often leads to the accumulation of insoluble aggregated folding intermediates in the cytoplasm as inclusion bodies [116]. The misfolded denatured protein molecules in the inclusion bodies are devoid of biological activity [117]. So, the protein molecules have to be solubilized and refolded from these inclusion bodies. The main reason for the poor recovery of active protein from inclusion bodies is the loss of secondary structure and

Optimization of various factors affecting expression of recombinant protein in soluble fraction

The induction conditions are important for the production of recombinant proteins and they have to be optimized to increase the yield of the product. Several approaches have been developed to prevent protein accumulation in the form of inclusion bodies.

Bright side of inclusion body formation

It is often undesirable to obtain protein expression in the form of inclusion bodies, but it can be advantageous too. The main advantages of inclusion body formation are: (i) protein homogeneity, (ii) easy isolation, (iii) resistance to proteolytic attack by cellular proteases, (iv) very high level protein expression, (v) cost-effective down-stream processing. Although dealing with inclusion bodies is cumbersome process but there are some advantages also, like in case of toxic proteins [96] or

Translocation of recombinant protein

To circumvent problems with inclusion body formation, two different methods have been used.

  • 1)

    Translocate the recombinant protein to the oxidizing environment of the sub-compartment of E. coli, periplasm. There are mainly four reasons to translocate recombinant protein into the periplasm: (i) oxidizing environment facilitates the disulfide bond formation, (ii) periplasm contain less protease than cytoplasm so recombinant proteins are more biologically active, stable and there is increased

Shuttle vectors for protein expression

Sometimes, E. coli couldn’t work as suitable host for protein expression due to a number of reasons like post-translational modifications, disulfide-bond rich proteins, codon usage. Moreover, expression in nearby model organism is more efficient to understand the role of protein in any of cellular processes. So, shuttle vectors have been used for the ease of screening in E. coli and recombinant protein expression in some other hosts and some of the shuttle vectors for E. coli as one host are

Conclusion

Every protein is different, so optimized conditions for one protein expression would never work similarly for other protein. This review deals with the number of methods and strategies to optimize the expression of recombinant protein. For successful expression of difficult proteins in E. coli, one has to seek optimal combination of vector (promoter/tag) and compatible host as discussed above. In addition, codon usage and chaperons has to be taken care of. In the end, media optimization helps a

Conflict of interest

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgements

Authors dully acknowledges CSIR and ICMR, India for the financial support provided to Prof. Jagdeep Kaur and fellowship to Jashandeep Kaur.

Glossary

1
The origin of replication (also called the replication origin) is a particular sequence in a genome at which replication is initiated
2
Affinity tags are artificial polypeptides which were usually grafted either onto the N- or C-terminus of a target protein through inserting the cDNA sequence which encoded the tag peptide into a matching open reading frame of the target protein
3
A molecular chaperone is defined as any protein that interacts with and aids in the folding or assembly of another

References (299)

  • W. Hillen et al.

    Control of expression of the Tn10-encoded tetracycline resistance operon II. Interaction of RNA polymerase and TET repressor with the tet operon regulatory region

    J. Mol. Biol.

    (1984)
  • H.P. Sørensen et al.

    Advanced genetic strategies for recombinant protein expression in Escherichia coli

    J. Biotechnol.

    (2005)
  • T.H. Grossman et al.

    Spontaneous cAMP-dependent derepression of gene expression in stationary phase plays a role in recombinant expression instability

    Gene

    (1998)
  • F.W. Studier

    Use of bacteriophage T7 lysozyme to improve an inducible T7 expression system

    J. Mol. Biol.

    (1991)
  • N.M. Stano et al.

    T7 lysozyme represses T7 RNA polymerase transcription by destabilizing the open complex during initiation

    J. Biol. Chem.

    (2004)
  • D. Stüber et al.

    System for high-level production in Escherichia coli and rapid purification of recombinant proteins: application to epitope mapping, preparation of antibodies, and structure-function analysis

    Immunol. Methods

    (1990)
  • H. Bujard et al.

    [26] A T5 promoter-based transcription-translation system for the analysis of proteins in Vitro and in Vivo

    Methods Enzymol.

    (1987)
  • X. Xu et al.

    High-level expression of the recombinant hybrid peptide cecropinA (1–8)–magainin2 (1-12) with an ubiquitin fusion partner in Escherichia coli

    Protein Expr. Purif.

    (2007)
  • V.E. Galkin et al.

    Cleavage of bacteriophage λ cI repressor involves the RecA C-terminal domain

    J. Mol. Biol.

    (2009)
  • C.A. Love et al.

    Stable high-copy-number bacteriophage λ promoter vectors for overproduction of proteins in Escherichia coli

    Gene

    (1996)
  • P. Srivastava et al.

    Overexpression and purification of recombinant human interferon alpha2b in Escherichia coli

    Protein Expr. Purif.

    (2005)
  • A.C.L. Velez et al.

    Enhanced production of recombinant thermo-stable lipase in Escherichia coli at high induction temperature

    Protein Expr. Purif.

    (2013)
  • E. Amann et al.

    Tightly regulated tac promoter vectors useful for the expression of unfused and fused proteins in Escherichia coli

    Gene

    (1988)
  • B. Miroux et al.

    Over-production of proteins in Escherichia coli: mutant hosts that allow synthesis of some membrane proteins and globular proteins at high levels

    J. Mol. Biol.

    (1996)
  • U. Brinkmann et al.

    High-level expression of recombinant genes in Escherichia coli is dependent on the availability of the dnaY gene product

    Gene

    (1989)
  • W.A. Prinz et al.

    The role of the thioredoxin and glutaredoxin pathways in reducing protein disulfide bonds in the Escherichia coli cytoplasm

    J. Biol. Chem.

    (1997)
  • J.F. Kane

    Effects of rare codon clusters on high-level expression of heterologous proteins in Escherichia coli

    Curr. Opin. Biotechnol.

    (1995)
  • C. Kurland et al.

    Errors of heterologous protein expression

    Curr. Opin. Biotechnol.

    (1996)
  • A.M. Baca et al.

    Overcoming codon bias: a method for high-level overexpression of Plasmodium and other AT-rich parasite genes in Escherichia coli

    Int. J. Parasitol.

    (2000)
  • H.P. Sørensen et al.

    Production of recombinant thermostable proteins expressed in Escherichia coli: completion of protein synthesis is the bottleneck

    J. Chromatogr. B

    (2003)
  • N.A. Burgess-Brown et al.

    Codon optimization can improve expression of human genes in Escherichia coli: a multi-gene study

    Protein Expr. Purif.

    (2008)
  • Z. Fang et al.

    Expression and production of soluble Mycobacterium tuberculosis H37Rv mycosin-3

    Biochem. Biophys. Rep.

    (2016)
  • F.W. Studier

    Protein production by auto-induction in high-density shaking cultures

    Protein Expr. Purif.

    (2005)
  • S.N. Cohen et al.

    Construction of biologically functional bacterial plasmids in vitro

    Proc. Natl. Acad. Sci.

    (1973)
  • J.F. Morrow et al.

    Replication and transcription of eukaryotic DNA in Esherichia coli

    Proc. Natl. Acad. Sci.

    (1974)
  • J.E. Cronan et al.

    Escherichia coli as an experimental organism

    Encycl. Life Sci

    (2005)
  • K. Terpe

    Overview of bacterial expression systems for heterologous protein production: from molecular and biochemical fundamentals to commercial systems

    Appl. Microbiol. Biotechnol.

    (2006)
  • K. Itakura et al.

    Expression in Escherichia coli of a chemically synthesized gene for the hormone somatostatin

    Science

    (1977)
  • D.V. Goeddel et al.

    Expression in Escherichia coli of chemically synthesized genes for human insulin

    Proc. Natl. Acad. Sci.

    (1979)
  • S.C. Makrides

    Strategies for achieving high-level expression of genes in Escherichia coli

    Microbiol. Rev.

    (1996)
  • L.-M. Guzman et al.

    Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter

    J. Bacteriol.

    (1995)
  • X.Y. Liu et al.

    Cloning and expression of visfatin and screening of oligopeptides binding with visfatin

    Int. J. Clin. Exp. Med.

    (2014)
  • B. García-Fraga et al.

    Optimized expression conditions for enhancing production of two recombinant chitinolytic enzymes from different prokaryote domains

    Bioprocess Biosyst. Eng.

    (2015)
  • D.T. Huyen et al.

    Expression of flagellin FLjB derived from Salmonella enterica serovar typhimurium in Escherichia coli BL21

    TAP CHI SINH HOC

    (2015)
  • Y.E. Kim et al.

    Molecular chaperone functions in protein folding and proteostasis

    Annu. Rev. Biochem

    (2013)
  • M. Alibolandi et al.

    Chemical assistance in refolding of bacterial inclusion bodies

    Biochem. Res. Int.

    (2011)
  • F. Baneyx et al.

    Recombinant protein folding and misfolding in Escherichia coli

    Nat. Biotechnol.

    (2004)
  • G.J. Gopal et al.

    Strategies for the production of recombinant protein in Escherichia coli

    Protein J.

    (2013)
  • X. Zhao et al.

    Several affinity tags commonly used in chromatographic purification

    J. Anal. Methods Chem.

    (2013)
  • W. Schumann et al.

    Production of recombinant proteins in Escherichia coli

    Genet. Mol. Biol.

    (2004)
  • Cited by (259)

    View all citing articles on Scopus
    View full text