Abstract
In modern societies, biodegradation of hydrophobic pollutants generated by industry is important for environmental and human health. In Gram-negative bacteria, biodegradation depends on facilitated diffusion of the pollutant substrates into the cell, mediated by specialised outer membrane (OM) channels. Here we show, via a combined experimental and computational approach, that the uptake of monoaromatic hydrocarbons such as toluene in Pseudomonas putida F1 (PpF1) occurs via lateral diffusion through FadL channels. Contrary to classical diffusion channels via which polar substrates move directly into the periplasmic space, PpF1 TodX and CymD direct their hydrophobic substrates into the OM via a lateral opening in the channel wall, bypassing the polar barrier formed by the lipopolysaccharide leaflet on the cell surface. Our study suggests that lateral diffusion of hydrophobic molecules is the modus operandi of all FadL channels, with potential implications for diverse areas such as biodegradation, quorum sensing and gut biology.
Introduction
The production of pollutants in modern, industrialized society is a burden on the environment and poses substantial risks for human health because many of these compounds are toxic, carcinogenic and teratogenic1–4. One prominent class of pollutants are monoaromatic hydrocarbons (MAH) such as BTEX (benzene, toluene, ethylbenzene and xylene), which naturally occur in crude oil. They are among the most abundant chemicals produced worldwide by the petrochemical industry, with benzene alone accounting for ~50 million metric tons. Due to their limited chemical reactivity and hydrophobic character, MAH and many xenobiotics (“compounds foreign to life”) are highly persistent within the environment. An additional concern with MAH is their relatively high aqueous solubility (~ 5 mM for toluene) and the risk that these compounds end up in drinking water4. Besides the obvious health-related factors, there is also a considerable economic burden from the presence of pollutants within the environment, due to the high costs of the treatment and cleanup of contaminated sites by various abiotic methods. These factors have generated an enormous interest in the metabolisation of pollutants by bacteria (biodegradation), and the utilisation of such bacteria in the removal of pollutants (bioremediation)5–7.
AH are predominantly degraded by aerobic soil bacteria, which utilize these compounds as sole sources of carbon and energy. Many AH biodegraders are Gram-negative bacteria, and genera with an abundance of biodegrading bacteria include Sphingomonas (α-proteobacteria), Burkholderia (β-proteobacteria), and Pseudomonas (γ-proteobacteria). Pseudomonas putida F1 (PpF1) was isolated from a polluted creek in Urbana (Illinois, USA), and was the first biodegrader whose genome was sequenced (https://genome.jgi.doe.gov/portal/psepu/psepu.info.html). PpF1 has a large genome (6.0 Mb) and is a versatile organism found in soil and water that can grow on benzene, toluene, ethylbenzene and p-cymene (isopropyltoluene) as sole sources of carbon (Supplementary Fig. 1) 8–11. A practical advantage of PpF1 is that its biodegrading capabilities are encoded on the genome and not on large catabolic plasmids. PpF1 has two operons devoted to the biodegradation of MAH, tod and cym/cmt, adjacent to each other on the genome (Supplementary Fig. 1)12.
The first step in AH metabolism is the introduction of oxygen into the aromatic ring, carried out by a multi-component dioxygenase that has been widely used in biocatalytic syntheses of chiral chemicals13,14. This step, as well as the subsequent enzymatic steps in AH degradation, occurs in the cytoplasm. Since the plasma membrane is permeable for hydrophobic molecules (“hydrophobics”)15,16, biodegradation in Gram-negative bacteria depends on the presence of diffusion channels in the outer membrane (OM), which does form a barrier for hydrophobics due to the lipopolysaccharide (LPS) in the outer leaflet17.
Thus far, only the FadL OM channel family has been shown to mediate transport of hydrophobics18,19. FadL from E. coli (EcFadL) is the archetype of the family and is a long chain fatty acid (LCFA) transport channel20–22. The lumen of the 14-stranded β-barrel of EcFadL is occluded by an N-terminal ~40-residue plug domain23, reminiscent of the plug or cork domain of much larger TonB dependent active transporters. A “lateral diffusion” transport mechanism for LCFA uptake by EcFadL has been established via detailed structure-function studies22. The key structural feature in EcFadL is an opening in the side of the barrel wall caused by an inward kink in β-strand S3. LCFA binding in an adjacent high-affinity site causes a conformational change in the plug N-terminus, opening up a diffusion pathway through the lateral opening into the OM22–24.
With respect to MAH uptake, the crystal structure of the putative toluene channel TodX from PpF1 has been determined previously and is similar to that of EcFadL in aspects such as the presence of a plug domain and a lateral opening at the same position as in EcFadL25. However, TodX also has a “classical”, polar channel through the plug domain that leads directly into the periplasmic space. This feature, coupled with the much higher aqueous solubilities of MAH compared to LCFA (mM vs. nM), raises the possibility that MAH transport occurs via a “classical” mechanism like that used by most other OM channels, i.e. directly into the periplasmic space25.
Here we show, by using an integrated approach combining growth assays, X-ray crystallography and molecular dynamics (MD) simulations, that MAH uptake as the first step in biodegradation does occur via lateral diffusion. The fact that uptake of hydrophobics with very different structural and physicochemical properties (i.e. LCFA and MAH) occurs via the same mechanism suggest that lateral diffusion is the modus operandi for all FadL-family channels and their large ensemble of potential transport substrates, with potential implications for large areas of Biology. In addition, FadL channels could be utilised to develop genetically engineered biodegrading bacteria that are more effective in the uptake of pollutants or have wider substrate specificities. Applications of FadL channels could also be envisioned for biosensor and biocatalyst bacterial strains for more efficient detection of pollutants and for better syntheses of pharmaceutical and other industrial products26.
Results
TodX and CymD are MAH uptake channels
PpF1 has three FadL orthologs, of which TodX (Pput_2883) and CymD (Pput_2901) are located in the adjacent tod and cym/cmt operons respectively. The third ortholog (Pput_4030) is not in an operon and, since it is most similar to EcFadL (29% sequence identity), will be referred to as F1FadL. To enable structure-function studies we constructed the triple knockout strain PpF1-3, with clean deletions for TodX, CymD and F1FadL (ΔF1fadLΔtodXΔcymD). To link FadL channels to MAH uptake, a variety of assays in liquid culture were tested, including bioluminescence in the PpF1 reporter strain PpG4 (ref. 27) and its triple knockout G4-3. All these assays were very challenging, being hampered either by MAH-related toxicity issues or an inability to observe reproducible differences between the wild type and triple knockout strains. In addition, uptake assays using radiolabeled substrates as used for LCFA uptake by EcFadL22,24 are clearly impractical for volatile compounds such as MAH. We therefore opted to grow various strains on minimal medium agar plates with toluene or p-cymene vapor as the sole carbon source (Methods), and report colony size as a semi-quantitative measure for growth. As shown in Fig. 1, both for toluene and p-cymene there are clear differences in growth between wild type PpF1 and the triple PpF1-3 knockout. PpF1-3 still grows slowly on MAH vapor, most likely as a result of spontaneous (i.e. non protein-mediated) OM diffusion. A further deletion of oprG and tcpY, other potential hydrophobics uptake channels28,29, in the PpF1-3 strain to generate PpF1-5, did not reduce the background (Fig. 1). Toluene causes the most prominent background growth (Figs. 1a,b), most likely due to its smaller size and higher volatility compared to p-cymene. The background colonies with p-cymene have similar diameters compared to those with toluene but are more translucent, possibly because they are thinner, generating very clear phenotypes on imaged plates (Fig. 1b).
a,b Representative growth of various PpF1 strains with and without Tn7 complementation on minimal medium with vapor-phase toluene (a) or p-cymene (b) as the sole carbon source; 1, PpF1; 2, PpF1-3; 3, PpF1-5; 4, PpF1-3-todX; 5, ppF1-3-cymD; 6, PpF1-3-f1fadL. c Individual colony sizes from 2 independent plates after 4 days growth on toluene at 20 °C (top) or 5 days on p-cymene at 30 °C (bottom), in the absence (white) or presence of 0.1% (grey) or 0.5% arabinose (black). Reported values represent mean ± s.d (n ~ 10).
To complement PpF1-3, we used the bacterial Tn7 transposon to integrate various alleles at the single attTn7 site of the bacterial chromosome, under the control of the PBAD promoter (Methods). In the absence of arabinose, only the wild type PpF1 strain grows, and none of the complemented strains grow better than the PpF1-3 background (Fig. 1c). Both TodX and CymD complement growth of PpF1-3 on toluene in the presence of 0.1% arabinose. Increasing the arabinose concentration to 0.5% gives slightly better growth relative to 0.1%, but only for PpF1-3-todX and PpF1-3-cymD, demonstrating that even relatively high concentrations of the sugar by itself do not allow growth. There is no complementation with F1FadL, indicating that LCFA channels do not allow MAH uptake and strengthening previous data, based on lack of LCFA uptake by TodX, that FadL channels are substrate specific25. Interestingly, TodX and CymD differ in their ability to complement growth on MAH. Compared to PpF1-3-cymD, PpF1-3-todX grows slightly slower on toluene and clearly does not grow on p-cymene (Fig. 1). This suggests that even closely related MAH channels have different specificities, with CymD being able to transport more bulky substrates, such as p-cymene.
Molecular dynamics simulation reveal an MAH binding site
Extensive attempts to identify an MAH binding site within TodX via experimental approaches failed, including the use of non-volatile heavy atom MAH derivatives (e.g. iodotoluene) in co-crystallisation and soaking. Mixed micelles in the presence of mM concentrations of MAH derivatives have very different properties than those of pure detergent, leading to phase separation problems. In addition, the C8E4 crystallisation detergent is present at relatively high concentrations (~13 mM) and will bind preferentially to the hydrophobic areas inside TodX. To obtain information about substrate binding, we therefore performed equilibrium molecular dynamics (MD) simulations with TodX embedded in a physiological, asymmetrical OM, with benzene used as the substrate (Methods). Besides being a physiological substrate for PpF1, benzene has the additional advantage for MD in that it is small and symmetric. We placed three benzene molecules near the extracellular end of the hydrophobic channel that was previously identified in TodX based on bound detergent25 (Extended Data Fig. 2). Two independent simulations were performed and showed that benzene can enter the protein readily (1 and 2 benzenes entered in the two independent simulations respectively). In one simulation, the lone benzene entering the channel became restricted to a fairly well-defined region close to the N-terminus after ~50 ns. During the simulation in which two benzenes entered the protein, the first benzene to enter remained near a similar region of the protein, but not as close to the N-terminal domain and was not restricted to the same extent as the lone benzene in the first simulation. The second benzene initially diffused freely, but then rapidly also moved into the region occupied by the first benzene, such that after 100 ns the two benzene molecules simultaneously occupied a region near the N-terminus and remained there (Supplementary Fig. 2).
a, Movement of a single benzene within the unrestrained wildtype protein, corresponding to the simulation shown in Supplementary Fig. 2a. The benzene becomes confined to the P-pocket after the protein undergoes a conformational rearrangement to state II, after about 60 ns. The surface of the plug domain (yellow) is shown to emphasise the difference in conformation. b, Benzene is mobile with the N-terminus of the protein restrained in state I (left panel), whereas restraining the N-terminus in state II rapidly confines the benzene to the P-pocket. c, Benzene freely diffuses through the protein in simulations of the N-terminal deletion mutant. All simulations show benzene molecules from 250 equally spaced frames, according to the colour scheme indicated (0 ns, red; 250 ns, violet).
We next performed a cluster analysis on the trajectories to identify dominant conformational states of the protein during the simulations. This analysis enabled identification of two distinct conformational states of the N-terminus (Supplementary Fig. 3). The first state (state I) corresponds to that of the crystal structure, with the α-amino group of Thr1 close to Arg136. Following a conformational change, the Thr1 α-amino group moves more than 10 Å such that it comes close to Leu105 (Supplementary Fig. 4). This conformation is stabilised through interactions between Ser103, Leu105, Pro371 and Thr1 and between Pro371 and Gln2. Interestingly, this state (state II) appears to confine the benzene molecule to a fairly well-defined binding pocket close to the N-terminus, which we designate as the principal binding pocket (P-pocket). To further investigate the effect of different conformational states of the N-terminus on benzene dynamics, we performed additional MD simulations, but with the N-terminus restrained to either state I or state II. With the N-terminus occupying state I, the benzene molecule is extremely mobile and diffuses back and forth (Fig. 2), without preference for any particular part of the protein. By contrast, with the N-terminus in state II, the benzene molecule diffuses rapidly towards the N-terminus and remains in the P-pocket. Thus, these simulations suggest that a specific conformation of the N-terminus is required for high-affinity substrate binding.
This hypothesis is supported by a simulation where the N-terminal 4 residues are removed, and in which the benzene remains highly mobile (Fig. 2c). This observation is reminiscent of EcFadL, where deletion of the first 3 residues of the N-terminus causes a disruption of the high-affinity LCFA binding site and a loss of transport activity24. Analogous to Phe3 in EcFadL, the side chain of Gln2 in state II is part of the P-pocket, which is additionally defined by the side chains of Leu132, Leu162, Leu164, Leu166, Val272, Phe275, Val313, Leu326 and Ile370 (Fig. 3).
a, Cartoon model showing state II TodX residues mediating the largest number of interactions with benzene from equilibrium MD simulations as green stick models. These residues form the P-pocket. b, Comparison of the location of residues in TodX and CymD that form the largest number of interactions with benzene and p-cymene respectively. Residues are coloured by the percentage of simulation time for which the interactions occur, with data included for both independent simulations of each protein. Interactions are defined as an inter-atomic distance of < 4.0 Å.
Lateral opening mutants are defective in MAH uptake
Previously, we demonstrated the lateral diffusion mechanism for LCFA transport by EcFadL via site-directed removal of the lateral opening, by “flattening” the inward-kinked part of strand S3 (Fig. 4)22. Crucially, closing off the EcFadL lateral opening did not lead to any other changes in the protein structure, and expression levels of the mutants were similar as for wild type. We therefore opted to use the same strategy for TodX and CymD, and reasoned that a lack of growth on MAH after closing the lateral opening would provide strong evidence for a lateral diffusion mechanism. The alternative experiment, i.e. closing the classical channel through the TodX plug domain25, would likely be much more difficult given that the plug of EcFadL does not tolerate changes well.
Structures of the FadL channels discussed in this study. The lateral openings (delineated by the green surface) are similar in size in the wild type structures of TodX and CymD and differ only slightly from EcFadL, in that the latter does not have a bulge in S2 like TodX and CymD, but a more pronounced kink in S3. The two TodX mutant channels ΔS2 and ΔS2S3 have much smaller lateral openings, while retaining overall high structural similarity to the wild type protein.
The lateral opening in TodX involves both β-strand S2 and S3, contrasting with EcFadL in which only strand S3 is involved. Strand S2 in TodX bulges outward, whereas S3 has a less pronounced kink in the same place as EcFadL (Fig. 4). To flatten the S2 bulge, we removed seven residues that were not visible in the density of TodX, replaced a proline with alanine and inserted an alanine to connect the strands (Methods; ΔS2). To flatten the S3 kink in the ΔS2 mutant, we removed the conserved glycine residue G104 to give ΔS2S3. We determined the X-ray crystal structures of both TodX variants which showed that, as expected, the size of the lateral opening is drastically reduced as a result of the flattening of the strands (Fig. 4, Supplementary Fig. 5 and Supplementary Table 1), while the remainder of the mutant structures are very similar to those of wild type.
To enable similar experiments on CymD, we first determined its crystal structure. Despite considerable efforts, we could only obtain one crystal form that diffracted to sufficient resolution for structure determination (Supplementary Table 1). The CymD structure is very similar to that of TodX but lacks density for ~30% of the protein due to mobility of several extracellular loops within the crystal (Supplementary Fig. 5). Importantly however, the visible parts of the structure show that the S2 bulge and the S3 kink are present and in the same location as in TodX. To allow molecular dynamics simulations, we modelled the missing parts of CymD by using the TodX structure (Fig. 4; Methods). For site-directed mutants, we generated ΔS2 in the same way as for TodX. To assess the importance of the S3 kink, the ΔS3 variant was made by deletion of the conserved residue G105. The final CymD mutant was G105A, made to assess the importance of the invariant glycine at this position. Due to the difficulties in crystallising wild type CymD and its similarity to TodX, no attempts were made to crystallise CymD lateral opening mutants.
Plate growth assays with vapor-phase toluene showed background growth for the TodX and CymD lateral opening mutants, indicating that they are defective in toluene uptake (Figs. 5a,b). For TodX, background growth is already observed for the ΔS2 mutant, so that the effect of the additional flattening of strand S3 in the ΔS2S3 variant cannot be assessed. For CymD, the ΔS2 and ΔS3 mutants both cause background growth, indicating that the relatively minor change of flattening S3 is sufficient to impede lateral diffusion (Figs. 5a,b). As mentioned above, growth on p-cymene vapor is observed only with CymD, and the CymD lateral opening mutants have the same phenotypes with toluene and p-cymene. Interestingly, the invariant glycine residue in the S3 kink is not essential for function, as growth for the G105A variant is only slightly lower than that mediated by wild-type CymD (Fig. 5b).
a, Representative growth of Tn7-complemented PpF1-3 strains on toluene (left panel) and p-cymene vapour in the presence of 0.5% arabinose; 1, F1-3; 2, F1-3-todX; 3, F1-3-todX ΔS2; 4, F1-3-todX ΔS2S3; 5, F1-3-cymD; 6, F1-3-cymD ΔS2; 7, F1-3-cymD ΔS3; 8, F1-3-cymD G105A. b, Corresponding colony sizes for toluene (black bars) and p-cymene (white bars) growth from a, taken from 2 different plates (mean ± S.D; n = 5-10). c, Representative growth of pHERD30-complemented PpF1-3 strains on toluene (left panel) and p-cymene vapour in the presence of 0.1% arabinose; 1, F1-3-cymD; 2, F1-3-empty plasmid; 3, F1-3-cymD ΔS2; 4, F1-3-cymD ΔS3; 5, F1-3-cymD ΔS2S3; 6, F1-3-cymD G105A. d, Corresponding colony sizes for toluene (black bars) and p-cymene growth (white bars) from c, taken from 2 different plates (mean ± S.D; n = 5-10). e, Coomassie-stained SDS-PAGE gel of pHERD30-complemented PpF1-3 strains after IMAC. lanes 1/7, empty plasmid; lanes 2/8, CymD wild type; lanes 3/9, CymD ΔS2; lanes 4/10, CymD ΔS3; lanes 5/11 CymD ΔS2S3; lanes 6/12 CymD G105A. Arrows indicate approximate positions of CymD variants before (lanes 1-6) and after boiling (lanes 7-12). As is the case for many OMPs, CymD remains partially folded in SDS without boiling, leading to faster migration in the gel and a lower apparent molecular weight (~31 kD compared to ~48 kD after boiling). Molecular weight markers are indicated on the left.
Since growth will depend on the amounts of the various MAH channels in the OM, it was important to determine their relative expression levels. For this, we used the broad host-range pHERD30 plasmid to complement PpF1-3 with the complete set of C-terminally His-tagged cymD alleles via induction with arabinose. For the plasmid-based expression, 0.5% arabinose results in poor growth for all variants, most likely due to toxicity effects associated with relatively high-levels of OMP overexpression, and therefore 0.1% arabinose was used. The observed phenotypes are very similar for chromosomal and plasmid-based complementation (Figs. 5a,c). Importantly, the OM protein levels for all CymD mutants are similar to those of the wild type (Fig. 5e), indicating that the loss-of-growth phenotypes are not due to lack of protein in the OM and supporting the hypothesis that MAH are taken up by PpF1 via lateral diffusion.
MD simulations favor lateral diffusion over classical transport
To provide additional support for lateral diffusion we next carried out a series of umbrella sampling and steered MD simulations. In order to provide an energetic explanation of where the barriers to permeation through the classical route are located, we first calculated the free energy profile for benzene permeation through TodX via the classical route using umbrella sampling. The free energy profiles are constructed by setting the bulk water regions on the extracellular and periplasmic sides equal to 0 kT. The energetics of benzene within the region of TodX that is bounded by the mouth of the barrel on the extracellular side and the lateral gate are favourable (~ −1 to −7kT) compared to being located in bulk water, indicating there is an energetic incentive for benzene to enter the channel (Fig. 6). This makes sense, given that most of the benzene pathway is lined by hydrophobic residues, and is occupied by detergent in the wild type TodX structure24. However, there is a barrier of ~2kT to benzene diffusing past the lateral pore (Z ~ 0) along the classical route. Beyond this barrier, there is a region of favourable energy where the channel is lined with residues Tyr9, Ile31, Phe100, Val134 and Arg136, followed by an even larger barrier of ~7kT at the region containing the hatch domain (Z ~ 20-25 Å; Fig. 6). This barrier is caused by the polar constriction region formed by the hatch domain and which includes residues Glu47, Ser85 and Ser428. Thus, given the two energetic barriers encountered en route to the periplasmic mouth of the protein, the free energies indicate that the classical pathway is not a feasible route for benzene translocation into the periplasmic space.
An overlaid potential of mean force profile (PMF; left) and the force profile from the steered MD via the classical pathway (right) on the cartoon representation of TodX, with benzene positions shown along the reaction coordinate (centre). The TodX plug domain is coloured yellow, S2 and S3 are coloured magenta.
There are 14 individual umbrella sampling windows within the region z = −13 to −5 Å, which contains the P-pocket. The rearrangement of the N-terminus to state II was observed in one of these windows (corresponding to Z ~ −10 Å), although there was no direct contact between benzene and the N-terminus (the closest distance was ~ 5.5 Å, but only for two short periods of the simulation). This rearrangement was not observed in any of the other umbrella sampling windows (148 in total). This supports the notion that conformational rearrangement in the N-terminus is connected to benzene presence and confinement to the P-pocket. We next employed steered MD to ‘pull’ benzene along the classical route. The resulting force profile identified the location of barriers for benzene permeation, in close agreement with the umbrella sampling calculations (Fig. 6), providing justification for employing the much cheaper steered MD approach for the subsequent simulations. Next, steered MD simulations to pull benzene through TodX with the rearranged N-terminus (state II) were performed. Interestingly, the resultant force profile reveals the first energetic barrier to be absent and consequently, regions that previously were energetically separated are now more connected (Supplementary Fig. 6). However, the second, larger energy barrier inside the plug domain is unchanged, indicating that classical diffusion remains unfavourable. We note that while the steered MD simulations are reliable in showing large energetic barriers as validated with our comparison to umbrella sampling, they are less reliable in reproducing the fine details of energy wells. It is therefore difficult to evaluate, for example, whether the region at the mouth of the lateral gate (Z ~0) is lower in energy than the surrounding area using steered MD.
As detailed above, the rearrangement of the N-terminus to state II confines the benzene to the P-pocket around Z −10 Å, and removes the first energetic barrier at (Z ~ −10-2 Å) in the force profile constructed from steered MD simulations. However, the force profile is still increasing, making it unlikely that diffusion to the lateral opening would happen spontaneously. Indeed, benzene is very stably confined to the P-pocket in our equilibrium simulations, and how it gets to the lateral opening at Z ~ 0 is presently unclear. At the lateral opening, benzene would have three options compared to permeation along the classical route; (i) remain inside the protein, (ii) leave the protein via the same route it entered or (iii) leave the protein via the lateral gate and thereby enter the OM. We used a combination of six steered MD and six equilibrium MD simulations to explore these possibilities. In all of these simulations, benzene was initially located at the mouth of the lateral opening, at a distance of ~9 Å from its position in the P-pocket (Supplementary Fig. 5). This location of benzene was taken from the original benzene position in one of the umbrella sampling simulations (benzene restrained at z = 1.6 Å), used to construct the PMF shown in Fig 6. In this simulation, benzene was observed to spontaneously diffuse ~6 Å to the mouth of the lateral opening, and the equilibrium simulations were therefore initiated from this point. Overall, the steered MD and equilibrium simulations revealed three pathways for benzene exit via the lateral opening (Fig. 7).
a, Views from the side (top panel) and from the extracellular side (bottom panel), showing locations of benzene molecules from 250 equally spaced frames for each simulation, according to the colour scheme indicated. The middle panel shows the breathing motion of the lateral opening during benzene exit via pathway B. b, Force profiles corresponding to pathways A and B (top panel) and views of the initial location of benzene for the equilibrium simulations (bottom).
Pathway A and B were identified from steered MD simulations, whereas pathway C was observed in equilibrium MD simulations. In pathway A, observed in three of the six steered MD simulations, benzene remains on the S3 side of the protein. In pathway B, observed in the other three steered MD simulations, the benzene moves through the S2 bulge and thus ends up on the S1 side of the bulge (Fig. 7a). As a result, the benzene completely avoids the polar regions of LPS and enters the hydrophobic region of the outer leaflet, facilitated by slight widening of the lateral opening (Fig. 7a). In contrast, by not going through the S2 bulge in pathway A, the benzene is forced towards the sugars of LPS. Inspection of the force profiles (Fig. 7b) shows that, as expected, pathway B is more favourable than pathway A. Regarding the equilibrium MD simulations, in two of these benzene moved out of the protein via pathway C. In this pathway, benzene interacts with Tyr79 initially, and then moves towards S3. Subsequently, a widening of the lateral opening enables it to then move out into the OM. In the other four equilibrium simulations, benzene exited the protein via pathway B in two simulations. In one simulation, it remained very close to its initial location near the lateral opening. Interestingly, in the final simulation the benzene moved from the lateral gate back into the high affinity binding site (P-pocket) in the protein interior. After the benzene diffused back into the P-pocket, the N-terminus of the protein switched to state II.
Transport of p-cymene by CymD is similar to benzene transport by TodX
So far, we focused on benzene in our simulations due to its small size and symmetry. However, given that our experimental data clearly support the lateral diffusion of p-cymene through CymD, we next asked if the simulations could also capture the diffusion pathway of this more complex substrate. Two independent equilibrium MD simulations with the p-cymene starting position close to the channel entrance show that p-cymene behaves in a similar manner to benzene in TodX, in that upon entering the protein it quickly becomes confined to the P-pocket and remains there for the duration of the 2 x 500 ns simulations (Fig. 3 and Supplementary Fig. 7). Interestingly, while in the P-pocket, p-cymene showed no orientational preference. In contrast to TodX, we did not observe a large conformational rearrangement in the N-terminus in order for p-cymene to be confined to the P-pocket of CymD, perhaps due to its larger size.
Given that we do not observe spontaneous diffusion of benzene from the P-pocket to the lateral opening in TodX, we expected p-cymene to behave similarly within CymD. Therefore, in order to study potential diffusion of p-cymene out of the protein, six equilibrium simulations were initiated from p-cymene placed at the lateral opening of CymD, based on the position of benzene in TodX from umbrella sampling simulations. P-cymene diffused laterally out of the protein in three out of six simulations (Supplementary Fig. 8). In two of these simulations, p-cymene occupied a position just on the outside of the protein but remained close to the lateral opening after 250 ns. Upon extension of these two simulations, p-cymene moved into the OM after ~ 480 ns in one case, whereas in the other it remained outside, but interacting with the lateral opening of CymD even after 500 ns. These results indicate that lateral diffusion into the OM from CymD is readily feasible for a substrate larger than benzene. In all three simulations, p-cymene exit was via the same pathway. Finally, we performed steered MD simulations for p-cymene in CymD to characterise both the classical and lateral diffusion pathways. The force profile for the classical route closely resembles that for TodX in state II, i.e. there is no large barrier for p-cymene movement past the lateral gate, but there is a sizeable barrier for moving through the plug domain (Supplementary Fig. 9). In other words, diffusion via the classical route is unfavourable. Force profiles from steered MD simulation in which p-cymene is pulled out of the lateral opening are also similar to those for benzene being pulled out of TodX, but with the force barriers around the lateral opening being somewhat higher and wider. The pathway of p-cymene exit from these simulations was the same as that observed from the three equilibrium simulations in which spontaneous exit occurred. Thus, we only observe one pathway for lateral exit of p-cymene from CymD.
Discussion
The fact that toluene, with its small size and hydrophobic character, permeates poorly through the OM of PpF1 in the absence of MAH channels emphasises the truly remarkable permeation barrier formed by this ~30 Å wide hydrophobic layer covered by cross-linked sugars on the extracellular surface. In the presence of TodX/CymD, robust growth is observed, providing the first direct evidence for MAH transport by FadL-type channels30,31. Our equilibrium MD simulations show very rapid movement of MAH molecules over a distance of ~30 Å, from the extracellular surface of the protein, via a largely hydrophobic tunnel, to a well-defined high-affinity binding site close to the N-terminus. This shows that MD is particularly well-suited to study the protein-mediated diffusion of small hydrophobic molecules, while experimental methods are challenging. Within the P-pocket, the MAH molecules appear to be quite stable, and it is currently not quite clear how the MAH moves from the P-pocket to the lateral opening. The limitation of equilibrium MD timescales is not allowing us to observe this movement, and likely requires enhanced sampling methods in follow-up studies. However, the simulation where benzene moves from the lateral opening back to the P-pocket is informative. Here, the benzene moves ~9 Å in a very short time (~30 ps; Supplementary Fig. 10). The movement to the P-pocket includes passage through a narrow, hydrophobic constriction formed by the side chains of Val3, Leu105 in the S3kink and Ile370 in loop L6. Perhaps surprisingly, there are no dramatic conformational changes, with minor, “breathing” movements of the N-terminus, S3 kink and L6 allowing MAH passage from the P-pocket to the mouth of the lateral opening. It seems not unreasonable to propose that the same mechanism could allow forward diffusion from the P-pocket to the lateral opening. From there, both steered and equilibrium simulations show that MAH can move readily into the OM, even on the short timescales of MD simulations (Supplementary Fig. 11 and Supplementary Movie). However, while we can identify plausible routes for MAH exit from the lateral gate, the sheer dimensionality of the problem, exacerbated by the LPS in the outer leaflet which moves very slowly, makes it unfeasible to perform an exhaustive energetic study to predict the most likely route(s).
An important question is what happens with the MAH after moving into the OM. The positioning of the lateral opening, at the interface region of the LPS leaflet (Supplementary Fig. 11), allows movement of the hydrophobic molecule into the hydrocarbon layer of the LPS. Calculation of PMF profiles for benzene permeation across the OM has shown that the interface region of the LPS provides a strong energetic barrier32, preventing movement of the MAH back into the extracellular environment. Inside the hydrophobic core of the OM, the benzene is very mobile and readily enters the inner, phospholipid leaflet (Supplementary Fig. 11). The interface of the phospholipid leaflet forms a much smaller energetic barrier for benzene32, and we propose that MAH move into the periplasmic space via mass action, governed by their partitioning coefficients. MAH accumulation into the OM is likely to be tolerated relatively well compared to the IM, for which harmful changes in structure and function as a result of MAH are well-documented16. An advantage of the mass-action model is that it would provide a steady, low-level of MAH flux into the IM, preventing toxicity. Alternative, and in our opinion less likely, possibilities for FadL substrates to reach the IM are protein-protein33 and perhaps lipid-lipid bridges34 connecting the OM and IM.
FadL orthologs are very widespread in Gram-negative bacteria and, according to the Transporter Classification Database (http://www.tcdb.org/), are found within the phyla Proteobacteria, Nitrospirae, Spirochetes, Chlamydiae and the Bacteroidetes/Chlorobi group. Within the Proteobacteria, FadL orthologs are present in α-through ε-Proteobacteria. Despite their abundance, very few FadL orthologs have been studied, and based on the enormous number of potential substrates it is likely that this family mediates many important processes in Gram-negative bacteria. Our data on MAH uptake provide support for this hypothesis, since they show that small, moderately hydrophobic molecules also function as substrates of FadL channels. Quorum sensing is one potential area in which FadL-mediated transport might play a major role, given that many quorum sensing/auto-inducer molecules are hydrophobic35. Indeed, FadL orthologs have been shown to mediate N-acyl homoserine lactone uptake in Rhizobium spp36. Many Vibrio spp., commonly studied as quorum sensing model organisms, possess multiple FadL orthologs (V. cholerae has three), and it is tempting to speculate that some may be involved in quorum sensing signal uptake. Another, completely unexplored area concerns the potential role of FadL channels in the human gut microbiota. Many species within the abundant genus Bacteroides have several FadL orthologs (B. thetaiotaomicron has four). In addition, the gut contains a huge arsenal of potential substrates, such as LCFA, bile acids and sterols, suggesting that FadL channel-mediated processes could be important in gut biology.
Author contributions
MD simulations were performed by KS, supervised by SK. AD constructed strains, performed assays together with DB, purified and crystallised proteins, and determined crystal structures. AB collected X-ray crystallography data. BvdB initiated the project, constructed strains, performed growth assays, and determined crystal structures. The manuscript was written by BvdB and SK.
Ethics declarations
Competing interests
The authors declare no competing interests.
Materials and Methods
Strain construction
Mini-Tn7 elements can be transferred readily into different backgrounds, since the molecular biology of Tn7 is well understood. The TnsABCD transposase components of Tn7 catalyze site- and orientation-specific insertion with high frequency into bacterial chromosomes at Tn7 attachment (attTn7) sites. These sites are located downstream of the highly conserved glmS gene, encoding essential glucosamine-6-phosphate synthetase (Pput_5291 in PpF1)37. Tn7-based genome knock-ins of PpF1-3 were performed according to established procedures38. Briefly, full-length wild type and mutant genes (including their native signal sequence) were cloned in the multiple cloning site of the pTJ1 delivery vector, which contains the mini-Tn7T-Tp-araC-PBAD-MCS mobile element for cloning and subsequent integration into the chromosome; genes cloned in this vector are under the control of the PBAD arabinose-inducible promoter. The plasmid also contains the dhfRII gene encoding trimethoprim resistance, which is used to select for chromosomal integration. The pTJ1 vector was moved into the S17 λ-pir conjugation strain via heat shock of competent cells. Plasmid transfer to PpF1-3 was achieved via 4-parental conjugation. Additional E. coli strains used were SM10 (λ-pir) containing the helper plasmid pTNS3 encoding the transposase gene, and SM10 (λ-pir) containing the conjugation helper plasmid pRK201338. The helper plasmids were moved into SM10 cells via electroporation at 2500 V (2 mm cuvette). Subsequently, 5 ml overnight cultures of the three plasmid-containing E. coli strains were set up in LB with appropriate antibiotics at 37 °C, in addition to the PpF1-3 strain at 30 °C in LB. The next morning, PpF1-3 was sub-cultured at 1:50 dilution and E. coli at 1:100 dilution (+ antibiotics) and grown until late log phase (OD ~ 1; ~4-5 hrs). 250 μl of PpF1-3 culture and 250 μl of each of the E. coli cultures were combined, centrifuged at 5,000 rpm for 3 mins and resuspended in 100 μl LB. The cells were spotted onto a pre-warmed dry LB plate (no selection) and incubated at 30 °C overnight. The spot was collected in 1 ml LB, centrifuged and resuspended in 100 μl LB. Finally, cells were plated out on Pseudomonas Isolation agar plates with 1.5mg/ml trimethoprim and incubated overnight at 30 °C. Colony PCR was performed to check for Tn7 integration37. The following Tn7-complemented strains were made in PpF1-3: todX, cymD, f1fadL, todX ΔS2, todX ΔS2S3, cymD ΔS2, cymD ΔS3, and cymD G105A.
For plasmid-based complementation, cymD constructs containing a C-terminal His-tag were cloned in the pHERD30 plasmid, a broad-host-range expression vector that utilizes the pBAD promoter and the araC regulator39. The MCS of pHERD vectors is the same as that in the pTJ1 delivery vector described above, facilitating transfer of constructs between both systems. Plasmids were maintained in E. coli in the presence of 15 μg/ml gentamycin and in PpF1-3 with 60 μg/ml gentamycin. The following cymD constructs were made: wild type, ΔS2, ΔS3, ΔS2S3, and G105A.
All mutants were obtained via standard site-directed mutagenesis using the Q5 system (New England Biolabs), according to manufacturer specifications. For TodX ΔS2, the native sequence 67PKSSTRSNNRAPYVGPQ was replaced with PKSSTYAVGAQ. Thus, seven residues (underlined) from the S2 bulge were removed, an alanine was inserted to bridge the expected gap, and a proline at the end of the bulge was replaced by an alanine. For the TodX ΔS2S3 variant, the conserved G104 was deleted within the ΔS2 background. For CymD ΔS2, the sequence 67KADSHSRGRNNGPYVAPE was replaced by KADSHSYAVAAE, analogous to TodX (mature sequence numbering). To generate the ΔS3 strain, G105 was removed (and replaced by Ala in the G105A strain).
Plate growth assays
Plates were made using standard LeMasters-Richards (LR) minimal medium without any carbon source and 1.5% regular agar. For pHERD-based complementation, the plates included 60 μg/ml gentamycin. Arabinose was added in concentrations ranging from 0-0.5% (w/v). For each growth experiment, several colonies were picked from freshly made LB plates containing the respective PpF1-3 strains and resuspended in 0.5 ml LR medium and starved for 15 mins at room temperature. 2-3 μl of resuspended cells were spotted on each plate and streaked out. The plates (up to four at a time) were incubated in a closed, plastic Tupperware box that also contained a microcentrifuge tube with 150 μl of either toluene or p-cymene. The tube was closed in the case of toluene, to prevent excessively fast evaporation. Plates were incubated at 30 °C for up to 6 days in the case of p-cymene and at room temperature for up to 4 days in the case of toluene. Colony sizes were measured using a stereo microscope.
MAH channel expression and purification
Due to low yields for OM-expressed protein, TodX mutants were expressed as inclusion bodies (IB) in E. coli, followed by denaturation and in vitro folding as previously described25. For expression of TodX in IB, plasmid pTodX(ib) was constructed by cloning the mature TodX gene sequence (corresponding to residues 22–453 and with a methionine added at the N-terminus), with a C-terminal hexahistidine tag, downstream of the arabinose-inducible promoter in pBAD22. In this construct, the N-terminal threonine is replaced with a methionine in order to keep to length of the N-terminal plug domain constant. The vectors for the ΔS2 and ΔS2S3 mutants were obtained via standard site-directed mutagenesis using the Q5 system (New England Biolabs), according to manufacturer specifications. For ΔS2, the native sequence 67PKSSTRSNNRAPYVGPQ was replaced with PKSSTYAVGAQ. For the ΔS2S3 variant, the conserved G104 was deleted within the ΔS2 background. Protein expression was carried out in Bl21 (DE3) cells at 37 °C in LB/Amp (100 μg/l) by growing the cells to an OD600 of ~0.6-0.8, followed by induction with 0.1% (w/v) arabinose for 2.5-3 hrs at 37 °C. Following this, cells were harvested by centrifugation, resuspended in TSB buffer [20 mM Tris, 300 mM NaCl, pH 8] and ruptured by one passage at 23,000 psi through a cell disrupter (Constant Systems 0.75 kW). Inclusion bodies and cell debris were collected by centrifugation at 17,000 × g for 20 mins. The pellet was resuspended by homogenization in 10 mM Tris (pH 8), 50 mM NaCl, 1% Triton X-100 and stirred for 30 mins at room temperature. This wash step was repeated twice with buffer lacking Triton X-100. The washed pellet was resuspended in 10 mM Tris (pH 8), 50 mM NaCl containing 8 M urea, and the suspension was stirred for 2 h or overnight at room temperature. After centrifugation (46,000 × g), the supernatant containing urea-denatured protein was slowly (dropwise) diluted 10-fold in 1% (wt/vol) N-lauroylsarkosine (sarkosyl) in TSB with fast stirring at room temperature. Protein folding was achieved by slowly stirring the sarkosyl solution at 4°C for ~18 h. Folded TodX protein was purified by nickel affinity and gel filtration chromatography as described for FadL23. Success of the sarkosyl folding was confirmed by observing the heat-modifiability on SDS/PAGE gels; before boiling, TodX migrates on SDS/PAGE gels with an apparent molecular mass of ~28 kDa, and after boiling, the migration shifts to ~46 kDa.
In contrast to TodX, CymD expressed to high levels within the E. coli OM. For expression, the mature part of CymD (residues 26-460) was cloned into a modified pET22b vector containing the E. coli FadL signal sequence (residues 1-26). The resulting construct has a C-terminal hexa-histidine tag for purification via IMAC. Expression was achieved in C43 (DE3) cells in which the cyoABCD complex (Cytochrome bo oxidase) on the chromosome has been deleted, facilitating purification of membrane proteins post-IMAC that are otherwise contaminated with the Cyo complex. Cells were grown in LB/Amp to an OD of ~0.4 at 37°C, after which the temperature was lowered to 18°C and cells were induced 30 mins later with 0.2 mM IPTG. Cells were harvested by centrifugation, resuspended in TSB buffer (20 mM Tris, 300 mM NaCl, pH 8) and ruptured by one passage at 23,000 psi through a cell disrupter (Constant Systems 0.75 kW). Total membranes were obtained by ultracentrifugation in a Beckman 45Ti rotor for 45 mins at 42,000 rpm at 4°C. Membranes were extracted via homogenisation in 3% Elugent in TSB (using 150 ml for cells from a 12 l culture) for 2 hrs or overnight, followed by ultracentrifugation (45Ti rotor for 30 mins at 42,000 rpm at 4°C). The extract was loaded onto a ~10 ml nickel column (chelating sepharose; GE Healthcare) equilibrated in 0.15% LDAO in TSB. The column was washed with 15 column volumes LDAO buffer containing 25 mM imidazole and subsequently eluted with 3 column volumes buffer with 200 mM imidazole. After concentration by ultrafiltration to ~3 ml (MWCO 50 kD; Millipore) the IMAC step was followed by Superdex-200 16/60 gel filtration in 10 mM Hepes/100 mM NaCl/0.05% LDAO pH 7.5. For crystallisation, another gel filtration step was performed in which the LDAO was replaced by 0.4% C8E4. Peak fractions were pooled, concentrated to ~15 mg/ml, and aliquots were flash-frozen in liquid nitrogen. The final yield of CymD was ~2-2.5 mg/l cell culture.
Crystallisation and structure determination
Proteins were crystallised via sitting drop vapour diffusion, using in-house and commercial screens to obtain initial hits (MemGold1, MemGold2, Morpheus; Molecular Dimensions) via a Mosquito crystallisation robot (TTP Labtech) at 20 °C. Where needed, hits were optimised by fine screening of sitting-drop or hanging-drop setups with larger volumes. Both for TodX mutants and CymD a number of initial hits were obtained, but after optimisation only a limited number diffracted to useful resolutions (~4-5 Å or better on a home source). For TodX ΔS2, the best crystals grew in 0.1 M Na-citrate (pH 3.5), 0.2 M Li2SO4, 28% PEG 400. For TodX ΔS2S3, conditions were similar (0.05 M Na-citrate (pH 4.5), 0.1 M Li2SO4, 30% PEG 400). For CymD, the best crystals were obtained in optimisations from Morpheus F12 (0.1 M bicine/Trizma base pH 8.5, 0.02M mixed monosaccharides, 10-15% PEG 1000/10-15% PEG 3350/12.5% MPD). Diffraction data were collected at the Diamond Light Source (Didcot, UK) and auto-processed via xia240 or DIALS41. The structures were solved by molecular replacement with Phaser42, using wild-type TodX (PDB 3BRZ) as the search model (Supplementary Fig 11). The structures were built via iterative model building cycles within COOT43 and refinement with Phenix44,45, using TLS refinement for the B-factors. All structures contained 1 molecule in the asymmetric unit, with solvent contents ranging from ~70% for both TodX mutants to ~50% for CymD. As is the case for wild type TodX, and for unknown reasons, the electron density in all crystals is relatively poor. This is illustrated by high average B-factors of ~128 Å2 for TodX ΔS2 and ΔS2S3, and ~76 Å2 for CymD. Not surprisingly, this led to relatively high numbers of outliers in the Ramachandran plots and relatively high R-factors (Supplementary Table 1). The Ramachandran statistics are (favoured/outliers %): TodX ΔS2, 82.1/4.6 %; TodX ΔS2S3, 87.1/3.9 %; CymD 94.1/1.1%. Structure validation was performed with MolProbity46.
System preparation for molecular dynamics
The crystal structures of TodX (PDB 3BRZ)25, TodX ΔS2, TodX ΔS2S3 and CymD were completed by adding in the missing residues using Modeller 9.02 (https://salilab.org/modeller/)47,48. In order to mimic its natural environment, the completed protein structure was then embedded in an equilibrated E.coli model outer membrane containing Ra-LPS molecules in the outer leaflet and a mixture of 1-palmitoyl 2-cis-vaccenic phosphatidylethanolamine, 1-palmitoyl 2-cis-vaccenic phosphatidylglycerol, and 1-palmitoyl 2-cis-vaccenic 3-palmitoyl 4-cis-vaccenic diphosphatidylglycerol (cardiolipin) in the ratio 18:1:1 in the inner leaflet49–52, developed by Piggot et al53, using the g_membed protocol54. The Gromos 54a7 force-field was used55. The protein-membrane complex was solvated by the simple point charge (SPC) water model56 and neutralising counterions. To relax any steric conflicts within the system generated during set up, the system was subjected to energy minimization of 1000 steps using the Steepest Descents method. An. equilibration procedure followed in which the entire protein was subjected to positional restraints (with force constant 1000 kJ mol−1 nm−1) for 10 ns of MD, followed by 10 ns in which only the protein backbone was restrained. Visual molecular dynamics (VMD)57 was used as a tool to place benzene molecules around the entrance of TodX transporter for the initial simulations. Initial benzene location in subsequent simulations is explained in where the specific methods for those simulations are discussed.
Equilibrium MD simulations
GROMACS 5.1.458 was employed for all simulations. The short-range electrostatic and van der waals cutoffs were set to 1.4 nm, whereas the long-range electrostatic interactions were treated using the particle mesh Ewald (PME) method55. All atoms were constrained using the LINCS algorithm59 to allow the time step of 2 fs. Temperature was kept at 310 K using the V-rescale thermostat60. To maintain a constant pressure at 1 atm, the Parrinello-Rahman barostat61 was applied with a time constant of 1 ps using semi-isotropic coupling. Equilibrium MD systems were run with at least two repeats, each starting with different initial velocities. The results were analysed using functions within GROMACS package58. Molecular graphic images were prepared using both VMD and Pymol62.
Steered and Umbrella sampling MD simulations
In order to generate the initial input for an umbrella sampling MD simulation, constant velocity pulling simulations were performed along ± Z direction (principle axis of the protein). The initial benzene position was extracted from the equilibrium MD simulations. A harmonic spring with force constant of 1000 kJ mol−1 nm−2 was attached to the benzene and pulled at a rate of 0.5 nm ns−1 for ~12 ns (6 independent simulations were performed to calculate each force curve from steered MD simulations). The force constant and the pulling speed were chosen based on our previous work on arginine permeation through OprD63. Snapshots along the ± Z directions were then selected as initial starting points for umbrella sampling windows, such that they were separated by 0.1 nm between windows. Each window was initially equilibrated with a restraint on the benzene for 1 ns. Each equilibrated system was then subjected to 200 ns MD simulation by applying a harmonic biasing potential on the centre of mass of the benzene; a harmonic force constant of 1000 kJ mol−1 nm−2 was used to restrain the benzene along in the Z direction. In the case of insufficient sampling, the extra umbrella sampling windows were added based on visual inspection of histogram overlap. The PMF profiles were then computed using Weight Histogram Analysis Method (WHAM)64 as implemented within GROMACS. The estimated correlation time for each window and the Bayesian bootstrapping error of the PMF was also calculated using the built-in GROMACS tools65. Steered MD simulations in which the benzene was pulled out through the lateral gate were performed such that the benzene was restrained along a vector from the lateral gate of the protein to the outer membrane.
Data availability
The data supporting the findings of this study are available from the corresponding authors upon reasonable request. Coordinates and structure factors that support the findings of this study have been deposited in the Protein Data Bank with accession codes xxx.
Supplementary Information
a, PpF1 AH substrates with membrane partition coefficients (log Kp values) listed in parentheses. Trichloro-ethylene (TCE) is listed due to its importance and since it is co-metabolized with AH. Butylbenzene and biphenyl are degraded by mutant strains of PpF1 with an extended substrate range that have mutations in cymR as well as in todS. The result of the first mutation is constitutive expression of CmtE, which has an analogous function to TodF but with a broad substrate range. The mutation in TodS results in induction of the tod operon by the new substrates. Thus, both operons work together in the degradation of extended-range growth substrates such as biphenyl1. b, Organization of the cym/cmt (top) and tod operons (bottom) on the PpF1 genome, and illustration of various enzymatic reactions during MAH biodegradation. The locations of cymD and todX are highlighted in red. todR is a non-functional, LysR-type regulator2.
Benzene permeation through TodX during two independent equilibrium MD simulations. a, Confinement in the P-pocket of the single benzene that entered the protein during this simulation. b, Movement of two benzene molecules that entered the protein during this simulation. While the benzene molecules again have an overall preference for the P-pocket, the confinement is not nearly so pronounced as in the simulation shown in (a). The N-terminal plug domain of TodX is coloured yellow and the strand S2 is highlighted in pink.
Cluster analysis of 4 x 500 ns equilibrium MD simulations (2 apo (A and B) and 2 with benzene (B and C)). In each simulation one cluster dominates, indicated by coloured lines. The conformations of the N-terminus (backbone of residues 1-12) from these dominant clusters are shown in the 4 panels on the right-hand side. The N-terminus in the X-ray crystal structure of TodX is shown in yellow. Panel D corresponds to the conformation (shown in green) that coincides with benzene confinement in the P-pocket. We have termed this ‘state II’.
N-terminal conformational rearrangement leading to state II. a, Conformations of the N terminus in the crystal structure (state I, yellow) and state II (green). Residues Thr1, Leu105 and Arg136 are labelled to highlight the conformational difference between the two states. b, Minimum distance between Thr1 and Leu105/Arg136 as a function of simulation time; the conformational change to state II occurs at about 50 ns, leading to a shift in proximity of the N-terminus from Arg136 to Leu105, c, interactions that stabilise the N-terminus in state II.
X-ray crystal structures of TodX ΔS2, TodX ΔS2S3 and CymD. The structures of EcFadL (PDB ID 1T16) and wild type TodX (PDB ID 3BRZ) are shown for comparison. Missing residues are 1-3 and 401-407 in TodX ΔS2, 1-3 and 400-408 in TodX ΔS2S3, and 1-2, 60-74, 109-126, 366-372, and 405-418 in CymD.
Force profiles from steered MD simulations through the classical pathway. a, Surface model of the classical channel through TodX based on the crystal structure (3BRZ) after addition of hydrogens. b, Snapshots from steered MD simulations of benzene pulled through TodX when the protein is unrestrained in state I. c, Force profiles of benzene pulled along the classical route when the protein is unrestrained in state I (yellow), and when the N-terminal plug domain is restrained in state II (green).
Equilibrium MD simulations of p-cymene and CymD. a, b Each panel shows one of the independent, 500 ns simulations. In each case, p-cymene remains within the P-pocket. The right-side plots show the orientations adopted by p-cymene throughout the simulations as a function of time. The z-coordinate of the centre of mass of the isopropyl group with respect to the centre of the mass of the 6-membered ring is plotted on the y-axis, with accompanying images of p-cymene to facilitate interpretation. Across the two simulations, there is no single preferred orientation.
Equilibrium MD simulations with p-cymene initially placed at the lateral opening. a, Superposition of Tod X and benzene (blue) and CymD and p-cymene (red), the location of the MAH is highlighted by the yellow circle. Thus, the equilibrium simulations exploring substrate diffusion into the OM are initiated from equivalent protein-substrate configurations. b, Lateral diffusion of p-cymene into the OM within 250 ns in 3 independent simulations.
Force profiles from steered MD simulations. a, Force profile for p-cymene being pulled along the classical route. There is a large barrier at ~20 Å which corresponds to the plug domain. b, Force profile for pulling p-cymene through the lateral opening. The profile is similar to that for benzene through TodX via pathway B (Fig. 7).
Movement of benzene from the lateral opening to the P-pocket and concomitant conformational change during an equilibrium MD simulation initiated from benzene at the lateral opening (red). The timeframe shows that the move occurs very quickly within a 30 ps timeframe and thereafter there is less movement as the benzene remains confined to the P-pocket. The side chains of the aliphatic gate residues Val3, Leu105 and Ile370 are shown as stick models.
The fate of benzene in the OM after emergence from TodX in equilibrium MD simulations. a, Cartoon showing the two LPS molecules within 6 Å of benzene, about 20 ns after benzene (yellow) has emerged from the protein. b, The benzene location at every 4 ns during a 250 ns simulation is shown. The benzene is very mobile within the OM and even reaches the inner phospholipid leaflet. Benzene becomes confined for around 80 ns at one particular region in one of the simulations, where a pocket is formed by Leu28 and Lys 43. This is shown in the inset in panel b. Two phospholipids are shown for context regarding location within the OM. The phosphate groups of lipid A (cyan) and phospholipids (olive) are shown as VDW spheres. S2 and S3 are coloured red to show the location of the lateral opening.
Data collection and refinement statistics (molecular replacement)
Summary of the equilibrium MD simulation systems
Supplementary Movie Diffusion pathway for benzene through TodX. To generate the movie, equilibrium MD simulations were combined with steered MD simulations.
Acknowledgements
This work was supported by a NIH RO1 grant (1R01GM104495-01 to BvdB). BvdB was also supported by a Royal Society Wolfson Merit award. KS is funded by the Development and Promotion of Science and Technology Talents Project (DPST), Thailand. The authors wish to thank Mark Sansom for helpful discussions, in particular concerning the free energy calculations. The authors acknowledge the use of the IRIDIS High Performance Computing Facility, and associated support services at the University of Southampton and the use of the UK national supercomputer, ARCHER granted via the UK High-End Computing Consortium for Biomolecular Simulation, HECBioSim (http://hecbiosim.ac.uk), supported by EPSRC (grant no. EP/R029407/1), in the completion of this work.