Unexpected transformations during pyrroloiminoquinone biosynthesis

Pyrroloiminoquinone containing natural products have long been known for their biological activities. They are derived from tryptophan, but their biosynthetic pathways have remained elusive. Studies on the biosynthetic gene cluster (BGC) that produces the ammosamides revealed that the first step is attachment of Trp to the C-terminus of a scaffold peptide in an ATP and tRNA dependent manner catalyzed by a PEptide Amino-acyl tRNA ligase (PEARL). The indole of the Trp is then oxidized to a hydroxyquinone. We previously proposed a chemically plausible and streamlined pathway for converting this intermediate to the ammosamides using additional enzymes encoded in the BGC. In this study, we report the activity of four additional enzymes that show that the proposed pathway is incorrect and that Nature’s route towards pyrroloiminoquinones is much more complicated. We demonstrate that, surprisingly, the amino groups in pyrroloiminoquinones are derived from three different sources, glycine, asparagine, and leucine, all introduced in a tRNA dependent manner. We also show that an FAD-dependent putative glycine oxidase is required for the process that incorporates the nitrogens from glycine and leucine, and that a quinone reductase is required for the incorporation of the asparagine. Additionally, we provide the first insights into the evolutionary origin of the PEARLs as well as related enzymes such as the glutamyl-tRNA dependent dehydratases involved in the biosynthesis of lanthipeptides and thiopeptides. These enzymes appear to all have descended from the ATP-GRASP protein family.


General materials and methods
Molecular biology experiments were performed using reagents from New England Biolabs, Thermo Fisher Scientific, or Sigma-Aldrich.Plasmid maintenance and protein overexpression were conducted using Escherichia coli DH10β and BL21 (DE3) strains.pRSFDuet-1 and pCDFDuet-1 were obtained from Novagen.MALDI-TOF-MS data acquisition was performed using a Bruker UltrafleXtreme mass spectrometer (Bruker Daltonics) in reflector positive mode at the University of Illinois School of Chemical Sciences Mass Spectrometry Laboratory.A commercial mixture of 9:1 2,5-dihydroxybenzoic acid (DHB) and 2-hydroxy-5-methoxybenzoic acid (Super-DHB or SDHB) solution with a stock concentration of 25 mg/mL was routinely mixed with samples in a 1 to 1 ratio and air-dried before MALDI-TOF-MS analysis.MS1 and MS2 data acquisition was performed using an Agilent qTOF instrument equipped with an UPLC system.
2D NMR experiments were recorded using an Agilent 600 MHz spectrometer in 100% d6-DMSO.Ni-NTA resin for peptide and protein purifications was obtained from Thermofisher.

Molecular Biology Techniques
Plasmid DNA was prepared using a MiniPrep kit (Qiagen) following the manufacturer's instructions from E. coli DH10β cells.Plasmids were manually designed and prepared by Gibson assembly using NEBuilder HiFi DNA Assembly Master Mix.The vector primer design usually incorporated overhangs complementary to the gene of interest (Table S1).PCR amplification was performed using Q5 polymerase, dNTPs and Q5 reaction buffer from New England Biolabs.Designed primers for cloning and in vitro transcription were acquired from Integrated DNA Technologies (Table S1).Genes encoding E. coli asparagine tRNA Asn and leucine tRNA Leu ligase were obtained from TwistBioscience (Table S1).
Cultures were shaken at 190 RPM at 37 o C for 3 h following inoculation with IPTG to a final concentration of 0.4 mM when OD600 reached 0.6-0.8.After incubation for 18 h overnight at 18 o C, cells were harvested by centrifugation at 5,200 ×g.Cell pellets were stored in a -80 o C freezer for future use or resuspended in denaturing buffer composed of 6 M guanidinium hydrochloride, 50 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), pH 7.6.
Next, resuspended cells were disrupted by sonication using repeated cycles of 2 s on and 5 s off.After centrifugation at 50,000 ×g, clarified lysate was passed through Ni-NTA resin to retain the His-tagged peptide.The column was washed with 20 column volumes (CV) of wash buffer composed of 25 mM imidazole, 300 mM NaCl, and 50 mM HEPES at pH 7.6.The His-tagged peptide was then eluted with 5 CV of elution buffer composed of 200 mM imidazole, 300 mM NaCl, and 50 mM HEPES at pH 7.6.The peptide was then purified by HPLC or desalted using a C18 ziptip and eluted into 1.5 µL of 25 mg/mL of a solution of SDHB dissolved in 80% CH3CN (ACN)/20% H2O/0.1% HCOOH (FA) for MALDI-TOF-MS analysis.

High-performance liquid chromatography (HPLC) purification of isolated peptide
Peptides obtained from Ni-resin purification were acidified with a 10% trifluoroacetic acid (TFA) solution to a final concentration of 0.1 to 2% TFA to precipitate any coeluting protein from the overexpression protocol.The sample was then passed through a 0.45 µm syringe filter and injected directly into an Agilent 1260 Infinity II HPLC instrument for purification.The peptide was purified using a preparative scale column, VP NUCLEODUR C18 HTec, 5 µm, 250x10 mm with the following gradient: 0-5 min isocratic 2% ACN, 5-40 min 2%-60% ACN, 40-42 min 60%-100% ACN; mobile phase buffered with 0.1% TFA.Fractions were collected manually and analyzed by MALDI-TOF MS by combining 1 to 1 with a 25 mg/mL solution of SDHB.Fractions containing the desired peptide were combined and lyophilized.The dried solids were then used for in vitro assays, further purification on an analytical scale column, or proteolytic digestion for LC-MS or NMR analysis.
When the samples were used for NMR experiments or LC-MS/MS analysis, the HPLC purified peptides were subjected to trypsin proteolytic cleavage, acidified to 0.1 to 1% TFA, filtered through a 0.45 µm membrane or centrifuged at 20,000 ×g for 5 min, and purified by HPLC.The tryptic fragment containing the C-terminal modifications was then eluted with the same column and gradient as indicated above for analytical scale purification.Fractions were collected manually and submitted for MALDI-TOF MS analysis for peptide fragment identification.

BhaG expression and purification
Protein expression was carried out by growing E. coli BL21 (DE3) cells expressing the Nterminal His6-tagged BhaG from pET28-BhaG.Cells were grown with 50 µg/mL kanamycin in Luria Bertani Broth.Cultures were shaken at 190 RPM at 37 °C for 3 h following inoculation with IPTG to a final concentration of 0.4 mM when OD600 reached 0.6-0.8units.After incubation for 18 h overnight at 18 °C, cells were harvested by centrifugation at 5,200 ×g.Cell pellets were stored in a -80 °C freezer for future use or resuspended in non-denaturing buffer composed of 20 mM NaH2PO4, 500 mM NaCl, and 50 mM imidazole at pH 7.5 with 5% glycerol.Next, resuspended cells were disrupted by sonication with repeated cycles of 2 s on and 5 s off.After centrifugation at 50,000 ×g, the clarified lysate was passed through Ni-NTA resin, previously equilibrated with the same initial buffer, to retain the His-tagged protein.The column was washed further with 20 CV of the initial buffer.Then, the His-tagged protein was eluted with 5 CV of elution buffer composed of 20 mM NaH2PO4, 500 mM NaCl, and 200 mM imidazole at pH 7.5 with 5% glycerol.After that, a previously equilibrated PD-10 column was used to buffer exchange the protein solution to 50 mM K3PO4 and 5% glycerol at pH 8.0.The flowthrough was further concentrated using an Amicon centrifugal device with a 30 kDa MWCO.A slight precipitation of the yellow protein was observed and which was redissolved by adding glycerol in a 1 to 1 ratio.

BhaB4 and AmmB1 expression and purification
Protein expression was carried out by growing transformed E. coli BL21 (DE3) cells expressing the N-terminal His6-tagged PEARL BhaB4 or AmmB1 from the corresponding pET28-BhaB4 or pET28-AmmB1 plasmids.Cells were grown with 50 µg/mL kanamycin in Luria Bertani Broth.
Cultures were shaken at 190 RPM at 37 °C for 3 h following inoculation with IPTG to a final concentration of 0.4 mM when OD600 reached 0.6-0.8units.After incubation for 18 h overnight at 18 °C, cells were harvested by centrifugation at 5,200 ×g.Pellets were stored in a -80 o C freezer for future use or resuspended in non-denaturing buffer composed of 20 mM HEPES, 300 mM NaCl, and 50 mM imidazole at pH 7.5 with 5% glycerol.Next, resuspended cells were disrupted by sonication with repeated cycles of 2 s on and 5 s off.After centrifugation at 50,000 ×g, clarified lysate was passed through Ni-NTA resin, previously equilibrated with the same initial buffer, to retain the His-tagged protein.The column was washed further with 20 CV of the initial buffer.Then, the His-tagged protein was eluted with 5 CV of elution buffer composed of 20 mM HEPES, 500 mM NaCl, and 200 mM imidazole at pH 7.5 with 5% glycerol.After that, an Amicon with a 50 kDa cutoff was used to buffer exchange the protein solution to 50 mM HEPES and 5% glycerol at pH 7.5 and further concentrated to up to 30 mg/mL protein solution.Aliquots were taken and flash-frozen in liquid nitrogen before storing the protein solutions at -80 °C.

In vitro Transcription of E. coli tRNA Asn and tRNA Leu
tRNA was obtained as described previously with minor modifications. 1 First, double-stranded DNA (dsDNA) used as a template for transcription was produced using DNA Polymerase I, Large (Klenow) fragment according to the manufacturer's protocol.Briefly, forward, and reverse primers encompassing the desired nucleotide sequence and with a ten-nucleotide overlap (Table S1) were dissolved to a stock solution of 100 µM for each primer.Primers, dNTPs, and NEBuffer TM 2 (10X) were combined to a final concentration of 4 µM, 33 µM, and 1X, respectively.Then, 3 U of Klenow fragment was added while keeping all components on ice.
After that, the mixture was incubated at 25 °C for 15 min.The reaction was stopped by adding EDTA to 10 mM final concentration and incubated at 75 °C for 20 min.The reaction was usually set up in four individual 200-µL tubes.After quenching, all reactions were pull together and the DNA was precipitated by adding 2.5 volumes of ice-cold 100% ethanol.The mixture was left at -80 °C for 1 h and then centrifuged at 20,000 ×g for 30 min at 4 °C.The supernatant was discarded and the solid was washed with 75% EtOH twice.The resulting pellet was then air dried for 15 min and re-dissolved in nuclease free water (~20 µL).After concentration quantification by nanodrop, the solution was aliquoted and stored at -20 °C for future use.In vitro transcription was performed using HiScribe® T7 High Yield RNA Synthesis Kit according to the manufacturer's protocol for short transcripts (< 0.3 kb).Briefly, nuclease-free water and kit components were mixed to a final concentration of 1X Reaction Buffer, 7.5 mM ATP, 7.5 mM GTP, 7.5 mM CTP, 7.5 mM UTP, and 1 µg of the dsDNA template in a final volume of 20 µL.Then 1.5 µL of T7 RNA Polymerase Mix was added and the mixture was incubated for 16 h at DNA template was removed by adding 8 µL of DNase I (RNase-free) using 1X DNase I Buffer and 280 µL of nuclease-free water.The mixture was incubated for 15 min at 37 °C.Then, the mixture was extracted with an equal volume of acidic phenol:CHCl3 using a vortex-mixer for 1 min and centrifuged at 20,000 ×g for 5 min at 4 °C.The top phase was then carefully removed and mixed with an equal volume of a mixture of CHCl3 and isoamyl alcohol (24:1), vortexed, and centrifuged again for 5 min to separate the phases at 4 °C.The top phase was removed and mixed with 2.5 volumes of cold 100 % EtOH to precipitate tRNA on ice.The solid was then separated by centrifugation at 20,000 ×g for 15 min at 4 °C.The supernatant was discarded, and the pellet washed twice with ice-cold 75% EtOH.The tRNA was air dried and resuspended in approximately 20 µL of 2 mM NaOAc pH 5.2.The concentration was determined by nanodrop and adjusted to 1 to 5 µg/µL before aliquoting and storing at -20 °C for future use.

In vitro assay of BhaG and phenylhydrazine derivatization
A 50-µL solution containing a final concentration of 25 mM HEPES pH 7.6, 1 mM TCEP pH 7.6, 5 µM BhaG, 0.5 mM MgCl2, and 30 µM of His-tagged Intermediate 1 was incubated for 2 h at 37 °C.After that time, the reaction mixture was combined with 50 µL of Ni-resin slurry, vortexed, and centrifuged at 20,000 xg for 10 min.The supernatant (~80 µL) was removed and combined with 50 µL of a solution of 20 mM freshly prepared phenylhydrazine solution.The resulting solution was incubated for 2 h at 37 °C and then submitted for LC-MS analysis.The resin was then treated with 50 µL of 200 mM imidazole solution to elute the bound modified product of the BhaG reaction (intermediate 3) and centrifuged at 20,000 xg for 10 min.The resulting supernatant was transferred to an Amicon 3 kDa MW cutoff to buffer exchange the high concentration of imidazole to 50 mM HEPES pH 7.6 buffer.The resulting solution was treated with 2 µg of trypsin protease and incubated for 2 h at 37 °C.Finally, the digested peptide solution was desalted using a Pierce™ C18 column and the sample submitted for ESI-MS/MS analysis.

In vitro assay of BhaB4 and AmmB1
In a 200 µL tube, a solution containing a final concentration of 50 mM HEPES, 25 mM KCl, and 15 mM MgCl2 pH 7.6, 2 mM DTT, 5 mM amino acid (asparagine or leucine for BhaB4 or AmmB1, respectively, or a mix of 15 N-labeled L-amino acids), 6 mM ATP, 10 U Thermostable Inorganic Pyrophosphatase (TIPP), 10 U Superase RNAse inhibitor, 60-200 ng/µL of tRNA (E. coli mix, tRNA Asn , or tRNA Leu ), 20 µM of the corresponding aminoacyl-tRNA synthetase AsnRS or LeuRS, 5 µM of PEARL BhaB4 or AmmB1, and 15 µM of peptide substrate was incubated for 4 h at 37 °C in a heating block.After this time, the modified His-tagged peptide was isolated using Ni agarose resin.The eluted His-tagged peptide was buffer exchanged into 50 mM HEPES pH 7.5 and treated with trypsin or chymotrypsin for ESI-MS/MS analysis.
For the reaction with AmmB1, 2,4-dinitrophenylhydrazine (2,4-DNP) was used to determine whether isovaleraldehyde or 4-methyl-2-oxopentanoic acid was formed after PEARL-catalyzed leucine addition to intermediate 6.After the 4 h incubation with AmmB1 as indicated above, the reaction mixture was diluted with an equal amount of water and 2,4-DNP was added to a final concentration of 2 mM.The resulting mixture was incubated at 37 °C for 16 h and then centrifuged through an Amicon 3 kDa MW cutoff for 10 min.The flowthrough was used for ESI-MS/MS analysis.

LC-MS analysis of the products of reactions with BhaG, BhaB4, or AmmB1
After trypsin digestion, the solutions containing peptide fragments were desalted using a Pierce™ C18 spin column.Briefly, the spin column was activated with 200 µL of 60% ACN with 0.1% formic acid (FA) and rinsed three times with 200 µL of 0.1% FA.The sample was passed through the equilibrated column and washed three times with 200 µL of 0.1% FA.Then, the peptide fragments were eluted using 25 µL of 60% ACN with 0.1% FA, twice.The eluant was lyophilized, redissolved in 25-50 µL of water, and passed through a 0.45 µm membrane to remove any insoluble material.The sample was then injected to an Agilent 1260 Infinity II mass spectrometer with a 6545 LC/Q-TOF detector.A kinetex ® 2.6 µm C8 100 Å column, 150x2.1 mm, was used for peptide separation using the following gradient: 0-3 min isocratic 5% ACN, 3-13 min 5%-95% ACN, 13-15 min 95%-5% ACN; mobile phases contained 0.1% FA.A DualAJS ESI detector was utilized with the following parameters, Gas Temp of 325 °C, Drying Gas of 13 L/min, Nebulizer of 35 psi, Sheath Gas Temp of 275 °C, Sheath Gas Flow 12 L/min, Nozzle Voltage of 500 V and MS-TOF Fragmentor 175 V.

NMR characterization of the peptides modified by BhaB4 and AmmB3
Trypsin generated fragments of the products of BhaB4, AmmB3, and AmmB1 were dissolved in DMSO-d6 and transferred into a 5 mm Shigemi tube DMSO-d6 matched solvent and bottom length of 8 mm compatible with Bruker.ostulating that a PEARL in the amm cluster would install the second amino group (red font) through a glycine addition to the oxidized C-terminal Trp.Similarly, the amide group nitrogen in the final product (also red font) was proposed to be derived from Gly-tRNA.The order of the various proposed steps could be different than depicted.

Figure S2 .
Figure S2.ESI-MS/MS analysis of the products of the co-expression of BhaC1 and BhaB5.The mixture of products 1, 2, and 3 were digested with trypsin and subjected to LC MS/MS analysis.The major product was the quinone-glycine adduct 1, followed by compounds 2 and 3 that were produced in similar amounts.

Figure S4 .
Figure S4.In vitro reactions with BhaG.(A) Reaction catalyzed by BhaG.(B) LC-MSMS analysis of a peptide mixture enriched in the glycine-quinone adduct 1 that was reacted with BhaG under the specified reaction conditions.The mixture was then digested with trypsin and analyzed by LC-MS/MS Extracted Ion Chromatography (EIC) showing that incubation with BhaG results in conversion of 1 into aminoquinone 3 (left panel), whereas incubation without BhaG does not result in conversion to aminoquinone 3 (middle panel) beyond background reaction (right panel), which was probably the source of 3 seen in a previous study. 2SM = starting material.EIC, extracted ion chromatogram.

Figure S5 .
Figure S5.Glyoxylate derivatization.(A) Reaction mechanism of the BhaG catalyzed reaction.First, hydride transfer from glycine-quinone adduct 1 to FAD generates an imine intermediate.Then, hydration and elimination releases glyoxylic acid and the product aminoquinone 2. Released glyoxylic acid can be derivatized with phenylhydrazine forming the corresponding phenylhydrazone derivative.Molecular oxygen presumably regenerates the oxidized FAD cofactor.(B) LC-MS analysis of the BhaG reaction products derivatized with phenylhydrazine.EICs monitoring the hydrazone adduct.Top trace: product of the in vitro BhaG reaction treated with phenylhydrazine. 2 nd trace: glyoxylate authentic standard treated with phenylhydrazine.3 rd trace: coinjection of the glyoxylate authentic standard treated with phenylhydrazine and the BhaG product trated with phenylhydrazine.Bottom trace: Reaction mixture without enzyme treated with phenylhydrazine.

Figure S7 .
Figure S7.(A) MALDI-TOF mass spectra of in vitro reactions containing BhaG product (aminoquinone intermediate 3), and BhaB4 using E. coli cell-free extract (top panel), using E. coli AsnRS and tRNA Asn (second panel), including BhaH in addition to E. coli AsnRS and tRNA Asn (third panel), and using anaerobic conditions with BhaH in addition to E. coli AsnRS and tRNA Asn (bottom panel).(B) ESI-MSMS analysis of in vitro produced unlabeled BhaB4 product 4. Cleavage of the amide bond at AP during LC-MS/MS analysis produces a shorter fragment PLALAWN with the corresponding b and y ions.This fragmentation is likely due to the Pro effect.[3][4][5][6]A graph of the ppm errors for each identified ion is shown.7

7
Figure S7.(A) MALDI-TOF mass spectra of in vitro reactions containing BhaG product (aminoquinone intermediate 3), and BhaB4 using E. coli cell-free extract (top panel), using E. coli AsnRS and tRNA Asn (second panel), including BhaH in addition to E. coli AsnRS and tRNA Asn (third panel), and using anaerobic conditions with BhaH in addition to E. coli AsnRS and tRNA Asn (bottom panel).(B) ESI-MSMS analysis of in vitro produced unlabeled BhaB4 product 4. Cleavage of the amide bond at AP during LC-MS/MS analysis produces a shorter fragment PLALAWN with the corresponding b and y ions.This fragmentation is likely due to the Pro effect.[3][4][5][6]A graph of the ppm errors for each identified ion is shown.7

Figure S8 .
Figure S8.(A) BhaH co-purifies with flavin mononucleotide (FMN) as a cofactor.(B) BhaH activity assay using menadione as a substrate under anaerobic conditions.During the catalytic cycle, the bound cofactor, FMN, is reduced by NADPH.Then, the resulting FMNH2 reduces menadione regenerating FMN.(C) BhaB4 activity was tested in the absence of BhaH or NADPH, no conversion to product 4 was observed.However, when both, BhaH and NADPH were added to the mixture, conversion to product 4 is observed.

Figure S9 .
Figure S9.LC-MS/MS analysis of BhaB4 product obtained from in vitro reaction using labeled L-15 N2-Asn, tRNA Asn , AsnRS, and BhaH .Cleavage N-terminal to the Pro produces a labeled PLALAWN with the corresponding b and y ions, likely due to the Pro effect. 3-6A graph of the ppm errors for each identified ion is shown. 7W = modified Trp in 4 (see text).

Figure S13 .
Figure S13.(A) Products of co-expression of AmmB1, AmmA*-Trp, BhaC1 (AmmC1 ortholog), AmmB3, and BhaG.Trypsin digested fragments were generated and analyzed with LC-MS/MS.These fragments contained IAPLALAW with mass shifts of -1 (experimental m/z=898.5181,theoretical m/z=898.5145,error=4.0066ppm), +42 (experimental m/z=941.5144,theoretical m/z=941.5091,error=5.6292ppm), and +85 Da (experimental m/z=984.5530,theoretical m/z=984.5513,error=1.7267ppm), compared to peptide 6.The monoacetylated product is drawn as residing on O5 but it could also be on N7. (B) LC-MS/MS analysis of product 7 obtained from in vitro reaction with AmmB1 using labeled L-15 N-leucine.A graph of the ppm errors for each identified ion is shown.7The mass of the peptide was increased by 1 Da in the C-terminal Trp derivative.W = modified Trp.

Figure S14 .
Figure S14.(A) LC-MS analysis of trypsin-digested aminoquinone intermediate 6 and the product 7 of the in vitro reaction with AmmB1 using an 15 N-labeled amino acid mix.The retention time of 7 is slightly earlier than that of 6.Because of the use of 15 N, the m/z of 6 and 7 is nearly the same in this experiment.(B) LC-MS analysis of in vitro reactions of 6 with E. coli tRNA and aminoacyl-tRNA synthetase (AARS) mix and the indicated mixture of amino acids.(C) LC-MS analysis of in vitro reactions of 6 with E. coli tRNA and AARS mix using individual Ile, Leu, Val, Pro, and Ala.

Figure S15 .
Figure S15.Phenylhydrazine derivatization of the AmmB1 in vitro reaction.(A) Reaction product obtained in vitro by incubation of peptide 6 with AmmB1, Leu, LeuRS, tRNA Leu , and ATP.Two possible pathways are shown that explain the observed product.These mechanisms would

Figure S17 .
Figure S17.(A) Crystal structure of the split LanB glutamylation enzyme TbtB (PDB 6EC8).The active site is depicted with a nonreactive substrate analog, 5'-phosphoryl-desmethylglutamycin (B).Phe783 and Arg743 are positioned to provide π-π stacking and hydrogen bonding interactions with the adenosine moiety of the ligand.

Figure S18 .
Figure S18.Constraint-based Multiple Alignment Tool (COBALT) was used to align the PEARLs BhaB1, BhaB4, AmmB1, AmmB4, and TglB versus shorter PEARL-like sequences found by a BlastP search with AmmB4 as query.The red color indicates highly conserved regions and blue indicates less conserved regions.Shorter PEARL sequences align well to the C-terminus of the full length PEARL sequences.In addition, this region has structural homology with ATP-GRASP family members as seen from a 3D data-based search.

Figure S19 .
Figure S19.(A)The AlphaFold predicted structure for TglB was overlayed with the crystal structure for E. coli glutathione synthase (PDB 1GSA) showing the potential binding site of ATP in the predicted PEARL structure.(B) Similarly, AlphaFold predicted structure for TglB was overlayed with the crystal structure for E. coli bifunctional glutathionylspermidine synthase (PDB 2IO9).

Table S1 . DNA sequences for primers and enzymes used in this study
Biosynthetic pathway towards ammosamide C proposed in a previous study