Abstract
When bacterial cells come in contact, antagonism mediated by the delivery of toxins frequently ensues. The potential for such encounters to have long-term beneficial consequences in recipient cells has not been investigated. Here we examined the effects of intoxication by DddA, a cytosine deaminase delivered via the type VI secretion system (T6SS) of Burkholderia cenocepacia. Despite its killing potential, we observed that several bacterial species resist DddA and instead accumulate mutations installed by the toxin, indicating that even in the absence of killing, interbacterial toxins can have profound consequences on target populations. Investigation of additional toxins from the deaminase superfamily revealed that mutagenic activity is a common feature of these proteins, including a representative we show targets single-stranded DNA and displays a markedly divergent structure. Our findings suggest that a surprising consequence of antagonistic interactions between bacteria could be the promotion of adaptation via the action of directly mutagenic toxins.
Introduction
Pathways for the delivery of toxins into contacting cells are widespread in bacteria. These include the type IV-VI secretion systems (T4-T6SS) in Gram-negative bacteria, the Esx secretion system of Gram-positives, and a number of specialized mechanisms that display a more limited distribution (1-5). Although most interbacterial toxins promote competitiveness in producing organisms, the precise impact that they have on recipient cells can vary considerably. For instance, toxins that degrade the cell wall through amidase or muramidase activity lead to cellular lysis, while others such as nucleases cause cell death without the release of cellular contents (6-9). Yet others, including NAD+ glycohydrolases and small ion-selective pore-forming toxins, cause growth arrest without killing (10-12). Beyond these outcomes that are detrimental to recipient cells, there are also scenarios in which toxin delivery could provide a transient benefit. Within populations of toxin producing strains, self-intoxication is prevented through the production of specific immunity determinants that neutralize individual toxins (13). Toxins inactivated by immunity proteins in this manner are generally assumed to have no or little impact. However, in Burkholderia thailandensis, it was demonstrated that a toxin delivered by the contact dependent inhibition pathway (CDI) causes increased expression of more than 30 genes in resistant cells, including those encoding exopolysaccharide biosynthesis proteins and a T6SS (14). This is associated with an increase in cellular aggregation and colony morphology changes. A separate study found that within a population of CDI toxin-producing bacteria, some cells fail to completely neutralize incoming CDI toxins delivered by their neighbors, leading to induction of the stringent response, and ultimately to enhanced antibiotic tolerance via growth arrest of this sub-population (15). The above examples highlight how interbacterial toxins can have short-term beneficial effects. Whether toxin delivery can impact the long-term evolutionary landscape of target cell populations has not been explored.
In this study, we investigate the consequences of cellular intoxication by an interbacterial toxin that acts as a cytosine deaminase. Proteins that catalyze the deamination of bases in nucleotides and nucleic acids are found in all domains of life and play essential roles in a range of physiological functions (16). For instance, deaminases that target free nucleotides and nucleosides contribute to cellular homeostasis of these molecules and can be involved in the biosynthesis of modified nucleic acid-derived secondary metabolites (17-19). RNA-targeting deaminases include the tRNA adenosine deaminase family of proteins (TAD) that contribute to tRNA maturation (20), the double-stranded RNA-specific adenosine deaminase (ADAR) enzymes that affect gene regulation through the editing of mRNA and small non-coding RNA targets (21), and mRNA-editing cytosine deaminases APOBEC1 and members of the DYW family (22, 23). Activation induced cytidine deaminase (AID) and APOBEC3 both target cytosine residues in single-stranded DNA and contribute to the generation of antibody diversity or control of retroviral or other retroelement replication, respectively (24, 25).
Bioinformatic analyses of the origins of the deaminase fold led to the surprising finding that a large and diverse collection of predicted bacterial and archaeal deaminases exhibit hallmarks of substrates of antibacterial toxin delivery pathways, including the T6SS, the Esx secretion system, and the CDI pathway (26, 27). We recently demonstrated that one of these proteins, a T6SS substrate from Burkholderia cenocepacia, acts as a double-stranded DNA-targeting cytosine deaminase (28). This unusual activity stands in contrast with all previously characterized DNA-targeting deaminases, which act preferentially on single-stranded substrates. We also showed that unlike the housekeeping deaminases APOBEC3G, TadA, and Cdd, this protein, which we named DddA (double-stranded DNA deaminase A), is highly toxic when expressed heterologously in E. coli. The mechanism by which DddA intoxicates cells was not determined.
Here we report the discovery that DddA is a potent, direct mutagen of otherwise resistant target bacterial populations. Furthermore, we find that despite considerable differences in sequence, structure, and preferred substrates, deaminase toxins representing other subfamilies similarly possess mutagenic capacity. These results expand the range of outcomes which can result from interbacterial interactions and suggest that deaminases could play a significant role in generating genetic diversity in bacterial populations.
Results
DddA mediates chromosome degradation and DNA replication arrest
We previously demonstrated that DddA is a B. cenocepacia T6SS-1 substrate that deaminates cytosine in double-stranded DNA (28). Cytosine deamination generates uracil, which is removed from DNA by the base excision repair (BER) pathway (29). This process generates abasic sites and previous reports indicate that the presence of these lesions in close proximity on opposite strands can lead to double-strand DNA breaks (30). Thus, to begin dissecting the mechanism by which DddA leads to killing, we examined the impact of the toxin expression on chromosome stability in E. coli. DAPI staining coupled with fluorescence microscopy revealed that the nucleoids of cells exposed to DddA rapidly disintegrate (Figures 1A,B; Supp. Figure 1). We further observed this phenomenon as genome fragmentation detectable by gel electrophoresis analysis of DNA extracted from cultures undergoing DddA-mediated intoxication (Figure 1C).
(A) Fluorescence microscopy of the indicated E. coli strains expressing DddAtox or carrying an empty vector (Control). DAPI staining (DNA) is shown in cyan. Top: Representative micrographs for each condition, scale bar = 10 μm. Bottom: representative images of cells with intact or degraded nucleoids. (B) Quantification of nucleoid state in cells shown in A (n = ∼100-200 cells per condition). (C) Agarose gel electrophoresis analysis of total genomic DNA isolated from the indicated E. coli strains expressing DddAtox, DddA E1347A, or carrying an empty vector (Control) after induction for the time period shown. (D) Fluorescence microscopy indicating genomic uracil incorporation (red) of E. coli strains expressing DddAtox or carrying an empty vector (Control), scale bar = 10 μm. (E) Quantification of uracil labeling signal from cells shown in D (n = ∼50 cells per condition). Values and error bars reflect mean ± s.d. of n = 2 independent biological replicates. *P < 0.0001 by unpaired two-tailed t-test.
Uracil DNA glycosylase (Ung) initiates BER by removing uracil (31). We therefore investigated how cells lacking Ung activity (Δung) respond to DddA expression. The level of uracil in genomic DNA can be measured by treating fixed cells with a fluorescently conjugated, catalytically inactive Ung protein (32). Employing this tool, we found that, as predicted, Ung inactivation leads to the accumulation of uracil in the DNA of DddA-intoxicated cells (Figures 1D,E; Supp. Figure 2). Additionally, Ung inactivation alleviated both nucleoid disruption and DNA fragmentation in E. coli populations expressing DddA (Figure 1A-C). In contrast, deletions of genes encoding the endonucleases Xth and Nfo, which operate downstream of Ung in BER by removing DNA abasic sites, had no impact on DddA-induced nucleoid disintegration. These findings collectively link the effects of DddA on chromosome integrity to uracil removal from DNA by the BER pathway.
(A) Viable cells (colony forming units, cfu) of the indicated E. coli strains recovered following induction of DddAtox or empty vector (Control) for the time period shown. Values represent mean ± s.d. of n=3 technical replicate and data are representative of 3 independent experiments. (B) Representative images from time-lapse microscopy of DnaN-YPet expressing strains 6 hr post-induction of DddAtox, scale bar = 5 μm. (C) Cell tower representation of averaged localized fluorescence intensity of DnaN-YPet expressing strains shown in B over the course of cell lifetimes. (n = 20-300 cells per condition at start of experiment). (D) Mean focus intensity of each frame during time-lapse microscopy of DnaN-YPet expressing strains over the course of 6 hr (n= 20-300 cells per condition at start of experiment). (E) Representation of single nucleotide variants (SNVs) by chromosomal position, frequency, and density in E. coli Δung following 60 min expression of DddAtox. (F) Frequency of the indicated substitutions among the SNVs shown in E. (G) Probability sequence logo of the region flanking mutated cytosines among the SNVs shown in (E).
If Ung-catalyzed removal of uracil from DNA and subsequent chromosome fragmentation is responsible for DddA-mediated killing, we reasoned that Ung inactivation should affect susceptibility to DddA. However, we found that E. coli wild-type and Δung were equally susceptible to intoxication (Figure 2A). It was previously demonstrated that starvation for thymine, which leads to in an increase of uracil incorporation in DNA, can disrupt DNA replication complexes, killing cells in a process known as thymineless death (TLD) (33). Accordingly, we investigated whether DNA replication is affected by DddA. In both wild-type and Δung E. coli strains, DddA induction resulted in the rapid loss of fluorescent foci formed by YPet-labeled DnaN, an established indicator of replication fork collapse (34, 35) (Figure 2B-D; Supp. Figure 3A,B; Supp. Videos 1 and 2).
(A, E, F, G) Competitiveness of the B. cenocepcia strains indicated at bottom against selected Enterobacteriaceae (A), Moraxelacea (E), Pseudomonadaceae (F) or Burkholderiacea (G). Pairs of organisms were cocultured on a solid surface for 6 hr. (B, H and K) Mutation frequency as measured by spontaneous rifampicin resistance frequency in clones of different species recovered from growth in competition with B. cenocepacia wild-type or B. cenocepacia dddAE1347A (n = 2). (C, I, and L). Distribution of different mutation types detected in rpoB or the whole genome of rifampicin resistant clones of different species recovered after growth in competition with wild-type B. cenocepacia (D, J, and M). Genome distribution of different mutation types detected by whole-genome sequencing of rifampicin-resistant clones recovered after competition with B. cenocepacia. Species targeted include E. coli. (B-D), E. coli EHEC (H-J), and K. pneumoniae (K-M). (C) Pattern of spontaneous mutation types observed in rpoB of E.coli derived from (40). Values and error bars reflect mean ± s.d. of n = 2 independent biological replicates with n=3 technical replicates each. *P < 0.0001 unpaired two-tailed t-test. Spont. Spontaneous; n.d., not detected.
To further probe how DddA intoxicated cells die, we sequenced the transcriptome of E. coli cells expressing DddA for one-hour – a time point by which >99% of cells are no longer viable (Figure 2A). Despite substantial sequencing depth, and the known ability of RNA polymerase to readily incorporate adenine opposite uracil residues encountered in DNA (36, 37), this experiment yielded no evidence for the incorporation of mutations into transcripts (Supp. Figure 3C,D). However, genomic DNA sequencing at this time point revealed widespread C•G-to-T•A transitions in the preferred context for DddA (5’-TC-3’), consistent with the uracil enrichment we observed by fluorescent labeling (Figure 1D,E, and 2E-G). We note that genomic DNA sequencing could only be conducted on the Δung background, as nucleoid deterioration of wild-type intoxicated cells prohibited sequencing library construction (Figure 1A-C). Taken together with our nucleoid integrity and DNA replication reporter data, these findings suggest that the fate of cells intoxicated by DddA is determined prior to nucleoid deterioration and prior to or coincident with the inhibition of transcription. Our findings do not rule out a mechanistic overlap between DddA-and TLD-mediated cell killing. In this regard, it is noteworthy that despite a considerable volume of research spanning several decades, the molecular underpinnings of TLD remains incompletely understood (33).
Bacteria resistant to DddA-mediated intoxication accumulate DddA-catalyzed mutations
The experiments described above were performed in E. coli heterologously expressing DddA, which may not capture the physiological impact of the toxin on recipient cell populations when it is delivered by the T6SS of B. cenocepacia. As a first step toward assessing the effect of DddA on recipient cells, we performed interbacterial competition assays between B. cenocepacia wild-type or a strain bearing catalytically inactive DddA (dddAE1347A) and E. coli (Figure 3A). Surprisingly, we found no evidence of DddA-mediated inhibition of E. coli in these experiments. A straightforward explanation for these results is that T6SS-1 of B. cenocepacia is unable to deliver toxins to E. coli. However, we observed that a B. cenocepacia strain lacking T6SS-1 activity (ΔicmF) exhibits reduced competitiveness toward E. coli, indicating that toxin delivery can occur between these organisms.
Since our earlier results clearly demonstrated the capacity of DddA to act within E. coli, we tested whether delivery of DddA via the T6SS could lead to the accumulation of mutations, rather than cell death. Precedence for the mutagenic activity of cytosine deaminases at physiological levels is well established for eukaryotic enzymes, including those of the APOBEC and AID families (38, 39). Strikingly, we found that E. coli populations subject to intoxication by T6SS-1 of wild-type B. cenocepacia display a nearly 10-fold increase in the frequency of rifampicin resistant (RifR) cells compared to those exposed to B. cenocepacia dddAE1347A (Figure 3B). We next used DNA sequencing to establish whether these mutations derive directly from the activity of DddA. Resistance to rifampicin can result from at least 69 base substitutions distributed over 24 positions in rpoB, which encodes the β-subunit of RNA polymerase (40). Sequencing of rpoB from 25 independent clones obtained from E. coli populations grown in competition with B. cenocepacia revealed that in all but one of these, rifampicin resistance was conferred by C•G-to-T•A transitions within the preferred context of DddA (Figure 3C, Supp. Table 1). In contrast, a study of 152 spontaneous RifR mutants in E. coli found only 32 such mutations in rpoB (40), indicating significant enrichment for this sequence signature in clones deriving from DddA-intoxicated populations (Fisher’s exact test, p<0.001). Whole genome sequencing (WGS) of 10 RifR clones isolated from these experiments led to the identification of an additional 12 mutations outside of rpoB distributed across these strains, and all were C•G-to-T•A transitions in a 5’-TC-3’ context (Figure 3C,D, Supp. Table 1). Both a Fisher exact test and negative binomial regression analysis indicated a significant enrichment for C•G-to-T•A mutations in the context preferred by DddA in intoxicated populations when compared to the pattern of mutations found to arise spontaneously in E. coli under neutral selection (P<0.005) (41). Together, these data provide compelling evidence that DddA delivered to E. coli during interbacterial competition drives mutagenesis.
Our finding that DddA delivered to E. coli via the T6SS of B. cenocepacia led to mutagenesis rather than killing led us to question whether the toxin generally exhibits this property, or whether it can be lethal against select recipients. To evaluate this, we performed additional competition experiments between B. cenocepacia WT or dddAE1347A and a panel of Gram-negative organisms (Figures 3A,E-G). We found that the effect of DddA delivery via the T6SS varied; in general, enteric species and Acinetobacter baumannii (Figures 3A,E) resisted intoxication, while other species, including Pseudomonas aeruginosa, P. putida, and B. thailandensis (Figure 3F,G), were highly sensitive to DddA-mediated killing. Indeed, in these sensitive bacteria, DddA accounted for all of the inhibitory activity of the T6SS of B. cenocepacia.
We next examined whether bacteria resistant to DddA-mediated killing generally exhibit an accumulation of mutations following intoxication. Among the resistant organisms, Klebsiella pneumoniae and enterohemorrhagic E. coli (EHEC) displayed elevated RifR frequency when in contact with B. cenocepacia containing active DddA than when in contact with B. cenocepacia bearing the dddAE1347A allele. Similar to our findings in E. coli, rpoB genes from RifR K. pneumoniae and EHEC clones isolated from cells exposed to wild-type B. cenocepacia exhibited mutagenesis signatures consistent with DddA activity (Figure 3H-M, Supp. Table 1). We further probed the mutagenic potential of DddA within these clones by performing WGS. Within the 11 K. pneumoniae isolates we sequenced, 50 of the 51 SNPs detected were C•G-to-T•A transitions, and each of these was located in the 5’-TC-3’ context preferred by DddA (28), a highly significant enrichment compared to spontaneous mutation patterns (P<0.001) (41). The EHEC isolates we sequenced shared this trend, with 82 of 88 SNPs across seven strains corresponding to C•G-to-T•A transitions in the context preferred by DddA. We found no indication of clustering of DddA-induced mutations across genomes of E. coli, K. pneumoniae or EHEC, suggesting that mutagenesis by the toxin occurs in a random fashion (Figure 3D, J, and M).
Finally, we asked whether species susceptible to killing by DddA could also accumulate mutations in cells that survive intoxication. The genomes of 65 RifR P. aeruginosa clones derived from interbacterial growth competition experiments between P. aeruginosa and B. cenocepacia revealed mutations exclusively within rpoB, and only 15% of these mutations conformed to the mutagenic signature of DddA (Supp. Table 1). These results raised the formal possibility that the lack of mutations accumulated by species sensitive to DddA killing is attributable to a lack of cytosine deamination activity. However, single nucleotide variant (SNV) analysis of genomic DNA sequences, mass spectrometric measurement of genomic uracil content, and cellular fluorescence labeling of uracil from contacting P. aeruginosa and B. cenocepacia cells together provided conclusive evidence of DddA-catalyzed cytosine deamination (Figure 4A-C, Supp. Figure 4).
(A) Mass spectrometric quantification of uracil in genomic DNA obtained from 1hr co-cultures of the indicated strains of P. aeruginosa with B. cenocepacia wild-type or dddAE1347A. Values and error bars reflect mean ± s.d. of n = 3 independent biological replicates. *P < 0.001, unpaired two-tailed t-test. B, C. SNVs detected in populations of P. aeruginosa wild-type (B) or Δung (C) after one hour growth in competition with B. cenocepacia. SNVs are plotted according to their chromosomal position and frequency and colored according to their relative density. (D) Mutation frequency as measured by spontaneous rifampicin resistance frequency in P. aeruginosa Δung recovered after 1 hr growth in competition with the indicated strain of B. cenocepacia. Values and error bars reflect mean ± s.d. of n = 2 independent biological replicates with n=3 technical replicates each. *P < 0.001, unpaired two-tailed t-test. (E) Cell viability of the indicated P. aeruginosa strains after 1 hr growth in competition with B. cenocepacia wild-type or dddAE1347A. Values and error bars reflect mean ± s.d. of n = 2 independent biological replicates.
Furthermore, intoxication of P. aeruginosa Δung by DddA yielded elevated mutation frequency, resembling that of Enterobacteriaceae exposed to the toxin (Figure 4D). Interestingly, the susceptibility of P. aeruginosa to DddA intoxication was not influenced by ung inactivation (Figure 4E), consistent with our conclusion from heterologous expression-based assays that cell death mediated by the toxin occurs upstream of massive uracil accumulation and widespread chromosome fragmentation (Figure 1).
Diverse deaminase toxins have mutagenic activity
Our discovery that DddA can be a mutagen in bacterial communities prompted us to investigate whether deaminase toxins might more broadly impact the mutational landscape of bacteria. Aravind and colleagues describe 22 sequence divergent subfamilies into which enzymes belonging to the deaminase superfamily distribute (16). Among these, eight subfamilies possess members associated with interbacterial toxin systems (Figure 5A). For instance, DddA belongs to subfamily SCP1.201-like, herein renamed Bacterial Deaminase Toxin Family 1 (BaDTF1), composed of members from Proteobacteria and Actinobacteria. We selected representatives from two additional deaminase superfamily subgroups containing predicted interbacterial toxins: P. syringae WP_011168804.1 of the DYW-like subgroup, of which we reassigned the bacterial toxin members to BaDTF2, and Taylorella equigenitalis WP_044956253.1 of the Pput_2613-like subgroup, renamed BaDTF3 (Figure 5A). Like DddA, both exhibited toxicity when heterologously expressed in E. coli (Figure 5B). We then evaluated whether these toxins have mutagenic activity by sequencing DNA extracted from E. coli expressing each protein. To both increase our sensitivity for detecting cytosine deaminase activity and to circumvent the potential for barriers to sequencing caused by BER-generated lesions in DNA (e.g. abasic sites and double stranded breaks), we employed the Δung background in these studies. Remarkably, despite their sequence divergence from DddA, representatives from both BaDTF2 and BaDTF3 introduced high levels of C•G-to-T•A transitions (Figure 5C-F). However, each operated within a unique sequence context, which differed from that preferred by DddA. The BaDTF2 representative preferentially targeted cytosine located within pyrimidine tracks (5’-YCY), but could also deaminate cytosines in all other contexts with lower efficiency (Figure 5G). In contrast, the BaDTF3 member targeted cytosine most efficiently when preceded by thymidine, and less efficiently when preceded by adenosine or cytosine; it did not act on cytosines preceded by guanosine (5’-HC) (Figure 5H).
(A) Dendogram indicated evolutionary history of clades within the deaminase superfamily their predicted substrates (colored dots), modified from (16). Predicted toxins with unknown substrates, yellow boxes; deaminases toxins with defined biochemical activity, pink boxes. (B) Toxicity of representative BaDTF2 and BaDTF3 toxins as indicated by the proliferation (fold change in cfu recorved) of E. coli after one hour expressing the toxins or the empty vector (Control). Values represent means ± s.d., and asterisks in indicate statistically significant difference (p < 0.001, n = 2). (C and D) Representation of SNVs by chromosomal position, frequency, and density in E. coli Δung after 1 hr expression of representative BaDTF2 (C) or BaDTF3 (D) toxins. (E and F). Distribution of different nucleotides substitutions among SNVs detected in E. coli Δung expressing representative BaDTF2 (E) or BaDTF3 (F) toxins. (G and H) Probability sequence logo of the region flanking mutated cytosines from E. coli Δung intoxicated with representative BaDTF2 (G) or BaDTF3 (H) toxins. (I and J). In vitro cytidine deamination assays for BaDTF2 toxin SsdA using a single-stranded (I) or double-stranded (J) FAM-labelled DNA substrate (S) with cytidines in the contexts CC, TC, AC and GC. Cytidine deamination leads to products (P) with increased mobility. A3A, APOBEC3A (control for activity on ssDNA). DddAtox was used as a control for activity toward dsDNA. K. In vitro cytidine deamination assays for SsdAtox or DddAtox using a single-stranded or double-stranded FAM-labelled DNA substrate with a single cytidine in the context indicated at top. Gels shown in I-K are representative of two replicates Data in B represent means ± s.d., and asterisks in indicate statistically significant difference (p < 0.001, n = 2).
Previously characterized proteins in the DYW deaminase family are found in plants and protists and participate in C to U editing of specific organellar mRNA transcripts (23, 42, 43). In these organisms, the DYW deaminase domain is generally found at the C-terminus of a larger polypeptide characterized by pentatricopeptide repeats (PPRs), which recruit the catalytic domain of the protein to target mRNAs (44). The mutagenic activity of P. syringae WP_011168804.1 suggested that contrary to other DYW proteins, the substrate range of this BaDTF2 member could include DNA. To examine this biochemically, we expressed and purified the toxin domain of the P. syringae enzyme and performed in vitro deamination assays on single-stranded and double-detectable increase in the number of C•G-to-T•A transitions in cDNA sequences (Supp. Figure 5B,C).
The SsdA structure specifies a new group of bacterial deaminases
To begin to understand the basis for the distinct substrate preference exhibited by SsdA, we determined its structure in complex with its immunity determinant, SsdAI, to 3.0 Å (Figure 6A, Supp. Table 2). The structure of SsdA exhibits the basic characteristics of deaminase enzymes, including active site histidine and cysteine residues in position to coordinate a zinc ion, and the five β-strands (S1-5) and three α-helices (H1, H3, and H4) that constitute the core fold of deaminase superfamily enzymes (Figure 6B) (16). More specifically, SsdA groups with the C-terminal hairpin division of the superfamily, and consistent with this assignment, S4 and S5 of SsdA are antiparallel. This is a notable divergence from all other characterized deaminases that act on ssDNA, for instance members of the APOBEC family, wherein these strands are parallel. Outside of its basic fold, SsdA bears little structural homology with characterized deaminases, including DddA (Figure 6C-E). Indeed, SsdA bears most overall structural similarity with the folate-dependent transformylase domain of PurH (DALI; Z score, 7.7), which contains the stranded DNA templates. Unlike DddA, the BaDTF2 protein exhibited potent cytosine deaminating activity toward ssDNA (Figure 5I). Cytosine residues in dsDNA were also targeted, but with considerably lower efficiency (Figure 5J). Based on these data, we named the P. syringae representative of the BaDTF2 subfamily single-strand DNA deaminase toxin A (SsdA). Consistent with the in vivo mutagenesis results, purified SsdAtox could target cytosine residues preceded by any of the four bases (Figure 5K). Notably, we detected only residual activity of SsdAtox toward RNA targets in vitro, suggesting that DNA is the physiologically relevant substrate of the toxin (Supp. Figure 5A). This is supported by RNA-seq analysis of E. coli cells expressing SddAtox, which revealed no deaminase fold despite its sequence and functional divergence from deaminase enzymes (16, 45).
SsdA bears little resemblance to DddA and reveals motifs differentiating toxins from RNA-targeting DYW proteins. (A) Ribbon diagram depiction of the SsdAtox-SsdAI structure. (B,C) Secondary structure diagrams for SsdAtox and the core fold of deaminase superfamily proteins (16). The SWG motif conserved in BaDTF2 toxins and the accompanying α-helical insertion (blue) are indicated in B. (D) Active site view of SsdAtox. Catalytic and zinc-coordinating residues, SGW motif and α-helical insertion (blue) are indicated. (E) Active site view of DddAtox indicating catalytic and zinc-coordinating residues. (F) Zoom-in view of the contact site between SsdAI and the active site of SsdAtox. (G) Conserved motifs identified in DYW and BaDTF2 proteins.
SsdAI is an exclusively α-helical protein that shares a large (∼1300 Å2) interface with SsdA. As is often observed in toxin–immunity co-crystal structures, this interface effectively demarcates the active site of SsdA (Figure 6A) (13). Much of the SsdA-interacting surface of SsdAI is composed of a protrusion that includes an extended loop and two short helical segments that project residues into the active site cavity of SsdA (Figure 6F). Interestingly, these residues contact a conserved Ser-Gly-Trp (SGW) motif that resides in close proximity to the active site of SsdA and is unique to the BaDTF2 members of the DYW subgroup.
SsdA represents the first member of proteins classified into the DYW-family to be structurally characterized, precluding a direct comparison with proteins from this family known to target mRNA. However, the fact that SsdA displays a substrate preference distinct from the related plant and protist proteins led us to search for sequence elements that differ between BaDTF2 members and canonical DYW proteins. Our search revealed several conserved features of DYW proteins previously demonstrated to be important for mitochondrial or chloroplast gene editing in vivo that are absent from the BaDTF2 proteins. These include the eponymous C-terminal DYW residues, a CxC motif believed to be important for coordinating a second Zn2+ atom, and a proline-glycine (PG) motif located within a characteristic insertion between S2 and H2 of the deaminase fold (Figure 6G) (46-48). Instead, BaDTF2 proteins contain the aforementioned SGW motif within this insertion. Given the significant divergence of functionally critical regions between DYW and BaDTF2 proteins, we propose that BaDTF2 members constitute a new deaminase subfamily. The universal link of BaDTF2 proteins to interbacterial antagonism pathways leads us to speculate that BaDTF2 proteins beyond SsdA are likely to target ssDNA.
Discussion
In this work, we have shown that interbacterial deaminase toxins can act as potent mutagens of target cells. This discovery provides a previously unrecognized, and potentially widespread mechanism by which bacteria can acquire genetic diversity that allows them to adapt to changing environmental conditions. The ecological ramifications of mutagenesis by deaminase toxins is not yet understood. There are myriad sources of single nucleotide substitutions within bacterial populations. These include endogenous cellular maintenance activities such as replication and metabolism, and environmental stresses including xenobiotics and ionizing radiation (49, 50). Studies often report C•G-to-T•A transitions as the most common mutation type observed within evolving bacterial populations. Unfortunately, the majority of these studies are performed on monocultures of in vitro grown bacteria, limiting their applicability to our understanding of deaminase toxins (41, 51, 52). Nevertheless, they do provide some insights into the origins of cytosine mutations in natural contexts, as they implicate spontaneous cytosine deamination in the GC skew observed between leading and lagging strands on bacterial chromosomes (53). Relative to spontaneous cytosine deamination, the global contribution of cytosine deaminase toxins to the landscape of cytosine mutations is likely minor. However, under certain circumstances, cytosine deaminase toxins could significantly impact evolutionary outcomes. For instance, based on the ability of DddA to install multiple mutations on a single genome during one intoxication event, deaminase toxins may facilitate the emergence of phenotypes that are otherwise slow to evolve. This could be clinically relevant to the treatment of polymicrobial infections, where bacteria that possess deaminase toxins are present and the resistance to certain antibiotics requires a succession of mutagenic events (54, 55).
Our study leads us to question whether mutagenesis of target cells by deaminase toxins occurs opportunistically, or whether this phenomenon could be an evolutionarily selected property of the toxin in certain instances. There is strong evidence that hypermutator phenotypes are under selection within bacterial populations (56-58). In these cells, the positive contribution of certain mutations on fitness is presumably greater than the reduction on fitness by other, deleterious mutations (59, 60). In cooperating bacterial systems, it is conceivable that such hypermutator phenotypes could be achieved with the aid of neighboring cells via the delivery of a mutagenic toxin. This naturally prompts the question, what is the benefit to the toxin-producing cell in this scenario? If the producer and recipient are co-dependent, the producer would obtain indirect benefits through the installation of adaptive mutations within the recipient. Our data show that the same deaminase toxin can mutagenize certain bacteria, while causing death in others. Therefore, a toxin that acts as a mutagen of a cooperating bacterium can remain under selection as a component of the antibacterial arsenal of the producer.
With a relatively small sampling of bacterial diversity, we found both species that are highly susceptible to DddA and others that are completely resistant to the toxin. This suggests that the determinants of cell fate following intoxication by DddA frequently vary between species. DNA repair pathways may provide one relevant source of diversity. Although our work did not implicate BER in susceptibility to DddA, we cannot rule out that the documented differences within this pathway could be responsible for at least part of the variability we observe (61). Our data show that Ung effectively removes the uracil resulting from DddA activity. However, the bulk uracil measurement methods we employed may not capture small changes in uracil levels distributed across bacterial chromosomes, or localized hotspots of uracil accumulation. Thus, it remains possible that levels of other glycosylases that act on uracil to initiate BER, such as mismatched uracil glycosylase (Mug), may influence cell fate following exposure to DddA (62). Interestingly, mug orthologs are not universal among bacteria; the gene is present across Enterobacteriaceae, but it is not found in most Pseudomonads (63, 64). Another pathway that varies widely in bacteria and is implicated in uracil removal is mismatch repair (MMR). This pathway does not play a role in the removal of misincorporated uracils; however, those that derive from cytosine, and thus generate a mismatch, are potential substrates (65). There is strong evidence linking MMR to uracil removal from U•G mismatches in eukaryotes, but the sparsity of comparable data in bacterial systems renders it challenging to estimate the extent to which MMR influences susceptibility to DddA (66).
Other explanations for the variable effects of DddA on recipient cells may be unrelated to DNA repair entirely. Killing of target cells by interbacterial antagonistic systems can rely on the delivery of exceedingly small numbers of toxin molecules – in some cases as few as one (67). We posit that the dependence on so few proteins could leave the fate of recipient cells subject to stochastic behavior. Recognition and turnover of DddA via cellular proteolytic machinery is one potential source of this stochasticity. In the majority of cells belonging to a resistant species, proteolytic machinery may fortuitously and effectively deplete DddA; however, in a small subset of these cells, this machinery might fail to act before DddA installs one or more mutations. In such a scenario, a species sensitive to the toxin would be expected to lack a pathway for DddA degradation altogether. This offers an explanation for our inability to detect mutations with surviving cells of bacteria sensitive to DddA; these may constitute a subpopulation that was not directly exposed to DddA.
Based on the inferred relationship between SsdA and DYW proteins, we were surprised to find that SsdA targets DNA, and not RNA (16). Our data suggest that there may be a mosaic of substrate specificities even within relatively closely related clades of deaminases. Analyses performed by Aravind and colleagues provide support for the evolutionary origins of DYW proteins in bacteria (16). Although there is substantial evidence implicating DYW proteins in RNA editing, little of this is direct and to our knowledge the activity of only one DYW protein has been reconstituted in vitro (23). Therefore, it will be of interest to better understand whether DYW family members target RNA exclusively, or whether some might play roles in DNA editing. In this regard, our structure of the SsdA–SsdAI complex likely provides some insights. For example, it places the PG motif region, which differs markedly between BaDTF2 and DYW proteins, in position to engage substrate. Our structure also permits an initial comparison between bacterial deaminase toxins that target dsDNA (DddA) versus ssDNA (SsdA). Despite the predicted shared ancestry of DddA and SsdA, DddA displays significantly greater structural relatedness to eukaryotic deaminases targeting ssDNA (AID/APOBEC) than it does to SsdA. Taken together, this suggests that cytosine deaminases targeting ssDNA evolved at least twice from the ancestral deaminase fold (16). Finally, the differing target specificities of DddA and SsdA suggest that both ds and ssDNA may be effective targets for toxic deaminases. Given that ssDNA represents only a fraction of total genetic material typically present in a bacterial cell, this finding seems counterintuitive. One explanation may be that because highly transcribed genes are more likely to exist in a ss state, the toxicity of SsdA could result from the targeting of these specific regions. Alternatively, the lower level of dsDNA targeting by SsdA that we observe in vitro may represent the physiologically relevant activity of the protein during cellular intoxication.
While nutrient deprivation typically results in growth arrest, thymine starvation induced through genetic inactivation or chemical inhibition of thymine biosynthesis leads to cell death in organisms from E. coli to humans, in a process known as TLD (33, 68). There remains no consensus on the precise mechanism of TLD, but it is nevertheless possible to draw parallels between TLD and DddA intoxication. Both processes involve the incorporation of uracil into DNA and produce complex phenotypes including DNA replication arrest and chromosome degradation (33). Also, for both TLD and DddA intoxication, the removal of uracil incorporated into DNA does not rescue cells from intoxication (69). More generally, suppressor mutants resistant to killing by TLD appear to be difficult to obtain (70, 71). Our finding that Ung inactivation alleviates the phenotype of nucleoid degradation, yet Ung-deficient strains remain as sensitive to intoxication as the wild type, indicates that redundancy built into the mechanism by which DddA kill cells could similarly prevent suppressor mutant emergence. For TLD, this property has enabled the development of anticancer and antimicrobial drugs that induce TLD by acting as thymine analogs (72, 73); for DddA, difficulties associated with developing resistance to toxin activity may explain why toxins predicted to act as deaminases have become prevalent in the bacterial kingdom (16).
Our studies of the mutagenic potential of deaminase toxins suggest these proteins may play an important role in the generation of diversity in microbial communities. Although researchers have overcome many hurdles to deciphering the intricate mechanistic details of interbacterial toxin delivery systems in vitro, it has remained difficult to extrapolate these results to the broader impact of the systems on bacterial ecology. In this regard, the indelible signature that deaminase toxins leave within the genomes of target cells offers a unique opportunity. Future studies aimed at mining genomic and metagenomic datasets for sequence signatures indicative of deaminase toxin activity have the potential to provide heretofore unobtainable insights into the complexities of bacterial interactions in nature.
Materials and Methods
Bacterial strains and culture methods
Unless otherwise noted, bacterial strains used in this study were cultivated in Lysogeny Broth (LB) at 37 °C with shaking or on LB medium solidified with agar (LBA, 1.5% w/v, except as noted). When necessary, antibiotics were supplied to the media in the following concentrations: carbenicillin (150 μg ml−1), gentamycin (15 μg ml−1), trimethoprim (50 μg ml−1), chloramphenicol (25 μg ml−1), irgasan (50 μg ml−1), kanamycin (50 μg ml−1) and streptomycin (50 μg ml−1). E. coli strains DH5α, XK1502 and BL21 were used for plasmid maintenance, toxicity and mutagenesis assays, and protein expression, respectively. A detailed description of the remaining bacterial strains and plasmids used in this study is provided in Supp. Table 3.
Molecular biology techniques and plasmid construction
All primers used in this study are listed in Supp. Table 3. The molecular biology reagents for DNA manipulation, Phusion high fidelity DNA polymerase, restriction enzymes, and Gibson Assembly Reagent were acquired from New England Biolabs (NEB). GoTaq Green Master Mix was obtained from Promega. Primers and gBlocks used in this study were acquired from Integrated DNA Technologies (IDT). To generate a pETDuet-1-based expression construct for SsdAtox and SsdAI, the corresponding gene fragment and complete gene were amplified from Pseudomonas syringae and cloned into the MCS-1 (BamHI and NotI sites, introducing an N-terminal hexahistidine tag) and MCS-2 (NdeI and XhoI sites) respectively using Gibson assembly. For deaminase toxicity assays, ssdAtox was amplified from P. syringae and the toxin domain of candidate BaDTF3 deaminase EL142_RS06975 of Taylorella equigenitalus was obtained by gBlock synthesis. Each were cloned individually into pSCRhaB2 (NdeI and XbaI sites) by Gibson assembly. The corresponding immunity genes ssdAI and EL142_RS06970 were also amplified and synthesized, respectively, and then cloned by Gibson assembly into pPSV39 (SacI and HindIII sites). A vector for markerless in-frame deletion of ung in P. aeruginosa PAO1 was generated by amplifying and combining 600bp regions flanking the ung gene in the pEXG2 vector using PCR and Gibson assembly, generating pEXG2::Δung.
Construction of genetically modified P. aeruginosa
A markerless in-frame deletion of ung in P. aeruginosa PAO1 was generated by allelic exchange with the suicide vector pEXG2::ung, employing SacB-based counter selection (74). The pEXG2::Δung vector was transformed into E. coli SM10 for conjugation into P. aeruginosa. Conjugation was performed by incubating a 1:1 mixture of E. coli SM10 (pEXG2::Δung) with P. aeruginosa for 6 hr on LBA. Selection for chromosomal integration of pEXG2::ung in P. aeruginosa was achieved by plating on LBA supplemented with irgasan and gentamycin. Resulting merodiploids were grown overnight, then plated on LBA supplemented with 5% (w/v) sucrose for SacB counter selection. Deletion of ung in resulting gentamycin susceptible colonies was confirmed by PCR.
Construction of genetically modified E. coli
Deletions in E. coli were generated with Lamda-Red recombination (75). Deletion cassettes for nfo and xthA were generated by amplifying the chloramphenicol resistance gene from pKD3 and kanamycin from pKD4 respectively, adding 50 base pairs of the region flanking the deletion target to the amplicon. Expression of the recombinase in E. coli carrying pKD46 was induced by sub-culturing an overnight culture of the strain in a 1:100 dilution in LB at 30 °C with the addition of 20mM arabinose. At OD600 0.6, the cells were recovered, washed repeatedly with sterile water then transformed by electroporation with the deletion cassettes. Successful deletion of the targeted genes was confirmed by antibiotic resistance and PCR. Both mutations were combined in a single strain by P1 phage transduction (76). Lysates of E. coli nfo::cm were prepared by sub-culturing and overnight culture of the strain diluted 1:100 in LB until OD600 0.6, at which point different concentrations of P1 were added to the culture samples. After one hour of incubation, cultures that exhibited strong lysis were used for phage lysate preparation. Cell debris was removed by centrifugation and droplets of chloroform were used to remove any viable cells remaining. Transduction was performed by combining an overnight culture of E. coli xthA::kan with phage lysate at different rations (1:1, 1:10, 1:100) in a final volume of 200µL and incubating for 30 min at 37 °C. Transduction was stopped with the addition of 100µL 1 M Na-Citrate (pH 5.5) followed by an additional 30 min incubation at 37 °C. Cells were then plated onto LB agar supplemented with kanamycin, choloramphenicol and 100mM Na-Citrate. Transductants were confirmed by PCR.
Nucleoid and genomic DNA integrity assays
For fluorescence microscopy-based nucleoid integrity assays, overnight cultures of E. coli strains were grown with the appropriate antibiotic. The cells were then subcultured by diluting 1:10 into fresh media, and incubated until OD600 = 0.6, then cultures were supplemented with 0.2% rhamnose and incubated for one additional hour. Cells were recovered and fixed in 4% formaldehyde for 15 minutes on ice, followed by a wash and resuspension in PBS. The resuspended cells were stained with DAPI (3µg/ml) for 15 minutes then transferred to a 2% agarose pad in PBS for visualization. Fixed cells were imaged under a Nikon widefield microscope with a 60X oil objective. For each condition, individual cells were segmented and quantified from 3 field of views using SuperSegger software (77). We obtained the average fluorescence intensity per pixel of each cell. A histogram of these averages yielded a bimodal distribution, from which we determined a cutoff value that separated degraded and intact nucleoids.
Cultures for gel electrophoresis-based gDNA integrity assessment were prepared in a similar way, but with the addition of sample collection after 0, 30, and 60 minutes after induction. A total of OD600 = 0.3 in 1 ml of cells was pelleted and gDNA was extracted using DNeasy Blood & Tissue kit (Qiagen) using the Gram-negative bacterial protocol followed by quantification using Qubit. A total of 100ng of gDNA was used for integrity visualization by 1% agarose gel electrophoresis, followed by with ethidium bromide staining imaging with an Azure C600.
Fluorescence and phase contrast microscopy of YPet-labeled DnaN
Prior to imaging, E. coli encoding YPet-labeled DnaN and carrying pSChRaB2::dddAtox or empty vector and pPSV39::dddIA were grown overnight with the appropriate antibiotics and 160 µM IPTG to induce immunity gene expression. Cells were then back diluted into M9 minimal medium (1X M9 salts, 2 mM MgSO4, 0.1 mM CaCl2, 0.2% glycerol, 10μg/ml of thiamine hydrochloride, and 100μg/ml each of arginine, histidine, leucine, threonine, and proline) with IPTG and grown in a bacterial tissue culture roller drum incubated at 30°C until reaching OD600 = 0.1. The cultures were then washed twice in the base growth medium (without IPTG) to lessen residual immunity expression. A volume of 2μl was spotted onto a thin 2% (w/v) low-melt agarose (Invitrogen UltraPure LMP Agarose) pad composed of the base growth medium and 0.2% rhamnose for toxin induction. The sample was sealed on the pad under a glass cover slip using VaLP (a 1:1:1 Vaseline, lanolin, and paraffin mixture). Microscopy images were acquired using a Nikon Ti-E inverted microscope fitted with a 60x oil objective (Nikon CFI Plan Apo Lambda 60X Oil), automated focusing (Nikon Perfect Focus System), a mercury arc lamp light source (Nikon IntensilightC-HGFIE), a sCMOS camera (Andor Neo 5.5), a custom built environmental chamber, and image acquisition software (Nikon NIS Elements). The samples were imaged at 30°C at 5 minute intervals over the duration of 6 hours. Cells were segmented in the time-lapse movies using SuperSegger (77), a MATLAB based image processing and analysis package, with focus tracking enabled (78). Data points for the mean focus intensity plots (Figure 2D) were generated by averaging the scores of the single brightest focus of each segmented cell in each frame and then averaging that value over six fields of view, with the error bars representing the standard deviation of the final average. For the intoxicated strains, cell counts in the final frame fell within 300-600 for each FOV. For strains with the empty vector, final counts fell within 600-1,500. Focus scores larger than 30 were excluded as outliers. After obtaining the data points, a centered moving average with nine points in the sample window was used to smooth out fluctuations.
Bacterial competition and mutation frequency experiments
The fitness of B. cenocepacia strains in interbacterial interactions was evaluated in coculture growth assays. B. cenocepacia and competitor strains were grown overnight then concentrated to reach OD600 of 400 (B. cenocepacia) or 40 (competitors). The resulting cell suspensions were mixed in a 1:1 ratio for each B. cenocepacia-competitor pair, and 10, 10 μl samples of each mixture were spotted on a 0.2 μm nitrocellulose membrane placed on LBA (3% w/v) and then incubated 6 hours incubation at 37 °C, or for 30, 90 and 150 min for time course growth competition assays. After incubation, cells were recovered from the membrane surface and resuspended in 1 ml LB. The initial and final B. cenocepacia to competitor ratios were determined by diluting and plating fractions of the cultures on LBA with antibiotics selective for each organism. Mutation frequency was measured by plating post-incubation cultures onto LBA supplemented with rifampicin and an antibiotic to select for the non-B. cenocepacia competitor. Mutation frequency for the competitor species was then calculated by dividing the number of rifampicin resistant cfu by the total number of cfu recovered of that organism.
rpoB and whole-genome sequencing of rifampicin resistant colonies
Rifampicin resistant colonies obtained from competition experiments were used for rpoB and whole-genome sequencing. For rpoB sequencing, the gene was amplified from resistant clones by colony PCR, and PCR amplicons were used as templates for Sanger sequencing. For whole genome sequencing, cultures of isolates in LB were used for gDNA extraction using the Gram-negative bacterial protocol for the DNeasy Blood & Tissue kit (Qiagen). Extraction yield was quantified using a Qubit (Thermo Fisher Scientific). Libraries were constructed using the Nextera DNA Flex Library Prep Kit (Illumina) according to the manufacturer’s protocol. Library concentration and quality was evaluated with a Qubit and 1% agarose gel electrophoresis. An Illumina iSeq (300 cycles paired-end program) was used for sequencing. Reads were mapped to reference genomes using the BWA software (79); references genomes employed were NC_000913.3 for E. coli AB1157, NC_002655.2 for E. coli EHEC, and CP000647.1 for Klebsiella pneumoniae. Pileup data from alignments were generated with SAMtools, and variant calling was performed with VarScan (80, 81). For SNP calling, SNPs were considered valid if they had a frequency greater than 90%, with p-value < 0.05, and if they were not present with parental strain.
SNV analysis
SNV analysis was performed for both E. coli cultures intoxicated by heterologous expression of candidate deaminases (see E. coli toxicity assays) and for P. aeruginosa grown in coculture with B. cenocepacia (see Bacterial competition and mutation frequency experiments). For E. coli, 1 ml of pelleted cells subjected to intoxication was employed for genomic DNA extraction; for P. aeruginosa, cells were collected from 10, 10 ul competition mixtures prepared and grown as described above for one hour. In each case, gDNA extraction was performed with the DNeasy Blood & Tissue kit (Qiagen) using the Gram-negative bacterial protocol. Extraction yield was quantified using a Qubit (Thermo Fisher Scientific). Sequencing libraries were constructed using the Nextera DNA Flex Library Prep Kit (Illumina) as recommended by the manufacture, except the Enhanced PCR Mix (EPM) was substituted with KAPA HiFi HotStart Uracil+ ReadyMix (Kapa Biosystems) to enable the amplification of uracil encountered in the DNA template. Library concentration and quality was evaluated with a Qubit and 1% agarose gel electrophoresis. Sequencing was performed with an Illumina iSeq (300 cycles paired-end program), and the For E. coli, the BWA software was used to map reads against a reference genome (NC_000913.3) (79). For P. aeruginosa cocultures, the reads belonging to P. aeruginosa were separated from those derived from B. cenocepacia using the package BBmap (https://sourceforge.net/projects/bbmap/), and then BWA was used to map the reads against the P. aeruginosa reference genome (NC_002516.2) (79). For both E. coli and P. aeruginosa reads, Pileup data from alignments were generated with SAMtools, and variant calling was performed with VarScan (80, 81). We used a conservative threshold for SNV validation of variant frequency > 0.005, coverage > 50 reads per base, and p-value < 0.05. Probability logos of the consensus region flanking modified bases were obtained by extracting the sequence in the position −3 to +3 flanking the SNV using custom Python scripts, and the logo image was obtained by inputting the sequences in the WebLogo online tool (weblogo.berkeley.edu).
Purification of Flag-ΔUNG-DsRed
Purification of Flag-tagged, catalytically inactive Ung fused to DsRed was performed as essentially as described in (32). In brief, E. coli BL21(DE3) ung-151 carrying pET-20b::Flag-ΔUNG-DsRed was grown in LB broth to OD600 0.6 followed by the addition of 0.6 mM IPTG. The cultures were then incubated for 16 h at 18°C. Cells were pelleted and resuspended in lysis buffer (50 mM TRIS·HCl, pH = 8.0, 300 mM NaCl, 0.5 mM ethylenediaminetetraacetic acid (EDTA), 10 mM β-mercaptoethanol, 1 mM phenylmethylsulfonyl fluoride, 5 mM benzamidine) followed by lysis by sonication. Supernatant was separated from debris by centrifugation at 20,000 × g for 30 min. Supernatant was then applied to a Ni-NTA column and washed with a series of buffers increasing in salt concentration (50 mM HEPES, pH = 7.5, 30 mM KCl, 5 mM β-mercaptoethanol; 50 mM HEPES, pH = 7.5, 300 mM KCl, 5 mM β-mercaptoethanol; 50 mM HEPES, pH = 7.5, 500 mM NaCl, 40 mM Imidazole, 5 mM β-mercaptoethanol). Flag-ΔUNG-DsRed was then eluted with elution buffer (50 mM HEPES, pH = 7.5, 30 mM KCl, 300 mM imidazole, 5 mM β-mercaptoethanol). Eluted sample was further purified with fast protein liquid chromatography (FPLC) and gel filtration on a Superdex200 column (GE Healthcare) in sizing buffer (30 mM Tris·HCl, pH = 7.4, 140 mM NaCl, 0.01% Tween-20, 1 mM EDTA, 15 mM β-mercaptoethanol, 5% (w/v) glycerol).
Labeling of uracil in genomic DNA
Fluorescent labeling of uracil incorporated into genomic DNA was performed in E. coli expressing DddA or in P. aeruginosa cocultured with B. cenocepacia. For E. coli, overnight cultures of the strain carrying pSChRaB2::dddAtox or empty vector and pPSV39::dddIA were grown in overnight in LB with appropriate antibiotics and IPTG at 160 uM to induce immunity gene expression. Cultures were then diluted 1:100 into LB broth with antibiotics but without IPTG and grown until OD600 = 0.6-0.8. Toxin expression was then induced with the addition of 0.2% rhamnose for 60 minutes, following which cells were collected by centrifugation. For P. aeruginosa in coculture with B. cenocepacia, overnight cultures were concentrated to OD600 40 and mixed in a 1:10 ratio. The mixture was then spotted on 0.2µM nitrocellulose membrane on LBA (3% w/v) and incubated for 37 °C for one hour. The cells were recovered and used for staining.
Uracil labeling with Flag-ΔUNG-DsRed was performed essentially as described in (32). Cells collected from experiments described above were resuspended in Carnoy’s fixative reagent (ethanol 60%, methanol 30%, and chloroform 10%), then incubated 20 min at 4°C. Cells were rehydrated at room temperature by a series of 5 min incubations with buffer containing decreasing concentrations of ethanol (1:1 ethanol:PBS, 3:7 ethanol:PBS, then 100% PBS containing 0.05% Triton X-100 (PBST)). For permeabilization, cells were washed once with GTE buffer (50 mM glucose, 20 mM Tris, pH = 7.5 and 10 mM EDTA), then resuspended in GTE buffer containing 10 mg/ml lysozyme and incubated 5 min at room temperature. Lysozyme solution was removed by washing with PBST for 10 min followed by incubation with blocking buffer (5% BSA, in PBST) for 15 min. Genomic uracil residues were labeled by adding 5 μg/ml of purified Flag-ΔUNG-DsRed in blocking buffer and incubating for one hour at room temperature. Cells were again washed with PBST and then applied to a thin 2% agarose PBS slab on a microscopy slide for visualization. Quantification of uracil labeling was performed via Fiji (82) by calculating the mean fluorescence intensity of randomly selected ROIs (10 × 10 μm) in each field of view.
Mass spectrometry-based quantification of genomic uracil
Genomic uracil content was quantified for DNA extracted from cocultures of B. cenocepacia and P. aeruginosa, inoculated and grown as described above for bacterial competition assays, except cells were recovered after 1 hour of growth. Total genomic DNA extraction was performed using the DNeasy Blood & Tissue kit (Qiagen), and uracil quantification was performed as described previously using the excision method (83). Briefly, reactions containing 30µg of gDNA and 1U UNG in UNG buffer (20 mM Tris-HCl pH 7.5, 10 mM NaCl, 1 mM DTT, 1 mg/ml BSA) were incubated for 1 hour at 37 °C for DNA deuracilation. Uracil (1,3-15N2) was then added to the reactions as an internal control at 2 nM, the reactions were extracted with 500µL acetonitrile and centrifuged for 30 minutes at 14,000 rpm. The supernatant was then recovered, vacuum centrifuged until dried, followed by resuspension in 40µL 10% 2 mM ammonium formate, 90% acetonitrile. 10µL samples of the resuspensions were separated on a Waters Xbridge BEH amide column (2.5 μm, 130 Å, 2.1 × 150 mm) at 40C using the following gradient at 0.300 mL/min: 0-3 minutes 95% B and 5% A, 3-8 minutes 95-50% B, and 5-50% A, 8-12 minutes 50% B and 50% A, 12-13 minutes 50-95% B and 50-5% A, 13-18 minutes 95% B and 5% A. Solvent A consisted of 95% water, 3% acetonitrile, 2% methanol, 0.2% Acetic acid (v/v/v/v) 10 mM ammonium acetate, pH approximately 4.2, and solvent B consisted of 93% acetontrile, 5% water, 2% methanol, 0.2 % acetic acid and 10 mM ammonium acetate. The column was reequilibrated between samples for 18 minutes at 95% B. Samples then were analyzed on a LC-QQQ system consisting of Shimadzu Nexera XR LC 20 AD pumps coupled to a Sciex 6500+ triple quadrupole mass spectrometer with Turbo V Ion Source operating in MRM mode through the Analyst 1.6.3 software. Curtain gas was at 9, source temperature was at 425°C, ionspray voltage was −4000V, GS1 was 70 and GS2 was 80. Uracil concentrations were quantified using Multiquant 3.0 software, and the internal control was used to account for variability during extraction and sample preparation.
E. coli toxicity assays
E. coli strains carrying pSCRhaB2 expression constructs for deaminase toxins and pPSV39 expression constructs for the corresponding immunity determinants (Figure 5) were grown overnight in LB with the appropriate antibiotics and IPTG at 160 μM to induce immunity gene expression. Cultures were then diluted 1:100 into fresh medium without IPTG, incubated until OD600 = 0.6, then supplemented with 0.2% rhamnose for toxin induction. For time course toxicity assays with DddA, aliquots of cultures were recovered at 0, 20, 40, 60, 120 min after induction, plated onto LBA for cfu enumeration. For single time point assays with SsdA and EL142_RS06975, the cultures were plated on LBA at the time of toxin induction and after 60 min incubation, and proliferation was reported as final over initial cfu ratio.
Purification of SsdAtox and SsdAtox-SsdAI for biochemical assays and structure determination
For purification of his-tagged SsdAtox in complex with SsdAI, an overnight cultures of E. coli BL21 (pETDuet-1::ssdAtox + ssdAI) was used to inoculate 2L of LB broth in a 1:100 dilution. This culture was then grown to approximately OD600 = 0.6, then induced with 0.5 mM IPTG and incubated for 16 h at 18 °C with shaking. Cell pellets were collected by centrifugation at 4,000g for 30 min, followed by resuspension in 50 ml of lysis buffer (50 mM Tris-HCl pH 8.0, 500 mM NaCl, 10 mM imidazole1 mg ml−1 lysozyme, and protease inhibitor cocktail). Cell pellets were then lysed by sonication (5 pulses, 10 s each) and supernatant was separated from debris by centrifugation at 25,000g for 30 min. The his-tagged SddAtox–SddIA complex was purified from supernatant by FPLC using a HisTrap HP affinity column (GE). Proteins were eluted with a final concentration of 300 mM imidazole. The eluted SddAtox– SddIA complex was further subjected to gel filtration on a Superdex200 column (GE Healthcare) in sizing buffer (20 mM Tris-HCl pH 7.5, 200 mM NaCl). The fraction purity was evaluated by SDS-PAGE gel stained with Coomassie Brilliant Blue and the highest quality factions were stored at −80 °C.
For SsdAtox separation from SsdAI to be used in biochemical assays, the elution underwent a denaturation and renaturation process. The elution was added to 50 ml 8 M urea denaturing buffer (50 mM Tris-HCl pH 7.5, 500 mM NaCl and 1 mM DTT) and incubated for 16 h at 4 °C. This suspension was then loaded on a gravity-flow column packed with 2 ml Ni-NTA agarose beads equilibrated with denaturing buffer. The column was washed with 50 ml 8 M urea denaturing buffer to remove unbound SsdAI. On-column refolding of SsdAtox was achieved with sequential washes with 25 ml denaturing buffer with decreasing concentrations of urea (6 M, 4 M, 2 M, 1 M), and a last wash with wash buffer (50 mM Tris-HCl pH 7.5, 500 mM NaCl and 1 mM DTT) to remove remaining traces of urea. Refolded proteins bound to the column were then eluted with 5 ml elution buffer. The eluted samples were purified again by size-exclusion chromatography using an FPLC with gel filtration on a Superdex200 column (GE Healthcare) in sizing buffer (20 mM Tris-HCl pH 7.5, 200 mM NaCl, 1 mM DTT, 5% (w/v) glycerol). The fraction purity was evaluated by SDS–PAGE gel stained with Coomassie blue and the highest quality factions were stored at −80 °C.
Crystallization and structure determination
Crystals of the selenomethionine derivative of hexahistidine-tagged SsdAtox (a.a. 260-410)-SsdAI complex were obtained at 10 mg/mL in buffer containing 20 mM Tris pH 7.5, 200 mM NaCl, and 1.0 mM DTT. Crystallization was set up at room temperature by mixing the complex in this buffer 1:1 with crystallization buffer (20% (w/v) PEG 3350, 0.1 M Bis-Tris:HCl pH 7.5 and 200 mM MgCl2). Long rod crystals were obtained in 2-5 days. Selenomethionine SsdAtox-SsdAI crystals displayed the symmetry of space group I4122 (a = 93.07 Å, b = 93.07 Å, c = 383.49 Å, a = b = γ = 90°) (Supp. Table 2). Crystals were cryoprotected in a final concentration of 20% glycerol in crystallization buffer,
Diffraction datasets were collected at the BL502 beamline (ALS, Lawrence Berkeley National Laboratory). Data were indexed, integrated, and scaled using HKL-2000 (REF). The calculated Matthews coefficient is Vm= 5.17, with 76% of solvent in the crystal and one molecule in an asymmetric unit. Coordinates and structure factors for SsdA/SsdAI complex have been deposited in the Protein Data Bank (PDB) under accession code 7JTU.
DNA deamination assays
All DNA substrates were acquired from IDT, and contained a 6-FAM fluorophore for visualization. For assays shown in Figure 5I-J, substrates contained cytosine in each possible dinucleotide sequence context flanked by polyadenines. For the substrate preference assays shown in Figure 5K, substrates contained single cytosine residues, one substrate for each of the four possible upstream nucleotide contexts. The complete sequence for substrates is found in Supp. Table 3. To generate dsDNA substrates, a reverse complement oligo was annealed to the substrate containing the 6-FAM fluorophore. Reactions were performed in 10 μl of deamination buffer consisting of 20 mM Tris-HCl pH 7.4, 200 mM NaCl, 1 mM DTT, 1 μM substrate. The SsdAtox or controls APOBEC3A and DddAtox were added at the concentrations indicated in Figure 5. Reactions were incubated for 1 h at 37 °C, followed by the addition of 5 μl of UDG reaction solution (New England Biolabs, 0.02 U μl−1 UDG in 1X UDG buffer) and an additional 30 min incubation at 37 °C. Cleavage of abasic sites generated by UDG-mediated cleavage of uracil residues in substrates was induced by addition of 100 mM NaOH and incubation at 95 °C for 2 min. Reactions were analyzed by 15% acrylamide 8M urea gel electrophoresis in TBE buffer and the 6-FAM fluorophore signal was detected by fluorescence imaging with an Azure C600.
Poisoned primer extension assay for RNA deamination
The RNA substrates and the DNA oligonucleotide containing a 5′ 6-FAM fluorophore for visualization (Supp Table 3) were acquired from IDT. The RNA substrates were designed to allow the 3’ end of the DNA oligonucleotide to anneal immediately before the cytidine target for deamination. Deamination reactions were performed in deamination buffer (20 mM Tris-HCl pH 7.4, 200 mM NaCl, 1 mM DTT), with the addition of 1 μM substrate and SsdAtox, or the negative control protein DddAtox according to the concentrations shown in Supp. Figure 5. Substrate combinations and concentrations were added as indicated in Supp. Figure 5, and reactions were incubated for 1 h at 37 °C. cDNA synthesis was then performed in a 10-μl reaction (2.5 U μl−1 MultiScribe Reverse Transcriptase (Thermo Fisher), 1 μl deamination reaction, 1.5 μM oligonucleotide, 100 μM ddATP, 100 μM dCTP, 100 μM dTTP, and 100 μM dGTP) incubated at 37 °C for 1 hour. Samples were analyzed by denaturing 15% acrylamide 8 urea gel electrophoresis in TBE buffer. The synthesized cDNA fragments were detected by fluorescence imaging with an Azure Biosystems C600.
Statistics
Student’s t-test analyses were performed using GraphPad Prism version 8.0. To test for enrichment for C•G-to-T•A mutations in the context preferred by DddA in intoxicated populations (experimental condition) when compared to the pattern of mutations found to arise spontaneously in E. coli under neutral selection (control condition), a fisher’s exact test, and negative binomial regression were employed. The fisher’s exact test compared presence-absence of any C•G-to-T•A mutations in the context preferred by DddA outside of rpoB, as a binary variable for each clone in the experimental and control conditions. The negative binomial regression compared counts of C•G-to-T•A mutations in the context preferred by DddA outside of rpoB between the experimental and control conditions. The [log] overall mutation rate per condition was included as an offset in the negative binomial regression to make the conditions as comparable as possible. These analyses were run using R version 3.6.2.
Sequencing data
Sequencing data associated with this study have been deposited at the NCBI Trace and Short-Read Archive (SRA) under BioProject accession ID PRJNA659516.
Author Contributions
MdM: Conceptualization; Data curation; Software; Formal analysis; Supervision; Validation; Investigation; Visualization; Methodology; Writing - original draft; Writing - review and editing. FH: Formal analysis; Investigation; Visualization; Methodology. DH: Software; Formal analysis; Investigation; Visualization; Methodology. DB: Data curation; Formal analysis; Investigation. JZ: Investigation. MR: Software; Investigation. NS: Validation. HL: Investigation; Methodology. JF: Investigation. PW: Supervision; Methodology. S.BP: Conceptualization; Formal analysis; Supervision; Methodology; Writing - original draft; Project administration; Writing - review and editing. JM: Conceptualization; Supervision; Funding acquisition; Visualization; Methodology; Writing - original draft; Project administration; Writing - review and editing.
Acknowledgments
We thank Beata Vértessy and Reuben Harris for providing reagents, Simon Dove for critical review of the manuscript, and members of the Mougous laboratory for insightful discussions. DNA sequencing support was provided by the University of Washington Cystic Fibrosis Research Translation Center and Research Development Program Genomic Sequencing Core, which is funded by the National Institutes of Health (NIH, DK089507) and the Cystic Fibrosis Foundation (CFF, SINGH19R0). The work was supported by NIH grants GM128191 (to P.A.W.) and AI080609 (to J.D.M.). M.H.d.M. was supported by CFF Fellowship DEMORA18F0 and J.D.M. holds an Investigator in the Pathogenesis of Infectious Disease Award from the Burroughs Wellcome Fund and is an Investigator of the Howard Hughes Medical Institute.
References
- 1.↵
- 2.
- 3.
- 4.
- 5.↵
- 6.↵
- 7.
- 8.
- 9.↵
- 10.↵
- 11.
- 12.↵
- 13.↵
- 14.↵
- 15.↵
- 16.↵
- 17.↵
- 18.
- 19.↵
- 20.↵
- 21.↵
- 22.↵
- 23.↵
- 24.↵
- 25.↵
- 26.↵
- 27.↵
- 28.↵
- 29.↵
- 30.↵
- 31.↵
- 32.↵
- 33.↵
- 34.↵
- 35.↵
- 36.↵
- 37.↵
- 38.↵
- 39.↵
- 40.↵
- 41.↵
- 42.↵
- 43.↵
- 44.↵
- 45.↵
- 46.↵
- 47.
- 48.↵
- 49.↵
- 50.↵
- 51.↵
- 52.↵
- 53.↵
- 54.↵
- 55.↵
- 56.↵
- 57.
- 58.↵
- 59.↵
- 60.↵
- 61.↵
- 62.↵
- 63.↵
- 64.↵
- 65.↵
- 66.↵
- 67.↵
- 68.↵
- 69.↵
- 70.↵
- 71.↵
- 72.↵
- 73.↵
- 74.↵
- 75.↵
- 76.↵
- 77.↵
- 78.↵
- 79.↵
- 80.↵
- 81.↵
- 82.↵
- 83.↵
- 84.
- 85.
- 86.
- 87.
- 88.
- 89.
- 90.
- 91.
- 92.
- 93.
- 94.
- 95.
- 96.