Abstract
The tick-borne apicomplexan parasite, Babesia bovis, a highly persistent bovine pathogen, expresses VESA1 proteins on the infected erythrocyte surface to mediate cytoadhesion. The cytoadhesion ligand, VESA1, which protects the parasite from splenic passage, is itself protected from a host immune response by rapid antigenic variation. B. bovis relies upon segmental gene conversion (SGC) as a major mechanism to vary VESA1 structure. Gene conversion has been considered a form of homologous recombination (HR), a process for which Rad51 proteins are considered pivotal components. This makes BbRad51 a choice target for development of inhibitors that could both interfere with parasite genome integrity and disrupt HR-dependent antigenic variation. Previously, we knocked out the Bbrad51 gene from the B. bovis haploid genome, resulting in a phenotype of sensitivity to methylmethane sulfonate (MMS) and apparent loss of HR-dependent integration of exogenous DNA. In a further characterization of BbRad51, we demonstrate here a failure to upregulate the Bbrad51 gene in response to DNA damage. Moreover, we demonstrate that ΔBbrad51 parasites are not more sensitive than wild-type to DNA damage induced by γ-irradiation, and repair their genome with similar kinetics. To assess the need for BbRad51 in SGC, RT-PCR was used to observe alterations to a highly variant region of ves1α transcripts over time. Mapping of these amplicons to the genome revealed a significant reduction of in situ transcriptional switching (isTS) among ves loci, but not cessation. By combining existing pipelines for analysis of the amplicons, we demonstrate that SGC continues unabated in ΔBbrad51 parasites, albeit at an overall reduced rate, and a reduction in SGC tract lengths was observed. By contrast, no differences were observed in the lengths of homologous sequences at which recombination occurred. These results indicate that, whereas BbRad51 is not essential to babesial antigenic variation, it influences epigenetic control of ves loci, and its absence significantly reduces successful variation. These results necessitate a reconsideration of the likely enzymatic mechanism(s) underlying SGC and suggest the existence of additional targets for development of small molecule inhibitors.
Author summary B. bovis establishes highly persistent infections in cattle, in part by using cytoadhesion to avoid passage through the spleen. While protective, a host antibody response targeting the cytoadhesion ligand is quickly rendered ineffective by antigenic variation. In B. bovis, antigenic variation relies heavily upon segmental gene conversion (SGC), presumed to be a form of homologous recombination (HR), to generate variants. As Rad51 is generally considered essential to HR, we investigated its contribution to SGC. While diminishing the parasite’s capacity for HR-dependent integration of exogenous DNA, the loss of BbRad51 did not affect the parasite’s sensitivity to ionizing radiation, overall genome stability, or competence for SGC. Instead, loss of BbRad51 diminished the extent of in situ transcriptional switching (isTS) among ves gene loci, the accumulation of SGC recombinants, and the mean lengths of SGC sequence tracts. Given the overall reductions in VESA1 variability, compromise of the parasite’s capacity for in vivo persistence is predicted.
Introduction
The apicomplexan parasite, Babesia bovis, is a quick-change artist with the ability to rapidly alter proteins it expresses on the infected erythrocyte surface. This ability is needed because, during asexual reproduction B. bovis cytoadheres to the capillary and post-capillary venous endothelium within its bovine mammalian host [1–3]. It is thought that this behavior allows this microaerophilic parasite to avoid splenic clearance and to complete asexual development under hypoxic conditions, analogous to the human malarial parasite, Plasmodium falciparum [4]. Cytoadhesion is mediated by heterodimeric variant erythrocyte surface antigen-1 (VESA1) proteins, which are exported by the parasites to the erythrocyte where they integrate into the erythrocyte membrane [5, 6]. Cytoadhesion is compromised by a host antibody response targeting VESA1, preventing and potentially reversing cytoadhesion [6]. However, B. bovis has evolved the ability to rapidly vary the structure and antigenicity of VESA1 polypeptides, abrogating recognition by existing antibodies [7–10]. Antigenic variation in B. bovis involves the ves multigene family encoding VESA1a and 1b polypeptides [9, 11, 12], and possibly the smorf multigene family [13, 14]. B. bovis intraerythrocytic stages reproduce asexually, with a haploid genome of only 8-8.5 Mbp [13]. Despite its small genome size, approximately 135 genes comprise the B. bovis ves multigene family, amounting to approximately 4.7% of all coding sequences [10, 13]. Transcription of ves genes is monoparalogous, arising from a single ves locus at any one time (but typically involves transcription of both a ves1α and ves1β gene from the same locus to encode both subunits), whereas the remainder of the family remains transcriptionally inactive [15]. In situ transcriptional switching (isTS) from one ves locus to another over time has been implicated in B. bovis antigenic variation [10, 15, 16], although segmental gene conversion (SGC) is the only mechanism of variation critically demonstrated in this parasite to date [12]. Progressive replacement of short sequence patches within the actively transcribed ves genes by SGC, yields ves genes (and VESA1 polypeptides) that are mosaics comprised of sequences from many ves loci [11, 12]. The short lengths of the SGC conversion tracts, ability to acquire sequences from any chromosome, and involvement of two similarly variant subunits may enable this gene family to provide practically unlimited diversity in epitope structure [5, 7, 9, 11, 12].
Canonical gene conversion is a form of homologous recombination (HR)-mediated DNA repair. In this process a damaged sequence is repaired by incorporating duplicated homologous sequences from an undamaged allele or paralog to replace the damaged sequences. At least three models have been proposed to explain this process (reviewed in [17]). Common to all models is the assembly of a repair complex at the site of damage. Activities of the repair complex include 5’ to 3’ resection of the broken ends of the damaged molecule, and stabilization of the single-stranded 3’ ends by assembly of RPA ssDNA-binding proteins onto the strands (reviewed in [17, 18]). Rad52 may stabilize the RPA and maintain spatial proximity of the broken ends [19]. The Rad52-RPA complexes then are replaced by Rad51 protein, which forms helical filaments on the ssDNA strands. The Rad51-ssDNA complexes, together with Rad54, mediate both a search for homologous sequences elsewhere in the genome and strand invasion when such sequences are found [20–22]. Once found, single-stranded invasion of the identified sequence allows sequence acquisition by extension of the 3’ end of the invading strand. Depending upon the model, the inter-chromosomal entanglement involves either one or two Holliday junction structures that are resolved to yield two independent chromosomes again. Sequence acquisition may occur with or without crossover, depending upon how the junction structures are resolved [23]. Like canonical gene conversion, we hypothesized SGC to occur via HR, a possibility consistent with the stretches of homologous sequence flanking SGC tracts that are shared between donor and recipient. However, other factors call this into question. For example, the reasons for consistently short conversion tracts are unknown, and crossover appears to be a rare outcome in B. bovis [10]. These traits suggest that minimal end resection, acquisition of only short tracts of differing sequence, and rapid resolution of intermolecular junction structures all may define this process mechanistically. Among the existing models of HR, those most consistent with these traits are synthesis-dependent strand annealing, either with rapid disentanglement of the invading strand(s), or double-strand break repair but with convergent branch migration and junction dissolution. By contrast, resolution of distal Holliday junctions would be expected to result in frequent crossover events [17]. Alternatively, a more exotic explanation may hold, such as the involvement of a template-switching repair polymerase, but no direct evidence yet supports this possibility.
Rad51 is considered essential to HR and the gene conversion process in eukaryotes. Organisms with defective Rad51 consistently suffer reduced viability and enhanced sensitivity to environmental insult [24–29]. For example, knockout of the Rad51 gene in mice resulted in embryonic lethality [30], whereas loss of the Tbrad51 gene in the kinetoplastid parasite, Trypanosoma brucei, yielded parasites that were compromised in growth and hypersensitive to methyl methanesulfonate (MMS) [28]. Interestingly, Tbrad51 knockout parasites continued to undergo variation of their variant surface glycoprotein (VSG) genes, both by isTS and gene conversion mechanisms, but the rate at which variation occurred was slowed dramatically. This observation suggested a role for TbRad51 in facilitating or regulating trypanosomal antigenic variation, but not an essential role in catalysis. Moreover, TbRad51-independent mechanisms may act in trypanosomal antigenic variation. Recently, we knocked out the Bbrad51 gene of B. bovis. Unlike higher eukaryotes, there was no apparent effect on parasite viability or growth. However, parasites were made hypersensitive to MMS and failed to integrate exogenous DNA, suggesting defects in HR [31]. Given the importance of SGC to B. bovis antigenic variation and survival, and of Rad51 to gene conversion and HR, we investigated further the interplay between DNA repair and antigenic variation. We provide evidence that overall DNA repair remains highly robust in the absence of BbRad51. SGC also continues, albeit at an overall reduced rate, concomitant with a significant reduction in isTS. We hypothesize that these results reflect unique roles for BbRad51 in antigenic variation, and suggest that alternative enzymes catalyze recombination during SGC.
Results
Bbrad51 knockout parasites behave similarly to wild-type in response to DNA damage
Previously, we knocked out the Bbrad51 gene, resulting in a parasite phenotype of MMS-sensitivity and apparent loss of the ability to integrate exogenous DNA by HR, but not significant difference in rates of growth [31]. Cells that suffer environmental insult may up-regulate DNA repair proteins, including Rad51 [32]. To test this in B. bovis we needed to create a reporter parasite line with which we could easily quantify BbRad51 protein levels. This was done by integrating the linearized plasmid, pBRNHLg, into the genome by homologous double-crossover, resulting in Bbrad51 coding sequences fused in-frame with NanoLuciferase and dual-HA tag sequences. Immediately following the stop codon, the E. coli glmS ribozyme element, a self-cleaving autoregulatory sequence controlling glucosamine-6-phosphate synthase expression [33], was also incorporated. Upon double crossover integration, a gfp-bsd selectable marker cassette also becomes incorporated downstream of the gene in this approach [34]. This combination allowed us to conveniently measure levels of BbRad51 protein via the activity of the NanoLuciferase fusion partner, and to inducibly knock down expression via the glmS ribozyme element by the addition of the D-glucosamine cofactor [33], an approach used previously in Plasmodium [35]. As a negative control for induction, samples of B. bovis CE11 parasites were transiently transfected with the plasmid pHggb, in which NanoLuciferase-2xHA expression is driven by the Babesia divergens EF1α-B promoter. This promoter was chosen because it would be unlikely to respond to upregulation of DNA repair proteins. Structures of these plasmids are shown in S1 Figure, and sequences are provided in S2_File and S3_File. Reporter parasites were exposed to 250 μM MMS, a level that reproducibly inhibits growth by approximately 10% after 48 hours in wild-type B. bovis CE11 parasites [31], presumably sufficient to induce Bbrad51 expression in response but low enough not to significantly reduce viability. Under these conditions, no evidence of an alteration in BbRad51 expression was observed after either 3 hours (Figure 1A) or 16 hours of post-treatment recovery (Figure 1B). Treatment with glcN resulted in significant knockdown of NanoLuciferase levels, on the order of 80-95%. No upregulation of BbRad51 was observed if knockdown of existing BbRad51 via the glmS element was performed. This was done to allow more sensitive detection in the case of post-translational stabilization, or to prevent any possible negative feedback (Figure 1). This result indicates that BbRad51 levels are not regulated in response to repair of alkylated DNA following acute, sublethal exposure to MMS.
A second measure of phenotype commonly used in studies of DNA repair is sensitivity to γ-irradiation. Sensitivity of the knockout parasites to DNA damage was assessed by exposure to γ-irradiation provided by a Cs137 source. Wild type CE11 parasites first were titrated for sensitivity, over a range from 10 - 1230 gray (Gy). A dosage of 100 Gy resulted in approximately 80-90% lethality by 24 hours, but allowed some parasite survival at 48 hours and beyond, whereas at 200 Gy no parasites were observed to survive 48 hours post-irradiation (S2 Figure). Radiation sensitivities of three independently-derived Bbrad51 knockout lines (all on a CE11 genetic background) therefore was compared with CE11 wild-type parasites over a range from 0-200 Gy. There were no reproducible differences among any of the three knockout lines and CE11 wild type parasites in growth assays (Figure 2).
Because no differential survival phenotype was apparent following DNA damage by γ-irradiation, we asked whether there were any detectable differences in overall rates of DNA repair. To assess this, pulsed-field gel electrophoresis (PFGE) was used to monitor the disintegration of chromosomes and their subsequent reassembly. Parasites were subjected to 100 Gy γ-irradiation, allowed to recover for up to 24 hours, and then were processed for PFGE analysis. In growth experiments, 100 Gy had resulted in killing of 80-90% of the parasites. Consistent with this level of killing, 100 Gy γ-irradiation severely damaged B. bovis chromosomes, virtually eliminating full-length chromosomes 3 and 4, and greatly diminishing the proportion of intact chromosomes 1 and 2. Remarkably, by 24 hours post-irradiation both wild-type and knockout parasites had reassembled their genomes into full-length chromosomes of apparently normal size, recovering approximately 50% of non-irradiated control values (Figure 3). Given the 10-20% viability of parasites receiving this dosage (S2 Figure) it is likely that a large proportion is not viable in the longer-term, but remains metabolically active long enough to reassemble chromosomes. These data, when considered together, demonstrate that the loss of BbRad51 has little, if any, effect on the extensive DNA repair required to recover from such damage and suggest that BbRad51-dependent HR plays little role in this type of repair.
Bbrad51 knockout did not prevent in situ transcriptional switching
Although overall DNA repair following damage from ionizing radiation was not measurably impaired by the loss of BbRad51, sensitivity to MMS and loss of ability to integrate selectable plasmids via long sequence tracts suggested that BbRad51 plays a role in aspects of HR [31], and perhaps in repair of stalled replication forks [36]. Previously, it was demonstrated that SGC is a major mechanism of antigenic variation in B. bovis [12]. Given the seemingly conflicting outcomes obtained with MMS and ionizing radiation, we wished to determine whether BbRad51 plays any role in SGC. In order to assess the nature of any changes occurring in transcribed ves1α genes of wild type and Bbrad51 knockout parasites, we adapted a previously published assay in which the highly variant cysteine-lysine-rich domain (CKRD) region of ves1α transcripts (Figure 4A) is amplified by RT-PCR, and the amplicons undergo deep sequencing [15]. Three immediate subclones of CE11 wild-type and three independent knockout clonal lines were studied; their origin is described in [31]. Total RNAs were collected from each clonal line at one month and five months post-cloning. These two timepoints were used to observe for increases in the numbers of unique recombinants over time, and for some analyses were pooled to minimize the loss of unique variants from the population over time. The forward primer, vesUniF2, was selected because it represents a sequence almost universally conserved among ves1α genes, and in combination with the highly-conserved primer PD1R was anticipated to generate amplicons of approximately 340-460 bp. By constructing bar-coded paired-end libraries from the amplicons and generating 250 bp reads, reads could be merged with high-confidence overlaps of 55 - 185 bp. Merged, full-length sequences were obtained from 69.8 - 84.5% of amplicon reads. Following the removal of ambiguous and low-quality reads, adaptor and primer sequences, and sequences found likely to be PCR chimeras, individual libraries ranged from a minimum of 853,688 to a maximum of 1,738,649 merged reads. Mean merged read lengths ranged from 326.2 ± 60.3 to 334.7 ± 52.9 bp. To determine their probable loci of origin, reads were mapped (using non-global settings) onto the B. bovis C9.1 line genomic sequence (available at Wellcome Trust Sanger Institute; ftp://ftp.sanger.ac.uk/pub/pathogens/Babesia/). The C9.1 line genome was used because we do not currently have a high quality genome for the CE11 line. However, as these are closely-related clonal sibling lines [12] this allowed us to easily identify the probable locus of origin for nearly all reads. In all six knockout and wild-type lines, the earlier time-point ves1α transcripts mapped predominantly, sometimes almost solely, to a single locus (Figure 4B), consistent with prior observations of monoparalogous ves gene transcription in the C9.1 clonal line [15]. At the latter time-point, lines ko1H5 and CE11B8 continued to transcribe almost solely from the original locus, and all lines still transcribed most heavily from the original locus. Minor but significant subpopulations were detected in all lines that had switched to transcription from alternative loci. Without immune pressure there is no obvious selection for parasites expressing specific VESA1a isoforms to predominate. Regardless, detectable transcription occurred from more alternative loci in wild type CE11 subclones B8, C2, and C5 (ranging from 28.7 ± 4.0 at 1 month to 33.3 ±1.5 loci at 5 months) than were observed for the three knockout lines, which ranged from 22.0 ± 3.5 at 1 month to 23.3 ± 1.5 at 5 months (Figure 4C). While transcribing most heavily from a single locus, the CE11 C5 subclone (as a population) also transcribed significantly from several alternative loci at the early time-point, but by the 5-month time-point transcription levels had been reduced from all but the single, major locus. Interestingly, the same alternative loci seemed to dominate as sites to which switching occurred (S3 Figure), suggesting a hierarchy in locus transcription. However, nothing can be inferred from these data regarding an order in switching like that documented for P. falciparum var genes [37].
Bbrad51 knockout failed to prevent segmental gene conversion
The ideal situation for accurate identification and characterization of SGC tracts would be to directly map amplicon sequences against a reference genome. Although the CE11 and C9.1 lines are closely related sibling clonal progeny of the MO7 clonal line, they have quite different histories [12]. Given the nature and rapidity of SGC many loci would have been extensively modified. The ideal would be to map to a high quality genome from each subclone, so that unique variants in each line would be known, but this was not feasible. As a more feasible alternative, we chose to observe for recombination among loci represented by the ves1α transcript amplicons, considering only sequences unique to a given line, on a line by line basis. By taking this approach, we could assess for recombination among sequences that were definitively present in each line at the time the experiment was performed. Using an RT-PCR strategy employed previously to characterize variation in ves1α transcripts [15], we identified all unique transcribed sequences by cluster analysis, then identified representative reads for each cluster. Among those, we then identified sequences for which there was very strong statistical support for them being the result of a true recombination event between two other unique sequences, based upon a consensus of 4 out of 7 statistical analyses (see Methods for details). For comparative analysis, we included only those recombinant sequences found in a single clonal line, on the assumption that any sequences found in more than one line was present prior to the act of parasite cloning and not a result of post-cloning recombination. Information on all identified SGC tracts are provided in S1_File.zip. We propose that most of the unique sequences were in fact recombinant, as a plot of all unique sequences against all statistically-supported recombinant sequences (both normalized per million reads) a regression of R2 = 0.9228 was obtained. Thus, the use of a statistical consensus approach was highly conservative and likely underestimates the true number of recombinant sequences (S4 Figure), but allows for rigorous comparative results.
Variants with strong statistical support as true recombinant SGC tracts were observed among the transcripts of all six clonal lines, but clear distinctions are seen between knockout and wild type. The mean lengths of conversion tracts differed between groups, decreasing from means of 109.01 ± 39.39 in wild type parasites to 91.77 ± 40.78 in knockouts (p <0.001; Figure 5). The distributions of SGC tract lengths, plotted as cumulative proportions of all SGC tracts from that population, resulted in wild type and knockout medians of 107.0 and 81.0 (S5 Figure; p <0.001). Interestingly, the difference between the two populations arose primarily at tract lengths <150 bp. The two populations were not distinct above that length, suggesting the possibility that more than one mechanism gave rise to the SGC tracts. The number of unique SGC tracts arising per active ves locus was not significantly different, ranging from 0.35-1.20 SGC tracts per locus (S6 Figure; p = 0.22 among all group comparisons). Although there was a rise in the frequency of recombinants per locus in CE11 wt parasites over time this was not statistically significant (p = 0.18). However, given the differences in the numbers of transcriptionally active ves loci in wild type and knockout parasites, this led to significantly larger total numbers of unique SGC tracts per million reads among members of the wild type population (Figure 6; at 5 months, p <0.02). Thus, while the frequency of SGC alterations that may be observed at any given transcriptionally active ves locus is approximately constant, the numbers of ves loci that are activated is significantly higher in the presence of BbRad51. The number of unique SGC tracts per million reads may be considered a surrogate measure of overall levels of variability in antigen structure presented by the population of parasites. Taken together, these data demonstrate clearly that neither SGC nor apparent isTS to alternative loci is abrogated by knockout of the Bbrad51 gene, although statistically significant quantitative effects on the lengths of SGC tracts and on the frequency and extent of ves locus switching were observed.
Involvement of homologous sequences flanking SGC breakpoints
We wished to assess whether there is anything shared at SGC breakpoint sites, or unique where SGC tracts differed from the active locus of ves transcription (LAT). To do this, alignments were made of each pair of sequences identified as having given rise to a unique recombinant, and the recombinant sequence itself. Regions of homology between all three sequences were then identified manually that represented a region in which transition occurred from one parent locus providing the sequence to the other (S7 Figure).. No significant differences were observed among wild type or Bbrad51 knockout genotype parasites in the lengths of homology patches (Figure 7). While this is clearly not an exhaustive analysis of all SGC tracts, the results of this subsampling indicate that there is no apparent difference between wild type and Bbrad51 knockout parasites with regard to the lengths of the homologous patches possibly involved in recombination. Importantly, for SGC to occur patches of homology between donor and recipient may not be required, as patches of as little as 2 bp were observed. In a few instances no transition region was present, and in still others there was a brief patch in which no match existed between the three sequences at the site of transition, suggesting a sometimes chaotic process (S7 Figure).
Discussion
SGC is a major mechanism of antigenic variation in B. bovis, and to date the only one that has been demonstrated critically [12]. This phenomenon, in which short DNA patches are duplicated from a donor to a recipient gene, typically occurs without modification of the donor and at least superficially resembles HR-mediated DNA repair. DNA repair in apicomplexan and other protozoal parasites overall is not well understood [38, 39]. It is even difficult to predict from the parasites’ proteomes which repair pathways may be active. Orthologs are recognizable for only a fraction of the proteins known to be important in higher eukaryotes, and many orthologs are simply not present. For example, the proteins RPA, Rad51, Rad52, Rad54, and ATM are considered key participants in DNA repair pathways, including HR and gene conversion. Yet, in B. bovis orthologs may be identified only for RPA, Rad51, and Rad54, and many similar examples of “missing” proteins hold [13, 31], suggesting the merging of functions. In this study, we wished to understand the contributions of BbRad51 to SGC because proteins of this family are considered essential to HR and gene conversion in other systems, including other apicomplexans [29, 40]. Our prior identification of BbRad51 as the true Rad51 ortholog was based upon several criteria, including sequence and structural similarities to established Rad51 proteins, a greatly reduced or abrogated ability to achieve HR-dependent integration of exogenous sequences in the absence of BbRad51, and enhanced sensitivity to MMS in knockouts that could be complemented by Bbrad51 coding sequences [31].
In contrast with our prior work, we present evidence herein that the absence of BbRad51 does not influence the parasite’s survival or extent and rate of general repair of dsDNA breaks engendered by acute exposure to γ-irradiation, and that BbRad51 expression fails to up-regulate in response to MMS-induced damage. The apparent insignificance of BbRad51 to repair and survival of IR-induced DSBs may be attributable to the haploid nature of the asexual stages studied here. A large proportion of IR-caused DSBs would occur in unique regions of the genome, with no intact second copy of the damaged sequence available to support true gene conversion, except briefly during mitosis. Thus, in the absence of available sequence donors for HR, BbRad51 may be largely superfluous to surviving heavy dosages of IR, yielding similar ionizing radiation survival outcomes in wild-type and knockout parasites. B. bovis may depend instead upon error-prone end-joining reactions for survival of such significant damage. Unlike most eukaryotes, including Toxoplasma [41], B. bovis lacks genes for key players in canonical non-homologous end-joining repair, such as Ku70/80 and DNA ligase 4 ([13]; this study). This parasite instead may depend upon a synthesis-dependent microhomology-mediated end-joining mechanism like that demonstrated in P. falciparum [42], but this remains to be determined. In contrast with ionizing radiation, the absence of BbRad51 does render B. bovis sensitive to alkylation damage caused by acute exposure to MMS [31]. In diploid organisms recovery from either type of insult is typically compromised by loss of Rad51 [43]. However, in diploids such repair makes frequent use of the second allele for repair through gene conversion. Unlike IR, MMS does not directly cause DSBs, but rather alkylates adenosine and guanidine bases which must be removed and replaced [36]. DSBs still may result when abasic sites or single-strand breaks created during base excision repair of the methylated bases stall or cause the collapse of mitotic replication forks [44]. In diploid cells HR is the major mechanism used in repairing such DSBs during replication, and in at least one mechanism makes use of Rad51 [43, 45]. In TbRad51-intact T. brucei (a diploid parasite), the repair of DSBs created within VSG bloodstream-stage expression sites resulted in a massive increase in the rate of VSG switching through gene conversion, presumably due to the ready availability of related sequences [46]. This also may be accomplished in haploid organisms, if appropriate sequences have already been replicated on the opposite strand, but would limit the timing of gene conversion only to S phase, prior to separation of sister chromatids.
The distribution and structure of ves genes might suggest that they have evolved for efficient application of the SGC mechanism. In most microorganisms that undergo antigenic variation the variant multigene families involved typically are arranged in large, subtelomeric clusters [47]. By contrast, ves genes are found in numerous small clusters scattered throughout the genome, and often are interspersed with smorf genes [10, 13]. Despite extremely high overall variability, ves genes possess periodic tracts of highly conserved sequence, and ves genes on the same or different chromosomes may provide targets for strand invasion via such conserved tracts [12]. A small-scale chromatin conformation capture (3C) query of sequences in proximity to the transcribed ves genes of the LAT suggested close proximity primarily to other ves genes [16]. In that organization, the ability to find short patches of homologous sequence in which to initiate strand invasion might occur via those conserved patches. However, when the sequences flanking the breakpoint sites were compared, patches of homologous sequence unrelated to the highly conserved sequence regions were observed at the SGC tract breakpoints between the LAT and donor sequences. These stretches of homologous sequence ranged from 2 bp to 35 bp in length (Figure 7), and in some sequences no identifiable homologous sequences could be identified. These data suggest that, for SGC to proceed only local microhomology is required, and perhaps no homology. While these data do not specifically identify the process responsible they are consistent with SGC not relying upon classical HR mechanisms. The gene family- and subfamily-specific conserved tracts found in most ves genes instead likely serve other functions, either in the chromatin or in the VESA1 polypeptides they encode. Interestingly, very similar patterns were reported in P. falciparum of recombination between var2CSA genes associated with adhesion of infected erythrocytes to chondroitin sulfate in placental malaria [48].
When initiating this study we hypothesized that, as Rad51 proteins are considered essential to HR and SGC is thought to be a form of HR, BbRad51 should be essential to this process and its loss should abrogate SGC. Our results disprove that hypothesis. The analysis of sequences surrounding SGC breakpoints supports error-free recombination via homologous sequences in most, but not all, instances. However, unlike Rad51-mediated HR where tracts of homologous sequence at least 8 bp long are needed for synapse formation [49, 50], only local sequence microhomology of at least 2 bp, or perhaps no homology, appears to be needed (Figure 7). It is possible that this is an artifactual result arising from misidentification of recombining sequences during analysis. Alternatively, this may be the actual biological result, due to mismatch repair or deletion subsequent to sequence acquisition or replication. Whereas these results reflect the mechanism(s) responsible, they do not identify the cause. This result is consistent with loss of the ability to incorporate exogenous plasmid sequences into the genome via HR in the absence of BbRad51 [31], whereas only the rate and product lengths of SGC are affected. The reason for the apparent shortening of SGC tracts observed in knockouts is not clear. One possibility is that it may reflect the loss of some recombinants where longer unique tracts were involved, which might require BbRad51 for successful disengagement of the invading and donor strands, repair of breaks created during the disengagement process, or stabilization of a longer crossover region during exchange. Regardless of any secondary effects, BbRad51 is clearly not essential to the SGC process.
The overall reduction in isTS observed among Bbrad51 knockouts, based upon the numbers of alternative loci to which transcripts mapped in each line, is particularly intriguing. At the 5-month time point, in the knockout lines transcripts appear to have arisen from a mean of only 23.3 ves loci, whereas in wild-type parasites a mean of 33.3 loci had been activated within the population. Although statistically significant (p= 0.014), the biological significance of this difference is not as clear. The reason is because these data are derived from ves1α genes only. Also, the maximal number of ves loci that are competent to be activated is not known and the ability to modify a single locus is extensive. About half of the ves family is organized in divergent (head-to-head) pairs (approximately 33 pairs) of ves1α/ves1β or ves1α/ves1α genes that flank quasi-palindromic, bidirectional promoter regions [12, 16]. The remainder are present as individual ves genes with potentially unidirectional promoters. Among the ves loci putatively activated in this study, the ratio of loci with divergent/ unidirectional promoters ranged from 1.07 - 1.60 (mean 1.28 ± 0.24) in CE11, and 1.25 - 2.00 (mean 1.61 ± 0.29) in knockouts. Although wild type parasites, on average, appeared to activate a higher proportion of unidirectional promoters than did knockouts, this did not reach statistical significance (p = 0.059; S1 Table). The functionality of several bidirectional ves promoters has been demonstrated experimentally [16], but function has not yet been similarly tested for promoters preceding individual ves genes. This result clearly indicates that a significant proportion of ves loci have the potential to be activated, including individual ves genes, consistent with a study on the transcriptomes of pairs of virulent B. bovis lines and attenuated lines derived from them. In that study, virulent lines transcribed from a significantly wider variety of ves loci that included both divergent and non-divergent loci [51]. Interestingly, among attenuated parasites ves transcription was upregulated only from loci that are not divergently-oriented. The difference in ves transcriptional behavior observed here and in the attenuation study suggests that in vivo attenuation of B. bovis is unlikely to be related to BbRad51 expression or function. This conclusion is supported by the unperturbed Bbrad51 transcription observed in attenuated parasites [51].
The basis for a connection between the SGC and isTS mechanisms is not clear. Indirect evidence allows us to propose at least three feasible explanations, each with varying levels of support. (i) First, of the various ves loci represented among the transcripts, a subset of the sequences may reflect the complete replacement of the observed region within the original locus of active ves transcription (LAT) by much longer conversion tracts, rather than by isTS. In this case, such a long replacement sequence would cause the read to map artifactually to the donor locus rather than to the locus from which the full ves gene was actually being transcribed. Given the inability to integrate exogenous DNAs into the genome of Bbrad51 knockout parasites [31] it is anticipated that this would be a rare event in knockouts. From our data, we cannot rule out this possibility. (ii) The sites failing to activate in Bbrad51 knockouts may be unusually sensitive to recombination, with most such events leading to lethality. However, comparison of all the putatively activated ves loci in wild-type and knockout parasites reveals that there is essentially complete overlap in the ves gene clusters that can be activated (S3 Figure), directly arguing against this explanation. Rather, the knockouts appear to achieve comparable inter-locus switches, but with a lower frequency than wild-type. (iii) A mechanistically distinct possibility is that Bbrad51 knockouts have a diminished capacity to activate ves loci epigenetically. As a part of DNA repair, chromatin first must be remodeled to make it accessible to the repair machinery. This occurs in part by local chromatin decondensation through histone acetylation [52–54]. Rad51 has been proposed to assist in the assembly of the histone acetylation machinery during repair of dsDNA breaks and stalled replication forks [55]. In the presence of BbRad51, double-stranded breaks in silenced ves loci may be successfully acetylated and repaired, but may not always be remodeled again for silencing. With transcription of a single ves locus being the default state, a choice would have to be made between the existing LAT (the single active locus of ves transcription [15]) and the newly repaired/acetylated ves locus. Silencing of the existing LAT would lead to isTS and establishment of a new LAT. Thus, BbRad51 may be epigenetically effecting isTS of ves genes as an unintended side-effect of the DNA repair process. If true, then in Bbrad51 knockouts the absence of BbRad51 would be anticipated to result in frequent failure to assemble the full repertoire of repair machinery, including epigenetic modifiers. Accordingly, isTS would be a less common event. While explanation (i) is consistent with well-established Rad51 protein functions, possibility (iii) is neither implausible nor inconsistent with less well-characterized functions, and even could provide a potential mechanism for the stochastic switch events of isTS. If substantiated experimentally, this could provide a direct link between DNA repair and antigenic variation via isTS as well as recombination. Our currently available evidence does not distinguish these two possibilities, and this question warrants further investigation.
The limitations of our study include the lack of genomic data for our culture populations. High coverage whole genome sequencing, and good quality assemblies might improve the assessment of SGC and recombination. Still, our results demonstrate that BbRad51 is not necessary for survival of asexual B. bovis in vitro or for overall genome stability in the absence of environmental insult. Moreover, this protein is dispensable to SGC-based antigenic variation in B. bovis, although it influences the rates of SGC antigenic variation and isTS. It is not clear whether its absence would be similarly benign during in vivo infection, where there is strong selection by host immune responses. Evolutionary retention of BbRad51 and its involvement in recovery from alkylation damage indicates clearly that it plays some role(s) in parasite DNA repair, including in asexual developmental stages. The clear implication of this work is that some component besides BbRad51 provides for the recombination observed in SGC. Whether this is from a more distantly related member of the RecA/RadA/Rad51 superfamily proteins encoded by the B. bovis genome, or another enzyme class altogether, awaits experimental evidence.
Materials and Methods
Parasite culture, transfection, and selection
This project used the clonal B. bovis CE11 parasite line as starting material [6]. In vitro parasite cultures were maintained as microaerophilous stationary phase cultures under 90% N2/ 5% O2/ 5% CO2 (v/v), essentially as described [5, 56]. Cloning of parasites was conducted by two sequential rounds of limiting dilution cloning as described previously [7]. Parasites were transfected with DNAs purified from E. coli DH5α, using EndoFree Plasmid Maxi kits (Qiagen; Valencia, CA). Both parasitized erythrocytes and DNAs were suspended in cytomix (120 mM KCl, 0.15 mM CaCl2, 10 mM K2HPO4 /KH2PO4 pH 7.6, 25 mM HEPES pH 7.6, 2 mM EGTA, 5 mM MgCl2, pH 7.6) [57] prior to electroporation.
Electroporation was performed in 2 mm-gap cuvettes, using 5 pmol linearized DNA at 1.25 kV, 25 μF, and 200 Ω, as described by Wang et al. [16], with plating at 1.25% packed cell volume. After 24h recovery in culture, selection was initiated by addition of blasticidin-s hydrochloride (TOKU-E; Bellingham, WA) to a final concentration of 15 μg ml−1 (32.7 μM) [58]. Every three days medium was removed, and replaced with a 2.5% packed cell volume of uninfected erythrocytes in medium plus blasticidin-s. Once viable parasites emerged, usually after approximately two weeks, they were maintained under drug selection and were immediately cloned.
Validation of Bbrad51 knock-out
Validation of Bbrad51 gene knock-out was performed by diagnostic PCR, Southern blotting, RT-PCR, and sequencing of the Bbrad51 locus. These data are presented in [31].
Phenotypic analyses
Parasite growth assays
Parasite growth was assayed by counting Giemsa-stained smears, with samples collected at 0, 24, and 48h growth (approximately 0, 3 and 6 cell cycles [59]). Alternatively, in some experiments a DNA-based SYBR Green I method was performed, essentially as described [60, 61], on parasites grown in bovine erythrocytes depleted of leukocytes [62]. For experiments involving γ-irradiation, parasites were exposed to a calibrated [Cs137] source (Gammacell GC-10 gamma irradiator), on ice. Control cells were maintained on ice for the duration of the treatment time. Samples were immediately diluted into fresh medium containing 10% packed cell volume uninfected erythrocytes, and placed into culture. For experiments involving MMS exposure, parasites were exposed to MMS (diluted in complete medium) for 90 minutes, followed by washout as described [31].
Creation of a Bbrad51-NanoLuciferase-tagged parasite line
The Bbrad51 locus from nucleotide 1,858,332-1,860,393 of the B. bovis C9.1 line genome, already fused with 2x HA tag and a glmS element in plasmid pBbRad51HAglmS, was opened by inverse PCR with primers DA350 and DA351 [63]. NanoLuciferase coding sequences were amplified from plasmid p2xHAglmS-gfp-bsd (pHggb) with primers DA348 and DA349, and inserted into the opened plasmid with NEBuilder reagents (New England Biolabs; Berverley, MA) to create plasmid pBbRad51NLHAglmS (pBRNLHg; S2_File). This plasmid was linearized and transfected into B. bovis CE11 parasites [16]. Transfectants were selected for resistance to blasticidin-S. Recovered transformants had experienced fusion of the cassette to the 3’ end of Bbrad51 coding sequences by double crossover homologous recombination, as described [31].
Chromosome reassembly
Parasites grown in leukocyte-depleted erythrocytes as described above. Cultures, at 2.5% parasitized erythrocytes, were given 100 Gy exposure to [137Cs] on ice to fragment chromosomes. Irradiated cells were placed back into culture to recover for designated times, then were processed for pulsed-field gel analysis [64]. Plugs were embedded into 1% SeaKem Gold agarose in 0.5x TBE buffer, and electrophoresed for 23.5 hours at 180V, with a 50-165 second ramped switch time [65]. Gels were stained with SYBR Gold Nucleic Acid Stain (Invitrogen) for DNA visualization, and photographed. Integrated pixel intensities were plotted for each chromosome and the “smear” of DNA below chromosome 1 using ImageJ v. 1.52 “Gels” and “Measure” algorithms. Corresponding blank gel regions were used for background correction.
SGC-mediated antigenic variation assay
This experiment was performed with three biological replicates per genotype, comprised of one clone each from three independent Bbrad51 knockout lines (CE11Δrad51ko1 H5, CE11Δrad51ko2 E8, and CE11Δrad51ko3 A5; referred to as ko1H5, ko2E8, and ko3A5), and three subclones of wild type CE11 line parasites (CE11 B8, CE11 C2, and CE11 C5). Bbrad51 knockout and wild type parasites were cloned by limiting dilution [7]. RNAs were isolated one and five months after parasite cloning, using Ribozol (Amresco). RNAs were treated two times with TurboDNase (Ambion) supplemented with 1 mM MnCl2 [66], followed by inactivation with DNase Inactivation Reagent (Ambion). M-MuLV reverse transcriptase (New England Biolabs) and oligo-d(T) primers were used to make cDNAs. A hypervariable segment containing most of the CKRD domain of ves1α transcripts was amplified by RT-PCR, using “universal” primers vesUniF2 (TGGCACAGGTACTCAGTG) and PD1R (TACAANAACACTTGCAGCA) as described [15].
Sequencing and recombination analysis
Four independent amplifications of each cDNA were pooled in stoichiometrically equal amounts to maximize detection of rare variants and minimize single-sample PCR artifacts during sequencing. Paired-end amplicon libraries were generated with NEBNext reagents (New England Biolabs) by the University of Florida NextGen Sequencing Core Laboratory, incorporating Illumina TruSeq index sequences. Libraries, spiked with 8% PhiX genomic library as internal control, were sequenced on the Illumina MiSeq platform, using the Illumina Pipeline 1.8. Fastq reads were analyzed using Qiime2 pipeline [67]. Quality and adapter trimming were performed using CutAdapt [68, 69]. Further de-noising and amplicon sequence variant (ASV)-calling were performed using DADA2 [70], truncating the reads at 230 nt and allowing a maximum of 5 expected errors per read. In order to remove potential contaminating sequences, ASVs were aligned with the bovine genome, using BLAST [71]. In order to identify recombinant sequences, ASVs were first aligned using CLC Main Workbench, version 6.9.2. Recombination analyses were then performed on the alignments with RDP, GENECONV, Bootscan, Maxchi, Chimaera, SiSscan, and 3Seq, as implemented in RDP4 [72]. Only recombinant events identified by a minimum of 4 out of seven tests (at p ≤0.05) were considered statistically supported and included in downstream analyses. Note that no genome is currently available for the CE11 line. Therefore, the C9.1 line genome was used, as the C9.1 and CE11 lines are sibling clonal lines derived from the MO7 clonal line [6, 7, 12]. Recombinant results, including extraction of SGC tracts, were summarized using custom scripts written with R version 3.6.3 [73] through the RStudio shell. Mean lengths of SGC tracts were compared by one-way ANOVA, whereas tract length distributions were compared by the Mann-Whitney Rank Sum test, without expectation of a normal distribution of variance, using SigmaPlot version 11.0 (Systat Software, Inc.; San Jose, CA).
Data availability
Raw sequence reads are publicly available for download from NCBI, as BioProject #PRJNA357248 (accession numbers SRR5110992-SRR5111003). All recombinant transfection constructs are available upon request.
Author contributions
Conceptualization: DRA, EAM
Data curation: DRA
Experimental design: DRA, EAM, Y-PX
Formal analysis: DRA, EAM, MST
Funding acquisition: DRA
Investigation: DRA, EAM, SQ, Y-PX
Methodology: DRA, EAM, MST, Y-PX
Project administration: DRA
Resources: DRA
Supervision: DRA
Writing- original draft: DRA, EAM
Writing- review and editing: DRA, EAM, MST
Supporting Information Captions
S1 Table. Organizational nature of the ves loci to which reads mapped. Loci to which amplicons mapped are listed for all 12 samples (i.e., 1-month and 5-month RNAs from all three wild-type and knockout lines), along with mean lengths and the nature of each locus. Both divergent, bidirectional loci and unidirectional loci were active in transcription at some level in all samples.
S1_File.zip. This .zip file contains individual files of SGC tracts identified for all six clonal lines, in .csv format.
S2_File.fa. This file contains the full sequence of the pBbRad51-NanoLuciferase-2xHA-glmS plasmid (pBRNLHg), in fasta format.
S3_File.fa. This file contains the full sequence of the p2xHA-glmS-gfp-bsd plasmid (pHggb), in fasta format.
Acknowledgments
This work was supported by funds from National Institutes of Health grants R01 AI0055864 and T32 AI0007110, USDA Animal Health project FLA-VME-005011, and U.F. College of Veterinary Medicine. The authors thank Allison Vansickle for assistance with animal care and handling, and Kevin Brown, Linda Bloom, Salvador Castaneda Barba, Eva Top, and Olivia Kosterlitz for helpful comments.
Footnotes
This version includes all figures and supplementary files. No changes have been made otherwise.