Abstract
Segregation of chromosomes depends on the centromere. Most species are monocentric, with the centromere restricted to a single region per chromosome. In some organisms, monocentric organization changed to holocentric, in which the centromere activity is distributed over the entire chromosome length. However, the causes and consequences of this transition are poorly understood. Here, we show that the transition in the genus Cuscuta was associated with dramatic changes in the kinetochore, a protein complex that mediates the attachment of chromosomes to microtubules. We found that in holocentric Cuscuta species the KNL2 genes were lost; the CENP-C, KNL1, and ZWINT1 genes were truncated; the centromeric localization of CENH3, CENP-C, KNL1, MIS12, and NDC80 proteins was disrupted; and the spindle assembly checkpoint (SAC) was degenerated. Our results demonstrate that holocentric Cuscuta species lost the ability to form a standard kinetochore and do not employ SAC to control the attachment of microtubules to chromosomes.
Introduction
Faithful segregation of chromosomes during mitosis and meiosis depends on the centromere, a chromosomal domain that facilitates attachment of chromosomes to spindle microtubules. In monocentric chromosomes, the centromere is localized at a single site per chromosome, which is morphologically discernible as a primary constriction. Holocentric chromosomes, on the other hand, lack this primary constriction and instead have the centromere domains distributed along almost the entire chromosome length. Holocentricity evolved from monocentric organization independently several times during the evolution of both plants and animals 1; however, the causes of the transitions are still enigmatic. This is primarily because only a few holocentric species have been studied so far and because most groups of holocentric species evolved from the monocentric ancestors a long time ago, making the factors involved in the transition elusive.
In most species, the centromere is epigenetically determined by the presence of CENH3, a centromere-specific variant of histone H3 that replaces the canonical H3 histones in centromeric nucleosomes 2. At the same time, CENH3 serves as the basis for the kinetochore, a complex multiprotein structure that mediates the connection between centromeric chromatin and the microtubules of the mitotic spindle in most species. The backbone of the kinetochore consists of the constitutive centromere associated network (CCAN), which connects the kinetochore with centromeric chromatin, and the KMN network, which constitutes an interface towards spindle microtubules 3,4. The function of the kinetochore is regulated by additional proteins, the most studied of which belong to the spindle assembly checkpoint (SAC) 5,6 and the chromosome passenger complex (CPC) 7–9.
The role of CENH3 in centromere determination predicts that the transition from monocentric to holocentric centromere organization requires the formation of CENH3-containing domains along entire chromosomes. Indeed, in the few holocentric species studied to date, CENH3 is typically localized along the entire poleward surface of each chromatid where microtubules attach 10,11. An exception are holocentric insects that lack CENH3 and use an alternative pathway of kinetochore assembly that depends on CENP-T protein 12–14.
Recently, we identified the first exception in plants, in Cuscuta europaea, which belongs to the holocentric subgenus Cuscuta of the parasitic plant genus Cuscuta (Convolvulaceae) 15. In this species, the chromosomes restrict CENH3 to only one to three heterochromatin bands, despite being attached to the mitotic spindle along their entire length. This suggests that CENH3 has either lost its centromere function in this species or acts in parallel with an additional CENH3-independent mechanism of kinetochore assembly. Since monocentric relatives of C. europaea from the sister subgenus, Grammica, and the more distant subgenus, Monogynella, have CENH3 localized specifically in primary constrictions 16, it is plausible that the peculiar CENH3 localization in C. europaea resulted from changes in kinetochore assembly that were linked to the transition to holocentricity in the subgenus Cuscuta. However, how kinetochore assembly has changed and whether these changes are related to the transition to holocentricity remains unknown.
In this study, we addressed these questions by comparing the repertoire of major structural and regulatory kinetochore proteins and their chromosomal localization between two Cuscuta species from the holocentric subgenus Cuscuta (C. europaea and C. epithymum), two monocentric Cuscuta species from the sister subgenus Grammica (C. australis and C. campestris), and Ipomoea nil, which was included as an outgroup Convolvulaceae species. To obtain high-quality data for gene identification in the two holocentric Cuscuta species, we sequenced both their genomes and transcriptomes. The chromosomal localization of kinetochore proteins was determined using antibodies developed against key proteins representing different subcomponents of the kinetochore. Comparison of the results between monocentric and holocentric species allowed us to uncover an unprecedented level of changes that occurred specifically in the holocentric species and thus likely played an important role in the transition to holocentricity in Cuscuta.
Results
Transition to holocentricity in Cuscuta was associated with massive changes of kinetochore protein genes
Sequencing of the holocentric species C. europaea and C. epithymum resulted in genome assemblies of 975.8 Mb (N50 = 17.9 Mb) and 997 Mb (N50 = 3.3 Mb), respectively (Supplementary Note 1, and Supplementary Table 1). The completeness of gene space and quality of gene prediction were assessed using BUSCO and were comparable to genome assemblies previously published for the monocentric Cuscuta relatives C. australis and C. campestris (Supplementary Fig. 1). The quality of gene prediction in the genome assembly was also verified by the independent assembly of the transcriptomes, which showed similar results following BUSCO analysis (Supplementary Table 2). To identify kinetochore protein sequences in the species selected for this study, we created a sequence database of 29 structural and regulatory kinetochore proteins known in plants. First, we used the database as a query for blastp searches to identify homologous protein sequences in the monocentric species C. australis, C. campestris, and Ipomoea nil. The identified sequences were manually verified and corrected when needed, and added to the database to improve its sensitivity for homologous protein recognition. The improved database was then used for blastp searches in the two holocentric Cuscuta species. Comparison of the identified kinetochore protein genes revealed that all 29 tested genes are present and mostly intact in the monocentric species, whereas in the holocentric species some of the genes are either absent, significantly truncated, or duplicated accompanied by a higher rate of sequence divergence (Fig. 1a, Supplementary Table 3 and Supplementary Data 1).
The lost genes included both eudicotyledonous plant homologs of KNL2, referred to as αKNL2 and βKNL2 17, and four of eight spindle assembly checkpoint (SAC) genes, namely, BMF1, BMF2, BMF3, and MAD2 (Fig. 1a). Their absence was in all cases confirmed by comparison of genomic loci possessing these genes in C. australis with the orthologous loci in C. europaea and C. epithymum (Supplementary Figs. 2 and 3), as well as by their absence in genome-independent transcriptome assemblies. The only exception was BMF1 whose transcriptionally inactive fragment still remains in C. epithymum (Supplementary Fig. 3). Large gene truncations took place in three structural kinetochore protein genes, including CENP-C, KNL1, and ZWINT1, and the SAC gene MAD1 (Figs. 2 and 3 and Supplementary Figs. 4 and 5). Finally, the CENH3 gene in holocentric species was found to have duplicated once in the common ancestor of C. europaea and C. epithymum, and once independently in each of the two species. The diversification of the duplicated CENH3 genes in holocentric species resulted in considerably higher protein sequence variability for CENH3 compared with monocentric Grammica species, suggesting that they evolved more rapidly (Supplementary Figs. 6, 7, 8 and 9).
Given the function of proteins that are either missing or truncated, the changes are likely to have had a substantial impact on kinetochore assembly and function at multiple levels, from CENH3 loading (absence of KNL2) and kinetochore assembly (truncation of CENP-C, KNL1, and ZWINT1), to regulation of its function (absence of several key proteins of SAC) (Fig. 1b,c).
CENH3 histones do not have holocentric-like distribution in holocentric Cuscuta species
Since KNL2 is essential for proper loading of CENH3 to centromeres 17–20, the loss of both αKNL2 and βKNL2 in holocentric Cuscuta species is likely to have a serious impact on CENH3 localization. On holocentric chromosomes, CENH3 is expected to specifically localize along the poleward side of each chromatid. In contrast to this expectation, we have previously shown that CENH3 occurs in all but one prominent transversal heterochromatin band in C. europaea and that CENH3 distribution does not correlate with the distribution of mitotic spindle attachment sites detected with antibodies against α-tubulin (15 and Fig. 4a,b). To determine the localization of CENH3 in C. epithymum, we developed three antibodies against different N-terminal sequence variants of the proteins. Although the antibodies were made to recognize all CENH3 protein sequence variants present in the tested plant, none of them produced a signal on chromosomes and nuclei that could be distinguished from the background (Supplementary Fig. 10a-g). On the other hand, two of the antibodies developed for C. epithymum detected CENH3 in the heterochromatin domains in C. europaea (Supplementary Fig. 10c,e), demonstrating that they were functional for in situ detection. These results suggest that CENH3 is either not present in chromatin in C. epithymum or that its levels are considerably lower than in C. europaea, and thus below the limits of detection for the applied in situ immunodetection technique. Despite the absence of CENH3 signal, α-tubulin immunostaining revealed attachment of mitotic spindle microtubules to chromosomes along their poleward sides, confirming the holocentric nature of chromosomes in C. epithymum (Fig. 4c). This was in contrast to monocentric Cuscuta spp., which had microtubules attached only to CENH3 containing domains (Fig. 4d and data not shown). These results suggest that CENH3 does not function as a foundational kinetochore protein in holocentric Cuscuta species.
Kinetochore assembly is impaired in holocentric Cuscuta species
The chromosomal distribution of CENH3 together with the truncation of three structural kinetochore proteins suggested that kinetochore assembly may be impaired in holocentric Cuscuta species. To test whether the kinetochore assembles along the poleward chromosome surface, as expected for holocentric chromosomes, we examined the localization of CENP-C, which is a linker between CENH3 and the KMN network, and of MIS12, KNL1, and NDC80, which represent the three complexes of the KMN network (Fig. 1b). Antibodies were developed against peptides designed from domains that were conserved in the holocentric species. However, owing to high sequence similarity between species, it was likely that the antibodies against KNL1, NDC80, and MIS12 would also recognize homologous proteins from monocentric Cuscuta species. Indeed, when these antibodies were used for in situ detection, monocentromeres in C. australis as well as in C. reflexa from the more distant subgenus Monogynella were labeled, demonstrating the functionality of the antibodies (Fig. 4e-g and Supplementary Fig. 11). The antibodies against KNL1 and NDC80 proved to be particularly versatile, functioning even in Rhynchospora pubera, an evolutionarily very distant plant species with holocentric chromosomes, where they detected holocentromere-characteristic signals for both proteins (Fig. 4h,i). In agreement with the lack of CENH3 signal in C. epithymum, CENP-C, KNL1 and NDC80 were not detected on either mitotic chromosomes or in interphase nuclei in this species (data not shown). In C. europaea, these three proteins were detected in small subdomains embedded within CENH3-containing heterochromatin during interphase but not on mitotic chromosomes (Fig. 4j-l, Supplementary Movie 1, and data not shown). Simultaneous in situ detection of KNL1 with either CENP-C or NDC80 revealed that these proteins fully colocalized (Fig. 4m,n and Supplementary Movies 2 and 3). These results suggest that the assembly of the kinetochore during interphase in C. europaea still depends, at least in part, on the presence of CENH3, but that kinetochore organization is disrupted before cells enter mitosis. Strikingly, MIS12 was detected in 2 - 16 (n = 100) discrete nuclear domains during interphase in both holocentric species (Fig. 4o,p). In C. europaea, these domains were always located away from the CENH3-containing heterochromatin (Fig. 4o and Supplementary Movie 4), indicating that MIS12 has become independent of CENP-C and the KMN network proteins.
Conventional SAC is abolished in holocentric Cuscuta species
To test if the regulatory kinetochore complexes form on chromosomes in holocentric Cuscuta species despite the absence of the tested kinetochore proteins and the massive loss of the SAC genes observed, we raised antibodies against BUB3;1/2 and Borealin, which are components of the SAC and CPC, respectively. While the BUB3;1/2 antibodies produced monocentric-like signals on chromosomes in C. australis and C. reflexa, and holocentromere-like signals in Rhynchospora pubera, BUB3;1/2 was not detectable on chromosomes in holocentric Cuscuta species (Fig. 5a-c and data not shown). On the other hand, the antibodies against Borealin labeled the chromosomes in the region around areas of sister chromatid cohesion at centromeres in monocentric C. reflexa and along the entire chromosome length in both holocentric Cuscuta species (Fig. 5d-f). These results indicate that the conventional SAC is abolished, while the CPC maintains at least some of its functions in holocentric Cuscuta species.
Discussion
The peculiar CENH3 localization in C. europaea described in our previous study 15 suggested that the transition to holocentricity in the genus Cuscuta may have been associated with the formation of a CENH3-independent kinetochore assembly. In this study, we have demonstrated that the transition to holocentricity in Cuscuta species was associated with extensive changes in structural and regulatory kinetochore protein genes, and disruption of both standard kinetochore assembly and SAC regulation of mitotic chromosome segregation. This distinguishes holocentric Cuscuta species from both the holocentric nematode Caenorhabditis elegans, which use the CENH3-CENP-C pathway of kinetochore assembly 21, and holocentric insects, in which the CENH3-CENP-C pathway of kinetochore assembly was lost and replaced by the CENP-T pathway 12–14 (Fig. 6).
We hypothesize that one of the most important changes in the evolution of holocentric Cuscuta species was the loss of KNL2. In C. elegans, RNAi depletion of KNL2 leads to a reduction in the presence of CENH3 to levels undetectable by immunodetection, resulting in chromosome segregation defects and embryonic lethality 18,19. Similar phenotypes have been observed in KNL2 mutants in other species, including A. thaliana, demonstrating the general importance of KNL2 for CENH3 loading 17,20. Therefore, the depletion/absence of CENH3 in C. epithymum chromatin could be due to the absence of both KNL2 variants. On the other hand, it is puzzling that CENH3 accumulates in heterochromatin domains in C. europaea despite the loss of KNL2. Given that all heterochromatin domains that contain CENH3 possess the same repetitive sequences, whereas the heterochromatin domain that lacks these repeats also lacks CENH3 15,22, the incorporation of CENH3 into these domains could be DNA sequence-dependent. In light of the importance of KNL2 and CENH3 for centromere determination and kinetochore assembly, it is surprising that the loss of KNL2 in both holocentric Cuscuta species, the depletion/absence of CENH3 on chromosomes in C. epithymum, and the peculiar CENH3 distribution on chromosomes in C. europaea are neither lethal nor cause chromosome segregation defects. The simplest explanation is that CENH3 is no longer necessary for correct chromosome segregation in holocentric Cuscuta species (Supplementary Fig. 12).
The absence of detectable levels of structural kinetochore proteins on mitotic chromosomes in holocentric Cuscuta species is in contrast not only to monocentric Cuscuta species but also to the holocentric-like distribution of NDC80 and KNL1 in R. pubera (Cyperaceae), which was used as a holocentric control plant in this study (Fig. 4). This suggests that the formation of the standard kinetochore is disrupted in holocentric Cuscuta species. In C. epithymum, this could be primarily a direct consequence of the depletion/absence of CENH3 on the chromosomes. In C. europaea, the causes of kinetochore disruption must be different because CENH3-containing heterochromatin is present throughout the cell cycle and partially colocalizes with CENP-C, KNL1, and NDC80 proteins during interphase. The reasons why the putative complex of kinetochore proteins formed during interphase disappears at the onset of mitosis are not clear. Considering that three structural kinetochore proteins are truncated (Fig. 1a), one possibility is that the complex falls apart because of disrupted interactions between kinetochore components (Fig. 1c). The truncation of CENP-C may be the most critical because CENP-C is the only protein known to link centromeric chromatin to the outer kinetochore in plants (Figs. 1b,c and 2a). Although the N-terminus of CENP-C is divergent in sequence between eukaryotes, it has been shown to bind MIS12c in both humans and yeast, indicating a conserved function 23–25. Given that this function is also conserved in plants, the N-terminal truncation of CENP-C in C. europaea should interfere with MIS12c binding. Consistent with this notion, we found that MIS12 does not colocalize with CENP-C and accumulates in discrete domains that are clearly separated from CENH3-containing domains (Fig. 4l and Supplementary Movie 4). While the colocalization of CENP-C, KNL1, and NDC80 suggests that the kinetochore assembles during interphase, despite the absence of MIS12, the complex may not be sufficiently stable to survive mitosis. The N-terminal truncation of CENP-C is, however, unlikely to cause the disappearance of the protein itself because the N-terminus is not required for the binding of centromeric nucleosomes (Fig. 2a and 26). Although the internal portion of CENP-C contains a domain that binds centromeric nucleosomes in humans and yeast (Fig. 2a), the high sequence divergence of CENP-C prevented us from determining by a sequence similarity-based approach whether it overlaps with the region lost in C. europaea. On the other hand, the large size disparity between the domains containing CENH3 and CENP-C (Fig. 4j,o and Supplementary Movies 1 and 2) suggests that there is an imbalance between the levels of the two proteins that may reflect inefficient binding of CENP-C to CENH3.
The results discussed above support a model in which holocentric Cuscuta species either use substantially reduced kinetochores lacking CENH3, CENP-C, KNL1, MIS12, and NDC80 or, more likely, have evolved a completely novel mechanism of chromosome attachment to the mitotic spindle. This conclusion is also supported by the degeneracy of SAC genes that would have been required had the kinetochore been present and functioning in a conventional manner. Alternative kinetochores have already been described in Kinetoplastida, most of which have lost CENH3 and all CCAN and KMN genes. They consist of proteins that probably evolved from meiotic components of chromosome synapsis and homologous recombination machinery 27,28. Moreover, kinetochore-independent chromosomal movement along the spindle, facilitated by kinesin motor proteins, has been described for acentric chromosomes in Drosophila neuroblasts 29,30 and for chromatin knobs in maize 31,32.
Overall, we have shown that the transition to holocentricity in Cuscuta species was unique among all species studied to date. It was accompanied, and perhaps even triggered, by the degeneration of standard kinetochore structure and regulation and the formation of a novel mechanism for chromosome attachment to microtubules. The insights gained in this study provide the basis for future studies aimed at uncovering the plasticity of kinetochore assembly and discovering as yet unknown principles of chromosome segregation.
Material and Methods
Plant material
Seeds of C. europaea (serial number: 0101147) were obtained from the Royal Botanic Garden (Ardingly, UK). C. epithymum plants were collected from a natural population at “U Cáby” (Kroclov, Czech Republic). Seeds of C. australis and C. campestris were provided by Prof. Jianqiang Wu (Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China) and Dr. Chnar Fathoulla (University of Salahaddin, Kurdistan Region, Iraq), respectively. C. reflexa Roxb. plant was obtained from the Botanic Gardens of the Rhenish Friedrich-Wilhelm University (Bonn, Germany). Cuscuta plants were cultivated on the following host plant species: Urtica dioica (C. europaea), Betonica officinalis and Coleus blumei (C. epithymum), Ocimum basilicum (C. australis and C. campestris), or Pelargonium zonale (C. reflexa). Plants of R. pubera were obtained from Dr. André Marques (Max Planck Institute for Plant Breeding Research, Cologne, Germany).
Genome sequencing and assembly
DNA for Illumina and Pac-Bio sequencing was isolated using the CTAB method from nuclei extracted from young shoots of C. europaea and C. epithymum as described previously 33. Shotgun Illumina paired-end sequencing of DNA was performed by the Brigham Young University (Provo, UT, USA) and Admera Health (South Plainfield, NJ, USA). High molecular weight nuclear DNA used for Oxford nanopore sequencing was isolated using a modified CTAB protocol as described previously 34. Nanopore sequencing was performed as described 22. Detailed information about all genome sequence datasets produced in this study is provided in Supplementary Table 4.
Illumina paired-end reads and Oxford nanopore reads were assembled using MaSuRCA 35. PacBio HiFi reads were assembled using Hifiasm assembler (v0.15.5-r350; 36) with default parameters for PacBio HiFi sequence reads. Since the quality of the HiFi-based assemblies were considerably better than those generated by MaSuRCA (Supplementary Table 1), they were selected for submission to European Nucleotide Archive (https://www.ebi.ac.uk/ena/browser/home; Accession numbers: ERZ12293622 (C. europaea) and ERZ12293623 (C. epithymum)). Completeness and contiguity of assemblies were evaluated using BUSCO (v5.2.2; 37) and QUAST (v5.0.2; 38). Genome characteristics were evaluated using kmer analysis and the jellyfish program 39 with kmer length 21 and 51 for Illumina and PacBio HIFI sequence reads, respectively. Heterozygosity was estimated using GenomeScope program 40.
Transcriptome sequencing, assembly and gene prediction
Total RNA was isolated using the Trizol method. Preliminary sequencing for de-novo transcriptome assemblies of C. epithymum, C. europaea, and C. campestris was performed at GATC Biotech (Konstanz, Germany) using Illumina technology producing 50bp paired-end reads. In each species, RNA was isolated from shoots and inflorescences, mixed in a 1:1 ratio, treated with DNase I (Ambion, Austin, TX, USA), and then enriched for poly-A fraction using the Dynabeads mRNA purification kit (Thermo Fisher Scientific, Waltham, MA, USA). Deep transcriptome sequencing of C. epithymum, C. europaea, and C. australis was done using RNA isolated from shoot tips, shoot internodia, or inflorescences at various stages of development. For each species and tissue, the RNA samples were produced in three biological replicates (samples from different plants collected at different time). Subtraction of poly-A RNA using NEBNext Ultra II with a Poly-A Selection kit (New England Biolabs, Ipswich, MA, USA) and poly-A RNA sequencing were performed at Admera Health (South Plainfield, NJ, USA). The sequencing generated more than 500 million 151 nt long paired-end reads for each RNA sample, giving a total yield of about 5 billion reads per species (Supplementary Table 5).
Transcriptomes were de-novo assembled using the Trinity program 41 with default options from pair-end reads. Sequences from individual replicates and tissue samples of each species were concatenated before their assembly. The presence of single copy orthologs in the transcriptomes was evaluated using the BUSCO (v.5.2.2) program 37. To create gene models, pair-end RNA-Seq Illumina reads were aligned to genome assembly using the STAR program (v2.7.7a; 42) with parameters --outSAMstrandField intronMotif --outSAMtype BAM SortedByCoordinate -- alignIntronMax 20000. Each sample was aligned independently. Resulting alignments were merged into a single BAM file using samtools 43. Whole length transcripts and genes were then reconstructed using the Stringtie program (v2.1.7; 44) with parameters -c 2 -f 0.05. Candidate coding regions within transcript sequences were identified using TransDecoder program (https://github.com/TransDecoder/TransDecoder) with default settings.
Predicted protein sequences from C. europaea and C. epithymum were compared with published proteomes of C. campestris, C. australis, and Ipomoea nil using program OrthoFinder (v2.5.2; 45) to identify orthologs and orthogroups. Genome assemblies and associated files containing detailed information about predicted gene models, protein and CDS sequences were downloaded from http://plabipd.de/portal/cuscuta-campestris (C. campestris) or GenBank (https://www.ncbi.nlm.nih.gov/genbank/; C. australis: GCA_003260385.1; I. nil: GCF_001879475.1). RNA-seq data for these species were downloaded from the Sequence Read Archive (SRA; https://www.ncbi.nlm.nih.gov/sra) from the following accession numbers: SRR6664647 – SRR6664654 (C. australis), ERR1916345 – ERR1916364 (C. campestris), and DRR024544 – DRR024549 (I. nil). The RNA-seq data produced in this study or downloaded from other studies were used to verify and correct automatically predicted gene models if needed. Manual verification and editing of gene models were performed using Apollo Genome Annotation Editor 46.
Identification and characterization of kinetochore proteins
Structural and regulatory kinetochore protein sequences identified in A. thaliana were downloaded from uniprot database and from published studies 47–49. These sequences were used for blastp searches to identify their homologs in genome assemblies of C. australis and C. campestris 50,51, representing monocentric Cuscuta species, and in I. nil 52, selected as a monocentric nonparasitic genus of the family Convolvulaceae. All sequences with significant similarity hits were manually inspected to remove false positives, correct erroneous protein sequences, or add additional variants due to alternative splicing. Protein sequences from A. thaliana and the three Convolvulaceae species were combined into a reference data set that was used for blastp and tblastn searches to find homologous kinetochore protein genes in holocentric C. epithymum and C. europaea. The searches were primarily performed in gene and protein sequences predicted using StringTie in the assembly produced from Pac-Bio reads, but the results were verified using the data from the parallel genome assemblies that were made from Illumina and nanopore reads as well as the transcriptome assemblies produced using Trinity.
CENH3 sequences from additional Cuscuta species or other plants of the same Cuscuta species were obtained from our previous study (C. campestris, C. japonica 15), identified in transcriptome shotgun assemblies (C. reflexa, C. campestris) or other available genome assembly (C. epithymum), amplified from RNA using RT-PCR or RACE methods (C. epithymum), or reconstructed from available next generation genome sequence data using GRABb and GeneWise programs (C. americana, C. californica, C. pentagona; 53,54). More detailed information about sources of the CENH3 sequences is provided in Supplementary Table 6.
Sequence alignments were performed using MUSCLE 55. Time trees were inferred using ITS and rbcL sequences and methods described in our previous study 16. ITS and rbcL sequences from C. australis were reconstructed from Illumina paired end reads (SRA run accession number: SRR5851367) using RepeatExplorer 56. A search for conserved sequence motives was performed using MEME 57. Sequence logos were generated using WebLogo 58. The sources of CENP-C and ZWINT1 sequences used for MEME and WebLogo analyses are provided in the Supplementary Table 7.
Antibodies
Antibodies to all kinetochore proteins used in this study were custom-produced by GenScript (Piscataway, NJ, USA) or Biomatik (Cambridge, ON, Canada) against peptides designed from regions that were most conserved among Cuscuta species and I. nil. The particular peptide sequences used for immunization in rabbits were always designed from C. europaea kinetochore protein sequences, with the exception of CENH3, which was designed from variable N-termini. The peptide sequences are provided in Supplementary Table 8. Antibody specificity was confirmed using in situ immunodetection to identify signals in the primary constrictions of monocentric Cuscuta species. The mouse monoclonal antibody to α-tubulin was purchased from Sigma-Aldrich (St. Louis, MO, USA; catalog number: T6199).
Reactivity of the antibodies raised against CENH3 with individual CENH3 variants in C. europaea and C. epithymum was tested using western blot. Full-length CENH3-coding sequences were cloned into pEXP5-NT/TOPO vector (Invitrogen, Carlsbad, CA, USA) in frame with the N-terminal 6xHis tag-coding sequence. Recombinant proteins were produced in BL21-AI strain of E. coli (Invitrogen, Carlsbad, CA, USA) upon induction with isopropyl β-D-thiogalactoside (IPTG). Total protein was extracted using 1× SDS-PAGE buffer according to the manufacturer’s instructions supplied with the pEXP5-NT/TOPO vector, separated on 12% SDS-PAGE gel, and then transferred onto Immobilon-P membrane (Sigma-Aldrich, St. Louis, MO, USA) using TE77XP semi-dry transfer unit (Hoefer, Holliston, MA, USA). Membranes were blocked using 5% skim milk powder in 1× PBS (PBS-M) overnight at 4°C and then incubated for 2 hours at RT with the primary antibody diluted in 1× PBS-M to 2–3 μg/ml. Following six washes in 1× PBS for 10 min at RT each, the antibodies were detected using goat anti-rabbit IgG StarBright Blue 520 secondary antibodies (Bio-Rad, Hercules, CA, USA; catalog number: 12005870) in 1× PBS-M for 1 h at RT. Fluorescent signals were visualized using the Chemidoc MP imaging system (Bio-Rad, Hercules, CA, USA). The presence of recombinant CENH3 proteins on the membrane was always verified by detection with the HisG epitope tag antibody (Thermo Fisher Scientific, Waltham, MA, USA; catalog number: R940-25) and secondary antibody StarBright Blue 700 Goat Anti-Mouse IgG (Bio-Rad, Hercules, CA, USA; catalog number: 12004159).
In situ immunodetection of kinetochore proteins
The biological material (shoot tips for Cuscuta and root tips for Rhynchospora) was fixed in TRIS-fix buffer (4% formaldehyde, 10 mM Tris, 10 mM Na2EDTA, 100 mM NaCl, pH 7.5) for 30 min at 10°C. Infiltration of the fixative was enhanced by applying a vacuum during the first 5 minutes. After fixation, the material was washed in TRIS buffer (10 mM Tris, 10 mM Na 2 EDTA, 100 mM NaCl, pH 7.5) on ice for 30 minutes. For the preparation of chromosomes and nuclei in Cuscuta species, the squashing technique was first used after digesting the shoot apical meristems for one hour at 27.4°C in 2% cellulase ONOZUKA R10 (SERVA Electrophoresis, Heidelberg, Germany) and 2% pectinase (MP Biomedicals, Santa Ana, CA, USA). The squashes were performed in either 1× phosphate-buffered saline (PBS) or LB01 (15 mM Tris(hydroxymethyl)aminomethane, 2 mM Na2EDTA, 0.5 mM spermine, 80 mM KCI, 20 mM NaCl, 15 mM mercaptoethanol, and 0.1% (v/v) Triton X-100, pH 7.5). With this technique, it was possible to obtain reasonable results, but to minimize background, chromosomes and nuclei were later isolated in suspension as described below. Shoot apical meristems were cut up in 1 ml of cold LB01 using a mechanical homogenizer (Ultra-turrax T8, IKA Z404519). The suspension was filtered through a 48 μm nylon mesh and spun onto slides using a Hettich centrifuge with cytospin chambers. In Rhynchospora pubera, formaldehyde-fixed root tip meristems were digested with 2% cellulase ONOZUKA R10 (SERVA Electrophoresis, Heidelberg, Germany) and 2% pectinase (MP Biomedicals, Santa Ana, CA, USA) for one hour at 37 °C. After washing with cold distilled water, meristems were squashed in 1× PBS. Before immunostaining, slides were incubated for 30 minutes at room temperature (RT) in 1× PBS-T1 buffer (1× PBS and 0.5% Triton, pH 7.4) (RT) to increase permeabilization. Slides were washed twice in 1× PBS for 5 minutes at RT and once in 1× PBS-T2 (1× PBS, 0.1% Tween 20, pH 7.4) for 5 minutes at RT. For immunostaining, slides were incubated with primary antibody diluted in 1× PBS-T2 overnight at 4°C. The dilution ratios were as follows: 1:1000 for antibodies to kinetochore proteins and 1:100 for antibodies to α-tubulin (Sigma-Aldrich, St. Louis, MO; catalog number T6199). After washing twice for 5 minutes in 1× PBS at RT, slides were incubated for one hour at RT with the secondary antibody in 1× PBS and then washed twice for 5 minutes in 1× PBS at RT. Primary rabbit and mouse antibodies were detected with goat anti-rabbit Rhodamine Red X (dilution 1:500; Jackson ImmunoResearch, Suffolk, UK; catalog number: 111-295-144) and goat anti-mouse Alexa Fluor 488 (dilution 1:500; Jackson ImmunoResearch; catalog number: 115-545-166), respectively. To distinguish specific signals from background signals caused by nonspecific binding of the secondary antibody, negative control slides were used and subjected to the same treatments as for standard detection, except that the primary antibody was not added. For simultaneous detection of different proteins with two rabbit antibodies, antibodies were labeled directly using Alexa Fluor 488 and Alexa Fluor 568 antibody labeling kits (Thermo Fisher Scientific, Waltham, MA, USA; catalog numbers: A20181 and A20184, respectively) according to the manufacturer’s recommendations. The degree of labeling was determined using a spectrophotometer DS-11 (DeNovix, Wilmington, DE, USA). Before embedding the slides in Vectashield mounting medium (Vector Laboratories, Burlingame, CA) supplemented with 49,6-diamino-2-phenylindole (DAPI), the slides were fixed with 4% formaldehyde in 1× PBS for 10 minutes at RT and then washed twice for 5 minutes in 1× PBS at RT.
Microscopy
For conventional wide-field fluorescence microscopy, a Zeiss AxioImager.Z2 microscope equipped with an Axiocam 506 mono camera was used along with an Apotome2.0 device for better resolution in the z-axis, which was needed when the images were composed of multiple optical sections. Images were generated using the ZEN 3.2 software (Carl Zeiss GmbH). To capture signals at the super-resolution level (∼120 nm using a 488 nm laser), spatial structured illumination microscopy (3D-SIM) was performed using a 63×/1.4 Oil Plan-Apochromat objective on an Elyra PS.1 microscope system, controlled by the ZENBlack software (Carl Zeiss GmbH). Images were captured using the 405, 488, and 561 nm laser lines for excitation and the appropriate emission filters 59. Three-dimensional movies were produced from 3D-SIM image stacks using the Imaris 9.7 (Bitplane) software.
Supplementary Information
Supplementary Notes
Supplementary Note 1: Genome Assembly and gene prediction in holocentric Cuscuta spp.
To assemble genome sequences of C. epithymum and C. europaea, we sequenced the genomic DNA using Illumina, Oxford nanopore, and Pac-Bio Hi-Fi sequencing technologies. Sequence reads from the two former technologies were assembled using MaSuRCA 1, whereas Pac-Bio Hi-Fi reads were assembled using Hifiasm 2. The latter type of the assembly was considerably better in both species (Supplementary Table 1). The total assembly size in C. epithymum was 975 Mbp, which is 1.8-fold bigger than the estimated genome size (1C = 533 Mb) 3. This disparity was attributed to high heterozygosity in the sequenced clone, resulting in the presence of two haplotypes in the assembly (Supplementary Table 1). The C. europaea genome assembly was 997 Mbp in size, corresponding to about 85% of previously estimated genome (1C = 1,169 Mb). This difference was likely due to the presence of highly abundant satellite DNA repeats, which make up 18% of the genome and are generally difficult to assemble 3. Gene prediction using the Stringtie program resulted in 89,521 and 49,635 gene models for C. epithymum and C. europea, respectively. The almost two-fold higher number of gene models in C. epithymum was caused by the presence of two haplotypes in the assembly and thus two alleles for most genes. BUSCO analysis revealed a high proportion of missing genes in both C. epithymum and C. europaea, but comparison with C. campestris, C. australis, and I. nil showed that it was not due to poor genome assemblies and/or gene prediction but to a large gene loss that preceded the divergence of monocentric and holocentric Cuscuta species (Supplementary Fig. 1). This was also confirmed by BUSCO analysis of the assembly-independent de novo transcriptome assemblies (Supplementary Table 2).
Supplementary Figures
Supplementary Movies
Supplementary Movie 1 | Spatial distribution of CENH3 (red) and KNL1 (green) in an interphase nucleus (blue) of C. europaea.
Supplementary Movie 2 | Spatial distribution of KNL1 (green) and CENP-C (red) in an interphase nucleus (blue) of C. europaea.
Supplementary Movie 3 | Spatial distribution of KNL1 (green) and NDC80 (red) in an interphase nucleus (blue) of C. europaea.
Supplementary Movie 4 | Spatial distribution of CENH3 (red) and MIS12 (green) in an interphase nucleus (blue) of C. europaea.
Acknowledgements
This research was financially supported by grants from the Czech Science Foundation (20-25440S) and the Czech Academy of Sciences (RVO:60077344). Computational resources and data-storage facilities were provided by the ELIXIR-CZ Research Infrastructure Project (LM2018131). We thank to J. Látalová and V. Tetourová for their technical assistance.