Abstract
The carboxy-terminal domain (CTD) of RNA polymerase (Pol) II is an intrinsically disordered low-complexity region that is critical for pre-mRNA transcription and processing. The CTD consists of hepta-amino acid repeats varying in number from 52 in humans to 26 in yeast. Here we report that human and yeast CTDs undergo cooperative liquid phase separation at increasing protein concentration, with the shorter yeast CTD forming less stable droplets. In human cells, truncation of the CTD to the length of the yeast CTD decreases Pol II clustering and chromatin association. CTD droplets can incorporate intact Pol II and are dissolved by CTD phosphorylation with the transcription initiation factor IIH kinase CDK7. Together with published data, our results suggest that Pol II forms clusters at active genes through interactions between CTDs and with activators, and that CTD phosphorylation removes Pol II enzymes from clusters for transcription elongation.
Cellular processes often require clustering of molecules to facilitate their interactions and reactions1,2. During transcription of protein-coding genes, RNA polymerase (Pol) II clusters in localized nuclear hubs3. Whereas Pol II concentration in the nucleus is estimated to be ~1 μM, it increases locally by several orders of magnitude4. Such high Pol II concentrations are reminiscent of the clustering of proteins in membrane-less compartments such as P granules, Cajal bodies and nuclear speckles1,2,5,6. These cellular compartments are stabilized by interactions between intrinsically disordered low-complexity domains (LCD) and depend on liquid-liquid phase separation (LLPS) 1,2,6-11. However, the molecular basis of Pol II clustering remains unknown.
The largest subunit of Pol II, RPB1, contains a C-terminal low-complexity domain (CTD) that is critical for pre-mRNA synthesis and co-transcriptional processing12. The CTD is conserved from humans to fungi, but differs in the number of its hepta-peptide repeats with the consensus sequence Y1S2P3T4S5P6S713,14. The human CTD (hCTD) contains a N-terminal half, which comprises 26 repeats and resembles the CTD from the yeast Saccharomyces cerevisiae (yCTD), and a C-terminal half containing 26 repeats of more divergent sequence (Supplementary Fig. 1a). CTD sequences from different species all contain a high number of tyrosine, proline and serine residues (Supplementary Fig. 1b)13,15. The most conserved CTD residues are Y1 and P6 that are present in all 52 repeats of hCTD. Truncation of the CTD of RPB1 in Saccharomyces cerevisiae to less than 13 repeats leads to growth defects and a minimum of eight repeats is required for yeast viability16. The CTD forms a mobile, tail-like extension from the core of Pol II14 that is thought to facilitate the binding of factors for co-transcriptional RNA processing and histone modification13,14.
Despite its extremely high conservation, its essential functions, and a large number of related published studies, the unique CTD structure and properties have remained enigmatic. Here we show that the CTD can undergo cooperative liquid-liquid phase separation in vitro that is driven by weak multivalent interactions. We further show that the CTD is critical for clustering of Pol II in human cells. Together with published results, we arrive at a model for gene activation that involves CTD-mediated Pol II clustering at active gene promoters, and release of initiated polymerases from these clusters after CTD phosphorylation.
RESULTS
CTD of Pol II phase separates into liquid-like droplets
To investigate whether liquid-liquid phase separation of CTD may underlie Pol II clustering, we expressed and purified hCTD and yCTD from Escherichia coli. We used a prokaryotic expression system to prevent eukaryotic post-translational modifications. The biophysical properties of the purified CTD proteins were characterized using circular dichroism (Supplementary Fig. 1c). CD spectroscopy showed that hCTD and yCTD are intrinsically disordered in solution (Supplementary Fig. 1c), consistent with the low complexity of CTD sequences (Fig. 1a) 14.
Next we investigated the ability of CTD to undergo liquid-liquid phase separation using a combination of differential interference contrast (DIC) microscopy and fluorescence microscopy. DIC microscopy revealed the formation of micrometer-sized droplets at a concentration of 20 μM hCTD in the presence of 5-10% of the molecular crowding agent dextran (Supplementary Fig. 2a). Fluorescence microscopy demonstrated that hCTD molecules were strongly concentrated within the droplet interior compared to the surrounding milieu (Supplementary Fig. 2a, lower panels). At higher dextran concentration (16%), droplets could be detected already at a concentration of 5 μM hCTD (Fig. 1b-c). The number of droplets increased with increasing protein concentration (Fig. 1c), consistent with the general concentration-dependence of liquid phase separation17. In addition, hCTD formed droplets in the presence of another molecular crowding agent, the polysaccharide Ficoll (Supplementary Fig. 2b). hCTD also underwent LLPS after cleavage of the MBP tag, while MBP alone did not form droplets in presence of molecular crowding agents (Supplementary Fig. 2c). hCTD droplet formation was also robust against changes in ionic strength (Supplementary Fig. 2d), and incubation of the sample for 90 minutes at different temperatures (Supplementary Fig. 2e).
Like hCTD, yCTD formed droplets in a concentration-dependent manner (Fig. 1d; Supplementary Fig. 3a-b). Contacts of yCTD droplets led to fusion and formation of a single spherical droplet (Fig. 1e; Supplementary movie 1). At concentrations subcritical for LLPS, yCTD was incorporated into preformed hCTD droplets and hCTD was included into preformed yCTD droplets (Supplementary Fig. 3c), in agreement with the ability of the CTD to be trapped into droplets and hydrogels of LCD proteins 18,19. The combined data show that the CTD of Pol II forms LCD-LCD interactions and readily undergoes LLPS to form liquid-like droplets in solution.
Liquid droplets and cellular puncta are held together by weak, distributed interactions between LCDs that are sensitive to aliphatic alcohols6,20,21. As expected for such interactions, liquid phase separation of yCTD was abolished by addition of 5-10% 1,6-hexanediol (Fig. 1f; Supplementary Fig. 3d). Addition of 5-10% of the hexanediol isomer 2,5-hexanediol also dissolved yCTD droplets (Supplementary Fig. 3d). Because it was shown that 2,5-hexanediol is less efficient in dissolving droplets and hydrogels22, the data indicate that CTD droplets are more sensitive to aliphatic alcohols than other LCD-LCD interactions. On the contrary, CTD phase separation is robust to changes in ionic strength (Supplementary Fig. 2d).
CTD length influences CTD phase separation in vitro
A characteristic property of liquid-like droplets is fast diffusion of molecules in their interior1. We used fluorescence recovery after photobleaching (FRAP) to compare diffusion kinetics of hCTD and yCTD molecules within droplets. MBP-tagged hCTD and yCTD proteins were fluorescently labeled on a single cysteine residue that is present C-terminal to the TEV protease cleavage site. After cleavage of the MBP tag and droplet formation, circular regions in the interior of CTD droplets were bleached (Methods). Within hCTD droplets, the bleached fluorescence recovered with a half time of 4.08 s ± 0.88 s (Fig. 1g). For yCTD, recovery was faster with a half time of 1.43 s ± 0.41 s. (Fig. 1g).
These results demonstrate that CTD molecules within droplets are generally highly dynamic, confirming the liquid-like nature of CTD droplets. The difference in fluorescence recovery between hCTD and yCTD further suggests that the higher number of repeats in hCTD strengthens CTD-CTD interactions. This observation is consistent with the concentration-dependent ability of hCTD and yCTD to undergo LLPS when fused to a MBP-tag. MBP-hCTD phase separated at a concentration of 5 μM (Fig. 1c; Supplementary Fig. 4 top). In contrast, LLPS of MBP-yCTD started only at a 4-6 fold higher protein concentration (Supplementary Fig. 4, middle). When the smaller, dimerizing GST-tag was used to replace the more soluble MBP-tag23, the critical concentration for yCTD phase separation decreased to approximately 5 μM (Supplementary Fig. 4, bottom; Fig. 1d). These results suggest that the solubilizing effect of MBP counteracts droplet formation. This effect is more easily overcome by hCTD because the higher repeat number and valency results in stronger CTD-CTD interactions compared to yCTD. We conclude that the length of the CTD influences the stability and dynamics of LLPS droplets, with a longer CTD leading to stronger CTD-CTD interactions and less dynamic droplets.
CTD droplets recruit intact Pol II
The above results indicate that CTD-CTD interactions within liquid droplets may underlie Pol II clustering. However, we could not test directly whether intact Pol II forms LLPS droplets in vitro because it was impossible to prepare Pol II at a sufficient concentration in the presence of dextran or Ficoll. We could however test whether Pol II could be trapped within CTD droplets. We purified Pol II from yeast cells, labeled it with the fluorescent dye Alexa 594 and added it to pre-formed CTD droplets at a concentration of 0.02 μM. Fluorescence microscopy showed that Pol II located to CTD droplets (Fig. 1h).
CTD length controls Pol II clustering in human cells
To explore whether CTD-based LLPS may underlie Pol II clustering in cells, we engineered two human cell lines that express a fluorescently Dendra2-tagged version of RPB1. To create these cell lines we used α-amanitin resistant RPB1 protein (N792D) and grew cells in the presence of α-amanitin, which leads to the degradation of endogenous RPB13. Such cell lines are known to recapitulate the behavior of endogenous wild-type Pol II3,24,25,26. One cell line contained the full-length CTD with 52 repeats (RPB1-52R), whereas the other cell line contained a truncated CTD with 25 repeats (RPB1-25R) that closely resembles the yCTD sequence (Fig. 2a). The two cell lines remained viable upon degradation of endogenous RPB1 after treatment with α-amanitin and expressed similar levels of the Dendra2-tagged exogenous Pol II as assessed by western blotting (Fig. 2b), confocal imaging and FACS analysis (Supplementary Fig. 5a-c).
We now studied clustering of Pol II in these human cell lines with the help of 3D-PALM super-resolution microscopy using induced astigmatism by a cylindrical lens (Fig. 2c-e; Supplementary Fig. 5d)3. Compared to cells with full-length CTD (52R), cells with the truncated, yeast-like CTD (25R) showed less Pol II clustering (Fig. 2c-d). These results suggested that CTD interactions underlie Pol II clustering in cells and that the CTD length influences clustering. To test this directly, we further created a cell line containing an artificially extended CTD (RPB1-70R, Methods). This cell line was also viable upon degradation of endogenous RPB1 but expressed RPB1 at a lower level then the two other cell lines (Fig. 2b). Despite this difference in expression level, the 70R cell line showed even more Pol II clustering than cells with wild-type, full-length CTD (Fig. 2c-d), strongly supporting our findings.
For all three cell lines, differences in CTD-dependent cluster density were supported by quantitative analysis on the basis of a modified Ripley function L(r), which compares the spatial distribution of localizations to complete spatial randomness (L(r)=0 for all r). In all cells, L(r) curves showed a strong clustering signature (Fig. 2e; Supplementary Fig. 5d). Whereas the sharp increase observed at scales less than 100 nm can be influenced by photophysical effects, such as blinking of Dendra227, the continuous increase at larger spatial distances is representative of Pol II clustering. Taken together, these results demonstrate that Pol II clustering in cells depends on the CTD and increases with increasing CTD length.
CTD length influences Pol II dynamics in cells
We next investigated the impact of CTD length on Pol II dynamics in vivo using two orthogonal approaches, live-cell single particle tracking (SPT)28 and FRAP experiments. Because these methods require a high signal-to-noise ratio and a photo-stable fluorescent label, we established cell lines with a Halo-tag on RPB1 containing 25, 52 and 70 CTD repeats (25R, 52R and 70R, respectively) (Supplementary Fig. 6). We then tracked single molecules of Pol II in live cells by single-step photo-activation and -bleaching (Fig. 3a-b, Supplementary movies 2-4). Subsequent kinetic modeling analysis assuming free and bound state (Fig. 3c, Supplementary Fig. 7a) revealed that 29.1% of wild-type Pol II (RPB1-52R) in live cells was not mobile and apparently chromatin-associated. The bound Pol II fraction was decreased to 21% in RPB1-25R cells and was increased to 38.4 % in RPB1-70R cells (Fig. 3c-d; Supplementary Fig. 7b-c). In addition, the diffusion coefficients for free Pol II were higher and lower, respectively, for RPB1-25R and RPB1-70R cells. Free diffusion coefficients of 3.74, 2.97, and 2.34 μm2/s were measured in RPB1-25R, RPB1-52R and RPB1-70R cells, respectively (Fig. 3e). These large differences in diffusion coefficients cannot be explained solely by differences in mass or size (Online Methods). Therefore, our results indicate that CTD length strongly influences Pol II mobility in vivo, with shorter and longer CTDs leading to higher and lower mobility, respectively.
These findings in cells reflect our observed length-dependence of CTD-CTD interactions in vitro (Fig. 1g; Supplementary Fig. 4). Indeed, FRAP recovery curves in human cells depended on CTD length (Fig. 3f), consistent with differences in FRAP recovery kinetics observed between hCTD and yCTD droplets in vitro (Fig. 1g). Analysis of these FRAP recovery curves by a reaction-dominant two-state model29,30 further showed that the fraction that did not recover within a few seconds increased from 27% in RPB1-25R cells to 35% in RPB1-52R cells to 38% in RPB1-70R cells (Supplementary Fig. 7d-f). This is consistent with the SPT results (Fig. 3d), which also showed a higher chromatin-associated fraction for Pol II with a longer CTD. Together our data show that longer CTDs result in more clustering and chromatin-association in vivo, reflecting the influence of CTD length on liquid-liquid phase separation in vitro.
CTD phosphorylation dissolves droplets
Finally, we investigated whether CTD phosphorylation impacts phase separation. It has long been known that assembly of the pre-initiation complex at Pol II promoters requires an unphosphorylated CTD, and that subsequent CTD phosphorylation at S5 CTD residues by the kinase CDK7 in transcription factor IIH (TFIIH) stimulates the transition of Pol II into active elongation31,32. We treated hCTD with recombinant human TFIIH subcomplex containing CDK7 kinase33 and adenosine triphosphate (ATP), leading to S5 phosphorylation of hCTD (Supplementary Fig. 8a; Fig. 4a). The resulting CDK7-phosphorylated hCTD was no longer able to form droplets, whereas prior incubation with ATP alone did not inhibit LLPS (Fig. 4b). Phosphorylation of yCTD by the yeast TFIIH kinase subcomplex also inhibited phase separation (Supplementary Fig. 8b-c). In addition, phosphorylation of preformed hCTD droplets by human CDK7 caused gradual shrinking and ultimately disappearance of hCTD droplets (Fig. 4c; Supplementary movie 5). Therefore, phosphorylation at S5 positions is incompatible with CTD phase separation and transfers the CTD from the highly concentrated state within droplets to the dispersed pool.
DISCUSSION
Here we show that the Pol II CTD can undergo length-dependent liquid-liquid phase separation in vitro, and that it controls Pol II clustering and mobility in vivo. Whereas CTD function is generally thought to depend on defined binary interactions of short CTD regions (1-3 repeats) with CTD-binding proteins, our results suggest that CTD function can additionally depend on weak heterotypic LCD-LCD interactions, and that these interactions may dominate Pol II localization in vivo. Although it was previously described that the CTD can interact with pre-formed LLPS droplets and hydrogels of FET proteins18,19, our results demonstrate for the first time that the CTD alone, in absence of other proteins, can undergo phase separation.
Our findings have implications for understanding Pol II transcription in eukaryotic cells and suggest a simple model for gene activation and the initiation-elongation transition during early transcription (Fig. 5). Unphosphorylated Pol II clusters in cells, mediated by CTD-CTD interactions. Pol II clusters may be recruited by transcriptional activators that bind to regulatory sites such as enhancers34. Transcriptional activators can also undergo LCD interactions35 and might nucleate Pol II clustering when Pol II concentration is subcritical. Formation of Pol II clusters near gene promoters may provide a reservoir of Pol II to achieve high initiation rates during activated transcription. When a Pol II enzyme is incorporated into a pre-initiation complex, its CTD gets phosphorylated by the TFIIH kinase CDK7. Phosphorylation removes this Pol II enzyme from the cluster and liberates it to escape the promoter and transition into active transcription elongation (Fig. 5). Finally, our results also provide a starting point for analyzing the chemical basis for CTD phase separation, its possible modulation by nucleic acids and protein factors, and its specific roles in transcription regulation and the coordination of the transcription cycle.
METHODS
Methods and any associated references are available in the online version of the paper.
Note: Any Supplementary Information and Source Data files are available in the online version of the paper.
AUTHOR CONTRIBUTIONS
M.B. designed experiments, generated constructs and prepared proteins unless otherwise noted. C. D-D designed experiments and established and characterized the RPB1 cell lines and performed and analyzed the in vivo FRAP data. M.R. designed experiments, performed all phase separation assays, DIC and fluorescence microscopy, in vitro FRAP measurements and data analysis. A.S.H. designed, performed and analyzed SPT experiments and helped with the in vivo FRAP analysis. H.M.N. designed, performed and analyzed 3D PALM experiments. G.D. designed and cloned the different RPB1 expression vectors. G.K. prepared human TFIIH kinase complex. T.Y. performed CD and NMR experiments. C.D.-D., X.D., P.C. and M.Z. designed and supervised research. C.D.- D., P.C., X.D. and M.Z. prepared the manuscript with input from all authors.
COMPETING FINANCIAL INTERESTS
The authors declare no competing financial interests.
ONLINE METHODS
Cloning and protein expression
A plasmid encoding the human Pol II CTD sequence (hCTD) (RPB1 residues 1593-1970) fused C-terminally to a sequence encoding H6-tagged maltose-binding protein (MBP) directly followed by a flexible linker (10xAsn) and a tobacco etch virus (TEV) protease cleavage site was a kind gift from Seychelle M. Vos (Max Planck Institute for Biophysical Chemistry, Göttingen). Derivative constructs, in which the hCTD sequence was replaced by the sequence coding for the S. cerevisiae Pol II CTD (yCTD) (RPB1 residues 1542-1733) or entirely removed, were generated using Gibson Assembly (New England Biolabs) and through deletion mutagenesis, respectively. All proteins contain a single cysteine residue C-terminal of the TEV protease cleavage site that allows for site-specific labeling. MBP-tagged proteins were overexpressed in E. coli BL21 (DE3) RIL cells (Stratagene) cultured in LB medium containing 50 mg/l kanamycin and 34 mg/l chloramphenicol. After reaching an OD600 of ~0.8, 0.5 mM IPTG was added and proteins were expressed for 3-4 h at 37 °C. Cells were harvested by centrifugation and resuspended in lysis buffer LB300 (20 mM HEPES pH 7.4, 300 mM NaCl, 30 mM imidazole, 10 % glycerol, 1 mM DTT, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine). The cell suspension was snap frozen and stored at −80 °C.
The sequence coding for the yCTD was additionally cloned into a pET27-derived plasmid, directly C-terminal to a Glutathione-S-Transferase (GST) tag followed by the TEV protease cleavage site. An N-terminal 6xHis-tag was introduced by site-directed mutagenesis. From the obtained plasmid, a second expression vector encoding only His6-GST-TEV was constructed through deletion mutagenesis. GST-tagged proteins were overexpressed in E. coli BL21 Rosetta 2(DE3)pLysS cells (Stratagene) grown in 2xYT medium containing 50 mg/l kanamycin and 34 mg/l chloramphenicol. After the culture reached an OD600 of 0.6-0.8, Isopropyl β-D-1-thiogalactopyranoside (IPTG) was added to a final concentration of 0.5 mM. GST-yCTD was overexpressed for 16 h at 18 °C, GST for 3 h at 18 °C. Cells were harvested by centrifugation, resuspended in lysis buffer LB150 (20 mM HEPES pH 7.4, 150 mM NaCl, 30 mM imidazole, 10 % glycerol, 1 mM DTT, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine), flash frozen in liquid nitrogen and stored at −80 °C.
Sequences encoding the full-length subunits of the human TFIIH kinase module (CDK7, MAT1 and cyclin H) were separately transferred into MacroBac 438B vectors38 and combined into a single construct by ligation independent cloning. All subunits contained a N-terminal 6xHis-tag followed by a TEV protease cleavage site. Insect cell expression was performed as described39.
Protein purification
All purification steps were performed at 4 °C. Frozen E. coli cell suspension was thawed, lysed by sonication, cleared from insoluble material by centrifugation (27,000 g, 45 min, 4 °C) and filtered through 0.8 μm syringe filters.
For the purification of MBP-tagged proteins, cleared E. coli lysate was loaded onto a 5 ml HisTrap HP column (GE healthcare) equilibrated in LB300. The HisTrap column was washed extensively using high salt buffer HSB1000 (20 mM HEPES pH 7.4, 1 M NaCl, 30 mM imidazole, 10 % glycerol, 1 mM DTT, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine) and equilibrated again in LB300. The column was then attached in-line to a LB300-equilibrated XK-16 column (GE healthcare), which was packed with amylose resin (New England Biolabs). Bound proteins were eluted directly onto the amylose column using Nickel elution buffer 300 (20 mM HEPES pH 7.4, 300 mM NaCl, 500 mM imidazole, 10 % glycerol, 1 mM DTT, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine). The HisTrap column was subsequently removed and the amylose column was washed again extensively with HSB1000 buffer. MBP-tagged proteins were eluted using amylose elution buffer (20 mM HEPES pH 7.4, 300 mM NaCl, 10 % glycerol, 1 mM DTT, 117 mM maltose, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine) and concentrated with a 30 kDa MWCO Amicon Ultra filter unit. The concentrate was then subjected to size-exclusion chromatography using a Sephadex 200 10/300 Increase column pre-equilibrated in SE300 buffer (20 mM HEPES 7.4, 300 mM NaCl, 10 % glycerol, 1 mM TCEP). Pure fractions, as assessed by SDS-PAGE and Coomassie staining, were pooled and concentrated using a 30 kDa MWCO Amicon Ultra centrifugal filter. The protein concentration was calculated based on the absorbance at 280 nm and the predicted molar extinction coefficient (DNAstar Lasergene Suite). Aliquots were frozen in liquid nitrogen and stored at −80 °C.
His6-GST-TEV-yCTD was purified following a similar scheme as described earlier 40 with the following modifications. The clarified extract was applied to a 5 ml HisTrap HP column equilibrated in lysis buffer LB150. The column was extensively washed using high salt buffer HSB800 (20 mM HEPES pH 7.4, 800 mM NaCl, 30 mM imidazole, 10 % glycerol, 1 mM DTT, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine) and equilibrated again in LB150. A pre-equilibrated 5 ml HiTrap Q HP column (GE healthcare) was attached in-line to the HisTrap column, which was subsequently eluted using a linear gradient from 0-100 % Nickel elution buffer 150 (20 mM HEPES pH 7.4, 150 mM NaCl, 500 mM imidazole, 10 % glycerol, 1 mM DTT, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine). The flow-through fractions were analyzed by SDS-PAGE and Coomassie staining, pooled and concentrated using a 30 kDa MWCO Amicon Ultra centrifugal filter unit (Merck). The sodium chloride concentration was adjusted to 50 mM and the protein was applied to a 1 ml HiTrap S column (GE healthcare). The flow-through was concentrated using a 30 kDa MWCO Amicon Ultra concentrator and then separated on an equilibrated Sephadex 200 10/300 Increase column (GE healthcare) with buffer SE300. Individual fractions were analyzed by SDS-PAGE and Coomassie staining, pure fractions were pooled and concentrated with a 30 kDa MWCO Amicon Ultra filter unit. E. coli extract from the H6-GST-TEV expression was applied to a 5 ml HisTrap HP column, washed with HSB800 and eluted with B150. The protein was concentrated using a 10 kDa MWCO Amicon filter unit and directly subjected to size-exclusion chromatography as described above. Concentrated protein solutions were aliquoted, flash-frozen in liquid nitrogen, and stored at −80 °C.
The recombinant S. cerevisiae TFIIH kinase module consisting of the three subunits Kin28, Ccl1 and Tfb3 was prepared as described 41. For the purification of the three-subunit human TFIIH kinase module (CDK7, cyclin H and Mat1), insect cells were lysed by sonication in lysis buffer (400 mM KCl, 20 mM K-HEPES pH 7.0, 400 mM KCl, 10% glycerol, 1 mM MgCl2, 10 μM ZnCl2, 5 mM β-mercaptoethanol, 30 mM imidazole pH 8, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine). Clarified cell lysate was applied onto a HisTrap HP 5 ml column (GE Healthcare), washed with 20 CV of lysis buffer and eluted with a linear gradient of 0-100% of elution buffer (20 mM K-HEPES pH 7, 400 mM KCl, 10% glycerol, 1 mM MgCl2, 10 μM ZnCl2, 5 mM β-mercaptoethanol, 500 mM imidazole pH 8, 0.284 μg/ml leupeptin, 1.37 μg/ml pepstatin A, 0.17 mg/ml PMSF, 0.33 mg/ml benzamidine) in 10 CV. Peak fractions were combined, supplemented with 2 mg of 6xHis-tagged TEV protease and dialyzed overnight against 2 L dialysis buffer (20 mM K-HEPES pH 7, 400 mM KCl, 10% glycerol, 1 mM MgCl2, 10 μM ZnCl2, 5 mM β-mercaptoethanol). The dialyzed solution was applied onto a HisTrap HP 5 ml column pre-equillibrated in dialysis buffer. The trimeric complex was eluted with 10% elution buffer and concentrated using an Amicon Milipore 15 ml 30 kDa MWCO centrifugal concentrator. The sample was applied to a Superdex 200 10/300 GL size exclusion column (GE healthcare) pre-equillibrated in storage buffer (20 mM K-HEPES pH 7, 350 mM KCl, 10% glycerol, 1 mM MgCl2, 10 μM ZnCl2, 5 mM β-mercaptoethanol). Peak fractions containing stoichiometric kinase trimer were pooled, concentrated using a Amicon Milipore 15 ml 30 kDa MWCO centrifugal concentrator to 130 μM, aliquoted, flash frozen in liquid nitrogen and stored at −80 °C.
The identity of all purified proteins was confirmed by LC-MS/MS analysis.
Pol II preparation and fluorescent labeling
Pol II was prepared from the S. cerevisiae strain BJ5464 as described 42 and treated with lambda phosphatase during purification. The Pol II subunit RPB3 contains an N-terminal biotin acceptor peptide, which can be biotinylated in vitro by the bacterial protein ligase BirA (4) and used for site-specific labeling with fluorescent streptavidin conjugates. For this, 200 μg Pol II were incubated with 6 μg BirA, 100 μM D(+)-biotin and 2 mM ATP for 2 h at 20 °C in Pol II buffer (10 mM HEPES pH 7.2, 200 mM KCl, 5 % glycerol, 2 mM DTT). Excess biotin was removed using a Micro Bio-Spin 6 column (Biorad) according to the manufacturer’s suggestions. A small fraction of biotinylated Pol II was bound to streptavidin-coupled Dynabeads M-280 (Thermo Fisher Scientific) to confirm quantitative biotinylation. The remaining biotinylated Pol II was reacted with Alexa 594-coupled streptavidin (Thermo Fisher Scientific, ~2x molar excess) for 20 min at 20 °C. Pol II was then separated from unbound streptavidin by size-exclusion chromatography using a Superose 6 10/300 column (GE healthcare) equilibrated in Pol II buffer. Pol II-containing fractions were pooled, concentrated (100 kDa MWCO Amicon Ultra spin filter unit), and flash-frozen aliquots were stored in the dark at −80 °C.
CTD phosphorylation
GST-yCTD was phosphorylated using the recombinant S. cerevisiae TFIIH kinase module. For this, 50 μM GST-yCTD were incubated with 0.4 μM kinase module and 3 mM ATP for 1 h at 30 °C in kinase reaction buffer (20 mM HEPES pH 7.4, 200 mM NaCl, 5 mM MgCl2, 10 % glycerol, 1 mM TCEP). Upon completion, the phosphorylation reaction was quenched by addition of EDTA to a final concentration of 10 mM. Phosphorylation of MBP-hCTD was performed using the recombinant human TFIIH kinase module. For this, MBP-hCTD (100 μM) was incubated with 2 μM kinase module in reaction buffer (20 mM HEPES pH 7.4, 260 mM NaCl, 20 mM MgCl2, 20 μM ZnCl2, 10 % glycerol, 2 mM TCEP). The reaction was started by addition of 8 mM ATP, incubated for 1 h at 30 °C and quenched by addition of 40 mM EDTA. Control reactions lacking either the kinase or ATP were conducted in both cases under identical conditions. After completion of GST-yCTD and MBP-hCTD phosphorylation experiments, all reactions were mixed with 20 % dextran at a ratio of 1:4 (vol/vol) and then analyzed microscopically (as described below). To study phosphorylation-induced dissolution of preformed CTD droplets, MBP-hCTD was mixed at a final concentration of 20 μM with 16 % dextran containing 1.6 mM ATP, 4 mM MgCl2, 20 μM ZnCl2, and 1 mM TCEP to induce phase-separation. Immediately before imaging, the reaction was started by addition of human TFIIH kinase module to a final concentration of 0.4 μM and immediately analyzed by microscopy.
Kinase activity assay
Kinase activity was analyzed by mobility shift assays. One microgram CTD fusion protein from kinase and control reactions was separated on 4-12% tris-glycine Protean TGX polyacrylamide gels (Biorad) and stained with Coomassie solution (InstantBlue, Expedeon). Phosphorylation of the CTD substrates by human and yeast TFIIH kinase modules results in a pronounced decrease of electrophoretic mobility. Phosphorylation of the CTD residue Ser5 was confirmed by immunoblotting. For this, samples (100 ng/lane) were separated on 4-12 % tris-glycine Protean TGX gels and blotted onto a PDVF membrane with a Trans-Blot Turbo Transfer System (Biorad). The membrane was blocked for 1-2 h at room temperature with 5 % (w/v) milk powder in phosphate-buffered saline containing 0.1 % Tween-20 (PBST). The blocked membrane was then incubated with either anti-MBP HRP conjugate (ab49923; Abcam) or anti-GST HRP conjugate (RPN1236; GE healtcare) for 2 h at room temperature. SuperSignal West Pico Chemiluminescent Substrate (Thermo Fisher) was used to develop the membrane before scanning with a ChemoCam Advanced Fluorescence imaging system (Intas Science Imaging). For immunoblot analysis of CTD phosphorylation, the membrane was subsequently stripped by incubation in stripping buffer (200 mM glycine-HCl pH 2.2, 0.1 % SDS, 1 % Tween-20), blocked with 5 % (w/v) milk powder in PBST, and probed overnight at 4 °C with primary CTD antibody against phosphorylated Ser5 (3E10; diluted 1:60 in 2.5 % (w/v) milk powder in PBST). The anti-Ser5 CTD antibody was a kind gift of Dirk Eick (Molecular Epigenetics Research Unit, Helmholtz Center, Munich). The membrane was then incubated with an anti-rat HRP-conjugate (A9037, Sigma-Aldrich) in 2.5 % milk-PBST for 1 h at room temperature and developed as describe above.
Disorder prediction
Recent cryo-EM analysis of mammalian RNA polymerase II could derive an atomic model only to RPB1 position P1487 43, indicating a high conformational flexibility of the following RPB1-linker and the C-terminal repeat domain. We thus used the VLXT predictor implemented in PONDR 44 to calculate the disorder propensity for the human RPB1 residues 1488-1970.
CD spectroscopy
Far-UV CD measurements were performed on a Chirascan spectrometer (Applied Photophysics, Ltd) at 25 °C using a 0.2 mm path length cuvette. The concentration of MBP-hCTD and MBP-yCTD was 5 μM in 20 mM NaPi, pH 7.4. CD spectra were recorded from 180 to 280 nm with an integration time of 0.5 s, and experiments were repeated three times. The spectra of human and yeast CTD were obtained through subtraction of the spectrum of MBP and correction of the baseline using buffer. Data are expressed in terms of the mean residual ellipticity (θ) in [deg/(cm2 dmol)].
NMR spectroscopy
Peptides comprising one (1R-CTD; 7 residues), two (2R-CTD; 14 residues) and three (3R-CTD; 21 residues) YSPTSPS-repeats were synthesized by GenScript with acetyl- and amide protection groups at the N- and C-termini, respectively. NMR spectra were recorded at 5 °C on Bruker 600 and 700 MHz spectrometers with triple-resonance cryogenic probes for 1.0 mM 3R-CTD peptide (20 mM HEPES, pH 7.4, 200 mM NaCl, and 90% H2O/10% D2O) and 0.5 mM phosphorylated 3R-CTD peptide. For phosphorylation, 3R-CTD was incubated with 2 μM TFIIH kinase at 37 °C during 18h (20 mM HEPES, pH 7.4, 200 mM NaCl, 3 mM MgCl2, 3 mM ATP, and 90% H2O/10% D2O). Spectra were processed with the software TopSpin (Bruker) and analyzed using CCPN Analysis 45. Sequence-specific backbone and side-chain resonance assignments of nonphosphorylated and phosphorylated 3R-CTD peptide were achieved through 1H-15N heteronuclear single quantum coherence (HSQC) and 1H-13C HSQC experiments at natural abundance, together with 2D 1H-1H TOCSY (100 ms mixing time) and 2D 1H-1H NOESY (200 and 300 ms mixing time) experiments. Resonance assignments of 3R-CTD were further validated through comparison with NMR spectra recorded for 1R-CTD and 2R-CTD peptides.
Differential Interference Contrast (DIC) and fluorescence microscopy
Droplet formation of protein samples was monitored by DIC and fluorescence microscopy. Samples were fluorescently labeled using Alexa Fluor 488 Microscale Protein Labeling Kit (Thermo Fisher Scientific, #A30006) according to manufacturer’s instructions. Small amounts (<0.5 μM) of labeled protein, which are not sufficient to induce droplet formation by itself, were mixed with unlabeled protein to the final concentration indicated in the text. In experiments with Ficoll PM 400 (Sigma, #F4375) at the final concentration of 150 mg/ml, buffer containing 20 mM HEPES, 200 mM NaCl, pH 7.4 was used. In experiments using dextran T500 (Pharmacosmos) as a crowding agent, it was added to reach the indicated final concentrations in 20 mM HEPES, 220 mM NaCl, pH 7.4. 5-10 μl of samples were loaded onto glass slides, covered with ø 18 mm coverslips and sealed. DIC and fluorescent images were acquired on a Leica DM6000B microscope with a 63x objective (water immersion) and processed using the FIJI software (NIH). In experiments requiring MBP-tag removal, fusion proteins were incubated with TEV protease in molar ratio TEV:protein=1:25 for 3 h at 25 °C. Complete tag removal was confirmed by Coomassie staining of the digested samples. Phase separation of yeast and human CTD without a tag was detected by DIC and fluorescent microscopy.
In experiments with aliphatic alcohols, the MBP tag was cleaved off from MBP-yCTD as indicated above, followed by addition of the protein to a premix containing dextran (final concentration 16%) and 1,6-hexanediol (Sigma, #240117) or 2,5-hexanediol (Sigma, #H11904). The final yCTD concentration in the sample was 50 μM and hexanediol concentrations varied from 2.5 to 10%. Samples were imaged by DIC microscopy as indicated above.
All experiments with droplet formation were performed at room temperature except when the influence of temperature was tested. In the later case, samples containing 20 μM MBP-hCTD in 20 mM HEPES, 220 mM NaCl, pH 7.4 with 16% dextran were incubated for 1.5 h on ice (4 °C), room temperature (22 °C) or in an incubator at 37 °C before microscopy analysis. The influence of ionic strength was tested with 10 μM MBP-hCTD and indicated NaCl concentrations in 20 mM HEPES, pH 7.4 containing 16% dextran.
Co-recruitment experiments
For CTD co-recruitment experiments, droplets were made with 20 μM MBP-hCTD or GST-yCTD in 20 mM HEPES, 220 mM NaCl, pH 7.4 containing 16% dextran. Droplets were visualized through addition of 0.6 μM of tetramethylrhodamin (TMR)-labeled peptide with the sequence YSPTSPS, i.e. corresponding to one consensus heptad repeat. Subsequently, small amounts (< 0.5 μM) of Alexa 488 labeled GST-yCTD or MBP-hCTD were added to MBP-hCTD or GST-yCTD pre-formed droplets, respectively. Co-recruitment was confirmed by imaging on Leica DM6000B microscope as described above using DIC in combination with red and green channels for fluorescence (GFP and N3 filter cubes).
For Pol II co-recruitment experiments, Alexa 594-labeled Pol II (final concentration 0.02 μM) was mixed with pre-formed droplets of 25 μM GST-yCTD in 20 mM HEPES, 220 mM NaCl, pH 7.4 containing 16% dextran that were visualized by addition of Alexa 488 labeled GST-yCTD (final concentration of 2.3 μM). Co-recruitment was documented by DIC and fluorescent microscopy using red and green channel (GFP and N3 filter cubes) on Leica DM6000B microscope as described.
In vitro FRAP experiments
The dynamics of human and yeast CTD molecules in the phase-separated state were investigated by fluorescence recovery after photobleaching (FRAP). MBP-tagged human and yeast CTD proteins were first labeled on a single Cys residue that is present C-terminal to the TEV protease cleavage site (see above) using Alexa Fluor 488 C5 Maleimide dye (Thermo Fisher Scientific, #A10254) according to manufacturer’s recommendations. Briefly, proteins were incubated in a light protected Eppendorf tube with a dye freshly dissolved in DMSO in a molar ratio 1:15=protein:dye in 20 mM HEPES pH 7.4, 300 mM NaCl, 1 mM TCEP, 10% glycerol for 3 h at room temperature. Excess label and salt were removed by desalting samples twice with 0.5 mL 7000 MWKO Zeba spin desalting columns (Thermo Fisher Scientific, #89882). MBP tag was then cleaved from both labeled and unlabeled human and yeast CTD using TEV protease as indicated above. Droplets for FRAP measurements were made in 16% dextran T500 in 20 mM HEPES, 220 mM NaCl, pH 7.4 by adding mixtures of labeled and unlabeled yCTD (or hCTD) in a molar ratio of 1:100 to the final CTD concentration of 20 μM. To minimize droplet movement, FRAP recordings were done after approx. 30 minutes, which is the time required for freshly formed droplets to settle down on the glass slide and become less mobile.
FRAP experiments were recorded on a Leica TCS SP8 confocal microscope using 63x objective (water immersion) at a zoom corresponding to pixel size 96 nm x 96 nm and using the 488 argon laser line. A circular region of approx. 1.4 μm in diameter was chosen in a region of homogenous fluorescence away from the droplet boundary and bleached with 10 iterations of full laser power. Recovery was imaged at low laser intensity (0.057%). 50 frames were recorded with one frame per 330 ms. Pictures were analyzed in FIJI (NIH) and FRAP recovery curves were calculated using standard methods. For calculating half time recoveries, normalized values from each recording were separately fit to a single exponential model and half time recoveries were presented as mean ± standard error.
Cell line establishment and characterization
Human U2OS osteosarcoma cells (Research Resource Identifier: RRID:CVCL_0042) were grown in a Sanyo copper alloy IncuSafe humidified incubator (MCO-18AIC(UV)) at 37°C/5.5% CO2 in low glucose DMEM with 10% FBS (full recipe: 500 mL DMEM (ThermoFisher #10567014), 50 mL fetal bovine serum (HyClone FBS SH30910.03 lot #AXJ47554) and 5 mL Penicillin-streptomycin (ThermoFisher #15140122)) and were passaged every 2-4 days before reaching confluency. Plasmids expressing N-terminally tagged (either Dendra2 or Halo) α-amanitin resistant mutated (N792D) human RPB1 were stably transfected into U2OS cells using Fugene 6 following manufacturer’s instruction (Promega #E2692). The RPB1-52R vectors encode the 52 CTD repeats originally present in the endogenous RPB1 cDNA. The RPB1-25R expressing vectors contain only 25 repeats out of the 52 repeats corresponding to repeats 1 to 21 and repeats 49 to 52. The RPB1-70R cell lines are expressing either a Dendra2-RPB1 protein containing 66 repeats in its CTD (repeats 1 to 51, then repeats 38 to 52) or an Halo-RPB1 protein containing 70 repeats in its CTD (repeats 1 to 47, then repeats 42 to 47 then repeats 38 to 52) as assessed by sequencing of the RPB1 mRNA respectively expressed in these cells. Details of cloning strategies are available upon request. α-amanitin (SIGMA #A2263) was used during the stable selection process at a concentration of 2 μg/mL and was used thereafter in permanence in the culture of the cells at a concentration of 1 μg/mL to avoid endogenous RPB1 re-expression as described in 3. Even though these lines cannot genotypically be considered as endogenously tagged (the wild-type endogenous RPB1 gene is still there, a cDNA expressing the tagged version of RPB1 is incorporated in the genome), phenotypically they can as the expression of endogenous RPB1 protein is replaced by the tagged version of the protein at all time.
RT-PCR analysis (Superscript III with oligo (dT)20, Invitrogen (#18080093) and NEB Phusion® High-Fidelity DNA Polymerase (#M0530S) followed by sequencing was performed to confirm the sequence of the RPB1-CTD expressed in the various cell lines (more details of all the molecular biology characterizations available upon request).
Western Blot
Cells were collected after ice-cold PBS wash by scraping into 0.5 mL/10 cm plate of high-salt lysis buffer (0.5 M NaCl, 50 mM HEPES, 5 mM EDTA, 0.5% NP-40 and protease inhibitors), with 125 U/mL of benzonase (Novagen, EMD Millipore), passed through a 25G needle, rocked at 4°C for 30 min, centrifuged at maximum speed at 4°C for 20 min. Supernatants were quantified by Bradford method. Same amount of proteins were loaded onto 7% Bis-Tris SDS-PAGE gel, transferred onto nitrocellulose membrane (Amershan Protran 0.45 um NC, GE Healthcare) for 2 hr at 80V, blocked in TBS-Tween with 5% milk for at least 1 h at room temperature and blotted overnight at 4°C with primary antibodies (anti-Pol II (N20) from SantaCruz #sc-899, anti-LaminA from Abcam #ab26300) in TBS-T with 5% milk. HRP-conjugated secondary antibodies were diluted 1:5000 in TBS-T with 5% milk and incubated at room temperature for one hour.
FACS analysis
Expression of the exogenous RPB1 protein was assessed by flow cytometry analysis on live cells on a BD LSRFortessa, performed according to the manufacturers’ protocols. For the Halo-tagged line, Halo-TMR labelling (500 nM) was performed for 30mn at 37°C before harvesting the cells.
Cell imaging conditions
For live-cell imaging, the medium was identical except DMEM without phenol red was used (ThermoFisher #31053028). U2OS cells expressing α-amanitin-resistant Halo-RPB1-25R, Halo-RPB1-52R or Halo-RPB1-70R were grown overnight with α-amanitin on plasma-cleaned 25 mm circular no 1.5H cover glasses (Marienfeld High-Precision 0117650). Prior to all experiments, the cover glasses were plasma-cleaned and then stored in isopropanol until use. For live-cell FRAP experiment, cell preparation was identical except cells where grown on glass-bottom (thickness #1.5) 35 mm dishes (MatTek P35G-1.5-14-C).
PALM imaging
Six movies of ~ 50,000 frames were acquired for each condition at 30 ms/frame. The axial drift was corrected in real time with a perfect focus system. A cylindrical lens was added to the system to induce astigmatism in the point-spread function (PSF) of the optical setup. 300,000 detections were collected on average per movie. Single molecule detection and localization was performed with a modified version of the multiple-target tracking algorithm. The 3D position of single detection was inferred from the lateral elongation of the PSF. The lateral drift of the sample was corrected by using fluorescent beads (TetraSpeck microspheres). To correct for blinking of the Dendra2 fluorophore, detections in a disk of 30 nm and adjacent in time were grouped and averaged.
Nuclei and nucleoli were automatically detected and segmented for further processing. N(r) is the estimate of the expected number of neighbors within a distance r of a given point of the sample: where P is the set of all detections and Np the total number of detections. The f function 46 47 corrects, for biases generated by points located at short distances to the borders (nucleus or nucleoli): where d(i,j) is the distance between i and j and Cin the length of the part of the circle of radius d(i,j) centered on i which is inside the area of study, the nucleus.
The null hypothesis, complete spatial randomness (CSR) is a homogeneous Poisson process with intensity λ, equal to the density of detection in the area of study
We estimated four spatial statistics based on N(r): n(r), K(r), L(r) and G(r) 47,48. The local neighbor density function, is defined as n(r) = N(r) /πr2. The K-Ripley function is defined as K(r)=N(r)/λ. The linearized K-Ripley function is given by The pair density function G(r) is simply the derivative of the K(r).
Under CSR, the expected value taken by n(r) (resp. K(r), L(r) and G(r)) is λ (resp. πr2, 0 and 1). The triangulation of the areas was performed with a custom python script and we used the ADS R package 49 to estimate the four spatial statistics. In order to estimate the standard deviation and standard error associated with these measurements we performed a bootstrapping analysis of the data set. We randomly selected 10,000 detections from each original data set a 100 times and feed these sub sampled data set to the R script computing the spatial statistics.
Single-molecule imaging (spaSPT)
After overnight growth, cells were labeled with 50 nM PA-JF549 50 for ~15-30 min and washed twice (one wash: medium removed; PBS wash; replenished with fresh medium). At the end of the final wash, the medium was changed to phenol red-free medium keeping all other aspects of the medium the same (and adding back α-amanitin). Single-molecule imaging was performed on a custom-built Nikon TI microscope equipped with a 100x/NA 1.49 oil-immersion TIRF objective (Nikon apochromat CFI Apo TIRF 100x Oil), EM-CCD camera (Andor iXon Ultra 897; frame-transfer mode; vertical shift speed: 0.9 μs; −70°C), a perfect focusing system to correct for axial drift and motorized laser illumination (Ti-TIRF, Nikon), which allows an incident angle adjustment to achieve highly inclined and laminated optical sheet illumination 51. An incubation chamber maintained a humidified 37°C atmosphere with 5% CO2 and the objective was also heated to 37°C. Excitation was achieved using a 561 nm (1 W, Genesis Coherent) laser for PA-JF549. The excitation laser was modulated by an acousto-optic tunable filter (AA Opto-Electronic, AOTFnC-VIS-TN) and triggered with the camera TTL exposure output signal. The laser light was coupled into the microscope by an optical fiber and then reflected using a multi-band dichroic (405 nm/488 nm/561 nm/633 nm quad-band, Semrock) and then focused in the back focal plane of the objective. Fluorescence emission light was filtered using a single band-pass filter placed in front of the camera using the following filters: Semrock 593/40 nm bandpass filter. The microscope, cameras, and hardware were controlled through NIS-Elements software (Nikon).
We recorded single-molecule tracking movies using our previously developed technique, stroboscopic photo-activation Single-Particle Tracking (spaSPT) 28,30. Briefly, 1 ms 561 nm excitation (100% AOTF) of PA-JF549 was delivered at the beginning of the frame to minimize motion-blurring; 405 nm photo-activation pulses were delivered during the camera integration time (~447 μs) to minimize background and their intensity optimized to achieve a mean density of ~1 molecule per frame per nucleus. 30,000 frames were recorded per cell per experiment. The camera exposure time was 7 ms resulting in a frame rate of approximately 134 Hz (7 ms + ~447 μs per frame).
spaSPT data was analyzed (localization and tracking) and converted into trajectories using a custom-written Matlab implementation of the MTT-algorithm 52 and the following settings: Localization error: 10-6.25; deflation loops: 0; Blinking (frames): 1; max competitors: 3; max D (μm2/s): 20.
We recorded ~9-10 cells per replicate and performed three independent replicates on three different days. Specifically, across three replicates we imaged 29 cells for 25R Halo-RPB1 and obtained 448,362 trajectories with 690,682 unique displacements at a mean density of 1.2 localizations per frame. Similarly, we imaged 30 cells for 52R Halo-RPB1 and obtained 324,928 trajectories with 619,247 unique displacements at a mean density of 1.1 localizations per frame. Finally, we imaged 26 cells for 70R Halo-RPB1 and obtained 333,720 trajectories with 571,345 unique displacements at a mean density of 1.0 localizations per frame.
Model-based analysis of single-molecule tracking data using Spot-On
To analyze the spaSPT data, we used our previously described kinetic modeling approach (Spot-On) 28,30. Briefly, we analyze each replicate separately and the reported bound fractions and free diffusion coefficients are reported as the mean +/- standard deviation from analyzing each replicate separately. We merged the data from all cells (~9-10) for each replicate, compiled histograms of displacements and then fit the displacement cumulative distribution functions for 7 time points using a two-state model that assumes that Pol II can either exist in an immobile (e.g. chromatin-associated) or freely diffusive state:
Where and
Here, FBOUND is the fraction of molecules that are bound to chromatin, DBOUND is the diffusion coefficient of chromatin bound molecules, DFREE is the diffusion coefficient of freely diffusing molecules, r is the displacement length, ∆τ is lag time between frames, ∆z is axial detection range, σ is localization error (35 nm) and ZCORR corrects for defocalization bias (i.e. the fact that freely diffusion molecules gradually move out-of-focus, but chromatin bound molecules do not). Model fitting and parameter optimization was performed using a non-linear least squares algorithm (Levenberg-Marquardt) implemented in the Matlab version of Spot-On (v1.0; GitLab tag 92cdf210) and the following parameters: dZ=0.7 μm; GapsAllowed=1; TimePoints: 8; JumpsToConsider=4; ModelFit=2; NumberOfStates=2; FitLocError=0; D_Free_2State=[0.4;25]; D_Bound_2State=[0.00001;0.05];
Diffusion coefficient calculations
The observed free diffusion coefficients obtained from fitting the spaSPT data with the Spot-On model (Brownian motion) were 3.74 +/- 0.178 μm2/s, 2.97 +/- 0.0912 μm2/s and 2.34 +/- 0.049 μm2/s for the 25R, 52R and 70R versions of Halo-Rpb1, respectively (mean +/- standard error). Given that the molecular weight of e.g. 25R is lower, one would expect the diffusion coefficient to be higher. To estimate whether this large difference could be explained by size alone or whether it might be due to reduced multivalent interactions, we consider the Stokes-Einstein relation according to which the diffusion coefficient is given by: where kB is Boltzmann’s constant, T is the absolute temperature, η is the viscosity of the liquid (the nucleoplasm here; assumed to be the same for both 25R, 52R and 70R) and r is the radius. The Stokes-Einstein equation assumes the particle to be a sphere and accordingly the radius is given by the volume, V:
In turn, the volume is related to the mass, m, and density, p: where NA is Avogadro’s constant and MW is the molecular weight in atomic mass units (Daltons). Thus, the diffusion coefficient is related to the molecular weight by: Thus using 25R and 52R as the example, the ratio between the diffusion coefficients of 25R and 52R Halo-Rpb1 (assuming that the density is the same) is:
According to UniProt (P24928) the molecular weight of wild-type Rpb1 is 217.2 kDa (52R). The molecular weight of the HaloTag is 33.6 kDa. Thus, the molecular weight of Halo-Rpb1-52R is ~250.8 kDa, the molecular weight of Halo-Rpb1-25R is ~230.9 kD and the molecular weight of Halo-Rpb1-70R is ~258.1 kD. Thus, the expected difference in diffusion coefficients is:
If we compare this to the experimentally observed ratio:
It becomes clear that size/mass difference alone cannot explain the large difference in diffusion coefficient that we observe in cells. To be comprehensive, below we list the Stokes-Einstein expected and the observed diffusion coefficient ratios for all the combinations: For all three combinations, the observed ratio cannot be explained by the change in size/mass. Instead this indicates a higher propensity of the full-length CTD to engage in intermolecular interactions. Moreover, in the above calculations we have just considered the change in the mass of Rpb1. In reality, Rpb1 is likely diffusing as part of the Pol II holocomplex, thus the relative difference due to the smaller CTD (e.g. ~20 kDa between 25R and 52R) is actually much smaller than the calculations using only Rpb1 would suggest and thus the expected difference in diffusion coefficients due to mass/size would be even much closer to 1. We conclude that the mass/size difference between the 25R, 52R and 70R Pol II enzymes cannot explain their observed differences in diffusion coefficients.
FRAP in cells
FRAP experiments were performed and analyzed as previously described 30. Briefly, FRAP was performed on an inverted Zeiss LSM 710 AxioObserver confocal microscope equipped with a motorized stage, a full incubation chamber maintaining 37°C/5% CO2, a heated stage, an X-Cite 120 illumination source as well as several laser lines. Halo-TMR was excited using a 561 nm laser. Images were acquired on a 40x Plan NeoFluar NA1.3 oil-immersion objective at a zoom corresponding to a 100 nm x 100 nm pixel size and the microscope controlled using the Zeiss Zen software. In FRAP experiments, 300 frames were acquired at either 1 frame per second allowing 20 frames to be acquired before the bleach pulse to accurately estimate baseline fluorescence. A circular bleach spot (r = 10 pixels) was chosen in a region of homogenous fluorescence at a position at least 1 μm from nuclear and nucleolar boundaries. The spot was bleached using maximal 561 nm laser intensity and pixel dwell time corresponding to a total bleach time of ~1 s. We generally collected data from 5 cells per cell line per condition per day and all presented data is from at least 3 independent replicates on different days.
To quantify and drift-correct the FRAP movies, we used a previously described custom-written pipeline in MATLAB 30. Briefly, we manually identify the bleach spot. The nucleus is automatically identified by thresholding images after Gaussian smoothing and hole-filling (to avoid the bleach spot as being identified as not belonging to the nucleus). We use an exponentially decaying (from 100% to ~85% (measured) of initial over one movie) threshold to account for whole-nucleus photobleaching during the time-lapse acquisition. Next, we quantify the bleach spot signal as the mean intensity of a slightly smaller circle (r = 0.6 μm), which is more robust to lateral drift. The FRAP signal is corrected for photobleaching using the measured reduction in total nuclear fluorescence (~15% over 300 frames at the low laser intensity used after bleaching) and internally normalized to its mean value during the 20 frames before bleaching. We correct for drift by manually updating a drift vector quantifying cell movement during the experiment. Finally, drift- and photobleaching corrected FRAP curves from each single cell were averaged to generate a mean FRAP recovery. We used the mean FRAP recovery in all figures and error bars show the standard error of the mean.
DATA AVAILABILITY STATEMENT
The in vitro datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request. The PALM and SPT data are available at http://doi.org/10.5281/zenodo.1188488. The raw spaSPT data is available in Spot-On readable CSV format in the form of single-molecule trajectories. The Spot-On Matlab code is available together with a step-by-step guide at Gitlab: https://gitlab.com/tjian-darzacq-lab/spot-on-matlab. For additional documentation, please see also the Spot-On website https://SpotOn.berkeley.edu and previous publications 28,30.
ACKNOWLEDGMENTS
We thank Seychelle M. Vos for the MBP-hCTD expression plasmid and advice on purification, Sandra Schilbach for the yeast TFIIH kinase module as well as Andrea Boltendahl and Carina Burzinski for help with cloning and purification. We are grateful to Susmitha Ambadipudi for help with initial microscopy measurements and discussions about LLPS. We thank Luke Lavis for generously providing JF dyes. We thank Anatalia Robles, Carla Inouye, Shuang Zheng, Mallory Haggart and Janeen Lim for technical and administrative assistance. In vivo FRAP experiments were conducted at the CRL Molecular Imaging Center, supported by Gordon and Betty Moore Foundation. We thank all current members of the Cramer, Zweckstetter and Tjian-Darzacq labs for discussions and and Robert Tjian for critical reading of the manuscript. M.Z. was supported by the Deutsche Forschungsgemeinschaft (SFB860; project B02), by the Cluster of Excellence and DFG Research Center Nanoscale Microscopy and Molecular Physiology of the Brain and the advanced grant ‘LLPS-NMR’ of the European Research Council. P.C. was supported by the Deutsche Forschungsgemeinschaft (SFB860; project A13), the advanced grant ‘TRANSREGULON’ of the European Research Council, and the Volkswagen Foundation. X.D. was supported by NIH grant UO1-EB021236 and the California Institute of Regenerative Medicine grant LA1-08013.