Abstract
Evolutionary conservation and lineage-specific diversification of existing proteins forms the basis for evolving complexity of protein-protein interaction networks and presumably, thereby that of organism. GAGA associated factor (GAF) belongs to BTB/POZ and zinc finger family of transcription factors and it is conserved from flies to humans. Emerging evidence shows indispensable roles of vertebrate GAF (vGAF a.k.a. ThPOK) in functionally divergent developmental processes like hematopoiesis, adipogenesis and lactation. vGAF is a sequence-specific DNA binding transcription factor with multiple context-dependent roles in gene activation/repression, enhancer-blocking and more recently, it has shown to be the part of ribonucleoprotein complexes as well. In order to understand the molecular basis of these diverse functions, we analyzed the protein-protein interactome of vGAF. This analysis shows vGAF’s association with chromatin remodelers, RNA metabolic machinery, transcriptional activators/repressors, and components of DNA repair machinery, thereby provides a plausible explanation for the diverse molecular functions of vGAF. Our findings discern novel role of vGAF in several molecular processes like DNA repair and RNA metabolism. We further tested the biological significance of our protein-protein interaction data and show a novel function of vGAF in DNA repair and cell-survival after UV induced DNA damage. Consistent with these results, analysis of high-throughput RNA-seq data shows the downregulation of vGAF in samples of skin cutaneous melanoma for which the primary cause is UV induced DNA damage. These findings suggest vGAF as an early diagnostic biomarker for skin cutaneous melanoma. Taken together, our study reveals a molecular basis for the diverse functions of vGAF. We uncover its role in DNA repair and provide an explanation for key roles of such evolutionary conserved factors in processes like development and disease.
- Abbreviations
- ACF
- ATP-utilizing chromatin assembly and remodeling factor)
- ADCR
- ATP dependent chromatin remodelers
- AS
- Alternative Splicing
- BER
- Base excision repair
- BTB/POZ
- Broad-Complex, Tramtrack and Bric a brac/POxvirus and Zinc finger domain
- DmGAF
- Drosophila melanogaster GAF
- GAF
- GAGA associated Factor
- GO
- Gene ontology
- GOBP
- Gene Ontology biological process
- GOCC
- Gene ontology cellular compartment
- GOMF
- Gene ontology molecular function
- ISWI
- Imitation SWI
- lincRNA
- Long intergenic non-coding RNA
- MTT
- 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide
- NER
- Nucleotide excision repair
- NHEJ
- Non-homologous end joining
- NST
- Normal skin tissue
- NuRD
- Nucleosome remodeling and deacetylase
- SCM
- Skin cutaneous melanoma
- SWI/SNF
- SWItch/Sucrose Non-Fermentable
- ThPOK
- T helper POK
- vGAF
- Vertebrate GAF
1. Introduction
GAGA associated factor (GAF) of Drosophila melanogaster, DmGAF, is a developmentally important transcription factor that has been implicated in diverse nuclear processes like gene activation, polycomb-mediated silencing, enhancer-blocking, position effect variegation and chromosomal segregation [1-6]. Vertebrate GAF (vGAF), the evolutionarily conserved counterpart of DmGAF in vertebrates, is encoded by zbtb7b gene [7]. ZBTB7B/vGAF/ThPOK has been shown to be essential for the proper development of the hematopoietic and adipose tissues [8-10]. A recent study also points out the role of vGAF in the onset of lactation and in the production of milk lipids [11]. The molecular functions of vGAF that translate into these biological processes are diverse in nature. vGAF (ZBTB7B/ThPOK) was first identified as a tissue-specific activator of collagen genes in mouse dermis [12,13]. Several other studies also point the role of vGAF in transcriptional activation of genes involved in cytokine signaling like Socs1, Cish and TNF-alpha [14,15]. Paradoxically, multiple other studies report that vGAF has a rather repressive effect on the transcription of another set of genes like UDP glucose dehydrogenase, eomesodermin, and cd8 [16-18]. These are not, however, the only nuclear processes where vGAF has been shown to be involved. vGAF has been reported to bind at the enhancer-blocker elements in the murine Hox clusters and is essential for tissue-specific nuclear-lamina proximal positioning of IgH and cyp3a loci, suggesting an architectural role of vGAF in chromatin organization [19,20]. Moreover, a recent study has shown a developmentally important association of vGAF with ribonucleoprotein complexes containing a lincRNA (long inter-genic non-coding RNA) [10]. The remarkable functional versatility of vGAF, at least in part, may stem from the structural domains of the protein [21]. The zinc-finger domains present at C-terminal of vGAF allow sequence-specific tethering of vGAF molecules at numerous loci in the genome, while, the N-terminal BTB/POZ domain being a protein-protein interaction domain endows GAF with an ability to recruit protein-complexes of diverse functionalities at these loci. Several pieces of evidence suggest the importance of protein-protein interactions in the multifunctional roles of GAF proteins [16,20,22-24]. We hypothesize that like DmGAF, the functional versatility of vGAF depends on its interacting partners, which themselves are functionally diverse.
In the present study, we sought to decipher a comprehensive protein-interaction network of vGAF. We used mass spectrometry and identified 585 proteins that co-immunoprecipitate with vGAF. These proteins belong to diverse functional classes, viz., chromatin remodelers, transcriptional activators/repressors, RNA processing factors and components of DNA repair machinery. We show that vGAF co-localizes to the site of DNA damage and further establish an important role of vGAF in DNA repair and cell-survival after UV induced DNA damage. The diversity of the vGAF-associated protein complexes observed in this study explains the molecular basis for the diverse functions of this protein, and also implicate its function in multiple other nuclear processes.
2. Material and Methods
2.1 Plasmids and antibodies
ThPOK cDNA (IMAGE ID-6309645) was obtained from Open-Biosystems (Dharmacon). The N-terminal 3X-FLAG-tagged ThPOK expression plasmid was generated by cloning the ThPOK coding DNA sequence (CDS) in the backbone of pEGFPC-3 (Clontech) plasmid after replacing the CDS of EGFP with 3X-FLAG. CDS of MBD3, Cbx5, HDAC1, p65, and RBM14 were amplified from C2C12 myoblast cell cDNA using Q5® High-Fidelity 2X Master Mix. Expression plasmids of EGFP-tagged MBD3, Cbx5, HDAC1, p65, and RBM14 were generated by cloning the respective CDS of proteins into the pEGFPC-1 plasmid (See Sup. Table 3 for primer and restriction site details).
For western blotting, anti-FLAG M2 mouse monoclonal antibody was procured from Sigma and anti-GFP (ab290) rabbit polyclonal was from Abcam. p-Histone H2A.X (Ser-139)/γ-H2A.X (SC-10196) was obtained from Santacruz. Secondary antibodies anti-Mouse HRP (ab6820) and anti-Rabbit HRP (ab6802) were obtained from Abcam. For immuno-staining, p-Histone γ-H2A.X (Ser-139) (clone JBW301, cat. 05-363) was from Millipore while anti-FLAG rabbit polyclonal (F7425) antibody was procured from Sigma. Fluorochrome-conjugated secondary antibodies anti-Rabbit IgG (H+L) Alexa-flour 647 (cat. 711-605-152) and anti-Mouse IgG (H+L) Cy3 (cat. 115-165-062) were obtained from Jacksons laboratories.
2.2 RNA isolation and cDNA preparation
Total RNA was prepared from C2C12 myoblast cells using TRIzol (Ambion, Invitrogen) following the manufacturer’s instructions. One microgram of total RNA was used for preparing cDNA using PrimeScript™ 1st strand cDNA Synthesis Kit (Clontech) following the manufacturer’s protocol.
2.3 Cell culture and transfection
C2C12 myoblast cells were maintained in Dulbecco‘s modified Eagle‘s medium (DMEM) supplemented with 20% fetal bovine serum (FBS) and 1X GlutaMax (Invitrogen). HEK293 cells were maintained in DMEM supplemented with 10% FBS. C2C12/HEK293 cells were grown in 100mm tissues culture dishes and were transfected with 12ug plasmid DNA using Lipofectamine-LTX (C2C12 cells) or Lipofectamine-3000 (HEK293) following the manufacture’s instruction. Primary fibroblast cell cultures were established using depilated skin and lung tissue of vGAF knockout mice (The Jacksons laboratory: Stock No:027663) and wild type sex-matched littermates using a previously published protocol [8,25]. All cell cultures were maintained at 37° C with 5% CO2 in a humidified chamber.
2.4 Immuno-precipitation, protein-extraction and western blotting
C2C12 myoblast/HEK293 cells were harvested 36 hours post transfection and washed twice with ice-cold PBS. Cells were resuspended in cell-lysis buffer (50mM Tris pH 7.4, 150mM NaCl, 0.5mM MgCl2, 1% Triton-X 100, 10% glycerol, 1X protease inhibitor cocktail (Roche), 1mM PMSF, 270U/ml DNAase-I (sigma)) and incubated on ice for 45 minutes with intermittent mixing. The cell debris were removed by spinning the cell lysate at 14000rpm/4 °C for 10 min. ANTI-FLAG M2 (Sigma) mouse monoclonal or anti-GFP rabbit polyclonal (ab290) antibody was incubated with Protein-G/A magnetic beads (Dynabeads, Invitrogen) in PBST for 2hrs at 4 °C. Subsequently, Antibody-Protein-G/A magnetic bead complex was incubated with total cell lysate at 4 °C for overnight. The immune-complexes captured on beads were washed thrice with cell-lysis buffer (without DNAase-I) and eluted in the laemmli buffer.
Protein extracts were prepared from skin explants or primary cells. Skin explants were pulverized in liquid nitrogen. Pulverized skin or cells were resuspended in protein-extraction buffer (50mM Tris pH 7.4, 150mM NaCl, 1% Triton-X 100, 1% SDS, 10% glycerol, 10mM DTT, 1X protease inhibitor cocktail (Roche), 1mM PMSF) and incubated on ice for 45 minutes with intermittent mixing. The cell and tissue debris were removed by spinning the extracts at 14000rpm/4° C for 10 min. Protein samples were resolved on SDS-PAGE and transferred onto PVDF membrane followed by antibody-mediated detection using enhanced chemiluminescence method.
2.5 UV treatment
UV treatments were carried out using CL-1000 UV cross-linker (UVP). Exponentially growing cultures of primary cells were exposed to UVC (254nm) at a dose of 5J/m2 (lung cells) or 30J/m2 (skin cells). C2C12 cells were grown on glass coverslips and 36 hours after the transfection were UVC treated at a dose of 5J/m2. Depilated skin from vGAF knockout and wild type mice were cut into 1mm2 size and were kept with their dorsal side up in 6 well tissue culture plate. Skin explants were exposed to UVC at a dose of 60J/m2. Prior to protein extract preparation, tissues and cells were allowed to recover for 3hrs in DMEM medium supplemented with 10% FBS.
2.6. MTT assay
Skin primary cells were UV treated in 24 well tissue culture plates after 1-day post seeding at a density of 5000 cells/well. Cells were allowed to grow for another 96 hours and were washed with PBS before adding 100ul of MTT solution (0.5mg/ml MTT in PBS) to each well. Cells were incubated with MTT solution for another 4 hours before adding 150ul of DMSO to each well for the solubilization of farmazan crystals. Finally, the absorbance was measured at 570nm using using Multiskan microplate photometer (Thermo Scientific, USA).
2.7 Immuno-staining and microscopy
Cells were grown over glass coverslips and fixed with 4% paraformaldehyde. Next to the fixation, cclls were permeablized in PBST (0.5% Triton-X100) and were kept in blocking solution (1X PBS, 0.1% Tween-20, 1%BSA) for 1 hour. Cells were incubated with anti-γ-H2AX (1:1000) and anti-FLAG (1:1000) antibodies for 1 hour at room temperature. Fluorochrome-conjugated secondary antibodies used for subsequent detection of primary antibody binding were Anti-Rabbit IgG (H+L) Alexa-flour 647 (1:1000) and anti-Mouse IgG (H+L) Cy3 (1:500). Finally, coverslips were mounted on glass slides using Vectashield™ mounting media (with DAPI).
The optical sections of immune-stained cell nuclei were captured using Ziess LSM 880 confocal microscope with 63X objective at 3X zoom. Single plane optical sections of 5um thickness were used for co-localization analysis using ‘Colocalization Finder’ plugin of ImageJ.
2.8 Mass spectrometry and protein identification
Immuno-precipitated samples were fractioned by 12% SDS-PAGE. The gels were stained with Imperial™ protein stain (Thermo-Scientific), destained and washed with Milli-Q water several times. Each gel lane was sliced into six pieces and these gel-pieces were trypsin digested using a previously published protocol [26]. The resulting tryptic peptides were resolved using reverse phase chromatography. 18ul of the sample was loaded on reverse phase C18-Biobasic PicoFrit™ chromatography columns using Easy nLC-II (Thermo-Scientific) with a flow rate of 0.4ul/min; peptides were eluted on 60 min gradient (3% to 70% acetonitrile). Chromatographically separated peptides were analyzed using Q-exactive mass spectrometer. The top 10 peptide precursors were selected for MS/MS analysis. The mass spectra obtained were searched against mouse proteome from UniProt using SEQUEST HT algorithm incorporated in Thermo Proteome Discoverer (version 1.4.0.288). Enzyme specificity was set to full trypsin digestion with maximum 2 missed cleavages. Precursor mass tolerance was set to 10ppm and fragment mass tolerance was 0.6Da. Peptide identifications were accepted only if they pass the following criteria: Maximum Delta Cn value - 0.1, Minimum Xcorr value – 1.92, peptide confidence – High. Proteins identifications were accepted only if they cross the Percolator score −10. We identified a non-redundant and cumulative list of 853 proteins from three FLAG-vGAF pull-downs while two negative control pull downs yielded a total of 935 non-redundant proteins. We removed the proteins that were common between vGAF pull down and negative control pull down to identify 585 specific interactors of vGAF.
2.9 Gene ontology analysis of protein interactome
List of protein interactors was analysed using Cytoscape 3.5.1 and its plugin ClueGO [27,28]. In order to identify the functionally enriched clusters of Gene ontology (GO) terms, we performed enrichment analysis of GO terms (Biological process, Molecular function, Cellular compartment) using two-sided hypergeometric test with Bonferroni step down p-value correction. We used GO term fusions, and terms were considered significant only at p=<0.01. The resultant significant terms were visualized in an organic network layout where nodes were connected above 0.4 kappa threshold.
2.10 Bioinformatic analysis of gene expression data
We used Expression-DIY module of GEPIA (Gene Expression Profiling Interactive Analysis) to analyze RNA-seq data for skin cutaneous melanoma (SCM) and normal skin tissue (NST) from TCGA (The Cancer Genome Atlas) and GTEx (Genotype-Tissue Expression) databases. Box-plot function was used for plotting the differential expression of genes in SCM and NST samples using default settings. We used the stage-plot function for plotting the expression of vGAF across major stages of skin cutaneous melanoma using default settings
3. Result
3.1 Identification of vGAF associated protein interactome
To define the protein interactome of vGAF, we used immuno-affinity based purification of protein complexes and subsequently identified the proteins using liquid chromatography coupled with mass-spectrometry (LC-MS/MS). We tagged vGAF with an N-terminal 3X FLAG-tag and expressed the recombinant protein in C2C12 myoblast cells. Immuno-staining of FLAG-vGAF transfected cells with anti-FLAG antibody shows a very specific nuclear localization of FLAG-vGAF (Sup. Fig. 1). The whole cell lysate was used to immuno-precipitate the vGAF containing protein complexes using a monoclonal anti-FLAG antibody or a non-specific IgG (Fig. 1). Whole cell lysate prepared from empty-vector transfected C2C12 cells was also used for immuno-precipitation using the anti-FLAG antibody to serve as the negative control in the mass-spectrometry experiment. Immuno-precipitation experiments were done in triplicates (FLAG-vGAF transfected cells) or in duplicates (empty-vector transfected cells). Finally, the immuno-precipitated protein samples were analyzed using LC-MS/MS. Altogether, we could identify 585 proteins that specifically interact with vGAF across our three replicates (Sup. Table 1). Although, we have used the total cell extracts for all our experiments, cellular component ontology for these 585 proteins showed a statistical overrepresentation of nuclear proteins, suggesting the specificity of FLAG-vGAF immuno-precipitation (Sup. Fig. 2). We further validated our mass spectrometry data by reverse co-IPs followed by western blots. To this end, we selected five candidate proteins from the list of vGAF protein interactome viz. HDAC1, MBD3, (both are the parts of NuRD chromatin remodeling complex) [29], RBM14 (a protein that is involved in both DNA double stand break repair and transcription coupled alternative splicing) [30,31], CBX5 (Protein associated with heterochromatin) [32] and p65 (a context dependent transcription factor) [33]. These proteins were N-terminally GFP-tagged in pEGFPC-1 plasmid and were co-transfected with 3X-FLAG-tagged vGAF plasmid in HEK293 cells (Sup. Fig. 3a & 3b). Immuno-precipitation of GFP-tagged candidate proteins from co-transfected HEK293 cells shows enrichment of vGAF compared to IgG controls suggesting an in-vivo interaction between vGAF and candidate proteins (Fig. 2a & 2b). These results validate the efficacy of our experiment and identification of novel interacting partners of vGAF.
3.2 Gene ontology analysis of vGAF protein interactome
Gene ontology analysis of the vGAF protein interactome enabled us to identify the functional diversity among vGAF interacting partners. Fig. 3 shows a map of significantly enriched Gene ontology (GO) terms associated with vGAF interactome. The same results with additional details on genes associated with each GO term and statistical significance are presented in Sup. Table 2. The most significant Biological process associated GO terms (GOBP) in our analysis are ‘DNA metabolic process,’ ‘DNA Repair’ and ‘DNA Replication.’ Consistent with this we also find cellular component associated GO terms (GOCC) ‘Replication fork’ and ‘DNA replication factor C complex,’ suggesting a novel interaction of vGAF with components of DNA replication and repair. Another set of GOBP terms like ‘regulation of cellular macromolecule biosynthetic process,’ ‘nucleic acid templated transcription’, ‘transcription from RNA polymerase II promoter’ and ‘positive/negative regulation of gene expression’ etc. delineate the association of vGAF with RNA-polymerase-II associated macromolecular transcriptional complexes. Several other Molecular functions associated GO terms (GOMF) like ‘core promoter sequence-specific DNA binding’ and ‘sequence-specific double-stranded DNA binding’ also indicate the interaction of vGAF with other sequence-specific DNA binding proteins. GO analysis also shows the presence of GOMF terms ‘DNA-dependent ATPase activity’, ‘ATPase activity, coupled’ and GOCC term ‘SWI/SNF superfamily-type complex,’ suggesting the association of vGAF with ATP dependent chromatin remodelers. Interestingly, our analysis suggests the interaction of vGAF with components of RNA metabolic machinery through the presence of GOBP terms ‘RNA metabolic process’, ‘regulation of RNA metabolic process,’ ‘ribonucleoprotein complex assembly’ and GOCC term ‘spliceosomal complex’. In addition, our results show the presence of several unexpected GO terms viz. ‘Positive regulation of viral process,’ ‘Regulation of viral transcription,’ ‘ESCRT III complex,’ ‘cAMP response element binding’ and ‘Positive regulation of translation’ etc. Collectively, GO analysis of vGAF protein interactome clearly shows that vGAF interacts with protein complexes of diverse functionalities and nature.
3.3 GAF associates with ATP dependent chromatin remodelers
ATP dependent chromatin remodelers (ADCRs) are key players in the regulation of chromatin packaging and thereby essential for various DNA dependent nuclear functions [34]. There are four different families of chromatin remodeling complexes viz. SWI/SNF, ISWI, CHD and INO80 [29]. All these complexes have an ATPase subunit and a number of non-catalytic subunits, which further diversify these complexes into sub-families [29]. In our analysis, we identify all the components of NuRD complex that belongs to the CHD family of ADCRs (Table 1). We identify Mi-2beta/CHD4 that is the ATPase subunit of this complex, along with that, we also identify all other non-catalytic subunits viz. MBD3, HDAC1, HDAC2, RbAp46, p66-alpha/beta and DOC-1. Similarly, we identify both the components of ACF subfamily of ISWI complexes, viz. SNF2H (ATPase subunit) and BAZ1A (non-catalytic subunit) [29]. On the contrary, we do not identify most of the components of SWI/SNF and INO80 complexes, other than BAF53a, ARID1A (SWI/SNF and INO80) & RUVBL1 (INO80) [29]. These results suggest that vGAF interacts with whole ACF and NuRD remodeling complexes.
3.4 vGAF interacts with a spectrum of DNA repair proteins and is essential for efficient DNA repair and cell survival after UV induced DNA damage
Cellular response to the DNA damage is mediated by DNA damage response (DDR) signaling pathways. ATM (ataxia-telangiectasia mutated), ATR (ATM- and Rad3-Related), and DNA-PKcs (DNA-dependent protein kinase) are the most upstream kinases of DDR signaling pathway [35]. We identify ATR and its obligate interactor ATRIP in our vGAF IP mass-spec data. ATR is of significant importance as it responds to both double strand DNA breaks and a variety of DNA lesions that interfere with DNA replication [36,37]. Further, we find all three RPA70, RPA14 and RPA32 proteins that form complex with ssDNA and recruit ATR/ATRIP complex to the site of DNA damage/replication fork [38]. We find several other DNA repair proteins that are involved in DNA break repair including Ku60 and Ku80, suggesting a physical association between vGAF and DNA repair machinery (Table 2). The phosphorylated form of histone variant H2AX (γ-H2AX) quickly accumulates at the site of double strand DNA breaks within minutes after the induction of DNA damage [39]. It has extensively been used both as a spatial and quantitative marker for DNA damage [39]. In order to determine whether vGAF co-localizes to the sites of DNA double strand breaks, we performed immunostaining of γ-H2AX 1-hour after the UV treatment of C2C12 myoblast cells expressing FLAG-vGAF. As shown in Fig. 4a, co-immunostaining of γ-H2AX (green) and FLAG-vGAF (magenta) shows several regions where these two proteins co-localize with each other, suggesting an association of vGAF with doubled strand DNA break lesions. To further substantiate our results, we used tissues and cells derived from vGAF knock out mice and asked if vGAF could affect the process of DNA-damage repair. We measured the levels of γ-H2AX using western blotting and used it as a proxy for the level of DNA damage in the tissues or cells used for protein extract preparation. vGAF has been shown to express predominantly in adult mouse skin [12,13]. Protein extracts prepared 3-hours after UV treatment of the skin explants shows a higher accumulation of γ-H2AX in vGAF KO skin compared to the wild-type controls, suggesting a delayed repair of DNA-damage in KO skin tissue compared to wild-type tissue (Fig. 4a). We further checked the survival of skin primary fibroblast 4-days after the UV treatment using MTT assay. Here again, we see that primary fibroblast cells from vGAF KO skin are more sensitive to UV treatment compared to the control (Fig. 4b). To further confirm our results, we derived primary fibroblast cells from the lungs which is another organ where vGAF show a significant expression [40]. Protein extracts prepared 3-hours after UV treatment of primary lung fibroblasts show an increased signal of γ-H2AX in western blots compared to the protein extract from the wild-type lung primary fibroblast cells (Fig. 4c). Collectively, these results show that vGAF is essential for efficient DNA repair after UV induced DNA damage. Furthermore, these results serve as a biological validation of our IP mass-spec data that suggest a physical association between vGAF and DNA repair machinery.
3.5 Expression of vGAF is downregulated in skin cancer tissue
UV induced DNA damage is the primary cause of skin cancers and that prompted us to compare the expression data of vGAF (ZBTB7B) and DNA repair proteins that interact with it (Table 2), in skin cutaneous melanoma (SCM) and normal skin tissue (NST). GEPIA (Gene Expression Profiling Interactive Analysis) is a web-based tool for the analysis of TCGA (The Cancer Genome Atlas) and GTEx (Genotype-Tissue Expression) database [41]. We analyzed gene expression data from these two databases using GEPIA and show that SCM tissue samples express vGAF at a much lower level than the NST samples (Fig. 5a), while other DNA repair proteins which were taken in this study are either upregulated or show no significant difference in expression in SCM vs. NST (Sup. Fig. 4). Further analysis also shows that vGAF expression does not change significantly across all the stages of SCM suggesting a consistent downregulation of vGAF even in the stage-0 of SCM (Fig. 5b). Collectively, these results suggest that vGAF is heavily downregulated in SCM even at stage-0, and thus low levels of vGAF in skin biopsies can be used as a diagnostic biomarker for SCM.
4. Discussion
Gene-knockout mouse model and extensive cell culture studies have revealed functions of vGAF in a variety of biological processes viz. development of CD4+ T-cells, brown-fat development, lactation and thermogenesis [8,9,10,11]. Intriguingly, the molecular functions of vGAF that form the basis for these biological processes are confounding in nature. Studies suggest that vGAF is a repressor of cd8 gene in CD4+ T-cells while vGAF forms a ribo-nucleoprotein complex with lincRNA to activate thermogenic gene expression program in adipocytes [16,10]. Moreover, several cell culture based reports on vGAF indicate an even a broader array of molecular activities associated with vGAF like nuclear lamina mediated repression of key developmental genes and enhancer-blocking activity in mammalian Hox clusters [19,20]. Studies in both Drosophila melanogaster and mammals have suggested the importance of protein-protein interactions in the molecular processes associated with GAF proteins [16,20,22-24,42-44]. Here, we have identified a comprehensive protein interactome of vGAF to understand the diversity of vGAF protein interacting complexes.
Existing literature on vGAF shows that it binds on the promoter of genes like UDP glucose dehydrogenase, eomesodermin, Col1a1, Socs1 and Cish and regulates the gene expression [15,17,18,44]. In our study, we also identify a number of transcription factors and components of RNA-polymerase macromolecular complexes. The interaction of vGAF with these protein complexes can modulate the expression of genes where it binds on promoters. Intriguingly, we identify several proteins that are involved in RNA metabolic processes like alternative splicing & mRNA maturation. The functional significance of this association between vGAF and components of AS is exemplified by a high-throughput molecular screen which identifies vGAF as one of the factors that affects developmentally important AS events [45]. Gene ontology analysis of vGAF protein interactome also suggests the interaction of vGAF with proteins involved in the regulation of viral life cycle. A recent report shows that knockdown of vGAF in A549 cells results in almost 50% reduction of Influenza A virus (IAV) titers 24 hours post infection (hpi) compared to the mock treatment, suggesting the functional relevance of the proteins identified in our analysis [46].
Unexpectedly, we find components of ESCRT-III complex in our vGAF protein interactome list. ESCRT complex proteins are required for multivesicular body/endosome biogenesis [47]. The specific role of ESCRT-III complex members is in membrane remodeling (budding and scission) reactions during multivesicular body biogenesis [47,48]. Interestingly, recent studies have shown that ESCRT-III complex is implicated in reformation of nuclear envelope, and co-localizes with Lamin-B receptor and LAP2-beta during the assembly of nuclear envelop after mitotic exit [49]. LAP2-beta is a known interactor of vGAF and has been identified in our studies as well [20]. Microscopic studies have also revealed that vGAF binds on the mitotic chromosome and interacts with lamin B1 during the assemble of nuclear envelop [20]. These independent studies further support a physical link between vGAF and ESCRT-III complex.
Our analysis shows an association of vGAF with whole ACF and NuRD chromatin remodeling complexes. Apart from their gene regulatory functions, both ACF and NuRD complexes are involved in DNA replication and repair [50,51]. In our analysis, the DNA repair associated functions of these chromatin remodeling complexes are of vital importance as we also identify a number of other DNA repair proteins that interact with vGAF. We identify both Ku60 and Ku80, the two major proteins that form complex with DNA ends after a double strand DNA break. Furthermore, our analysis shows the presence of several cNHEJ (classical Non-homologous end joining) accessory proteins (PNKP, Aprataxin, APLF) and ligase3, a component of alternative NHEJ [52]. We also identify the components of base excision DNA repair, viz., XRCC1, Polb and ligase3 [53]. These results show a physical association of vGAF with components of DNA repair machinery suggesting a possible functional role of vGAF in DNA repair process. The functional aspects of this physical association between vGAF and DNA repair machinery is further supported by our results that show that vGAF is necessary for efficient DNA repair and cell survival after DNA damage.
The novel role of vGAF in DNA repair process is quite surprising considering the established functions of this protein as a zinc-finger transcription factor with sequence-specific DNA binding ability. However, we have recently started understanding the role of sequence-specific zinc-finger transcription factors in the maintenance of genome integrity [54]. ZBTB7A/LRF, a paralogue of vGAF in mammals, has recently been shown to be involved in cNHEJ DNA repair pathway [55]. Interestingly, LRF is a very well reported interacting partner of vGAF and it is present in our vGAF protein interaction data as well. DNA damage and repair processes has closely been linked to tumorigenesis [56]. UV-induced DNA damage is the major cause of tumor formation in skin tissue where vGAF also expresses predominantly [57]. Our results show that vGAF is heavily downregulated in SCM and its expression does not change significantly across major stages of melanoma progression, suggesting downregulation even in stage 0 of SCM. We speculate that low expression level of vGAF in the cells interferes with the assembly of DNA-repair protein complexes and thus makes the cell less efficient in repairing the DNA lesions.
Recent studies have established that DNA repair is closely linked with two other major nuclear processes viz. transcription and alternative splicing [58,59]. Noticeably, our vGAF protein interactome data includes several proteins, which are uniquely or redundantly involved in these three processes. For example, vGAF interacts with factors (like RBM14, RBMX and PRPF19) that are involved in both splicing/RNA processing and DNA damage repair [30,31,60-63]. Likewise, our analysis shows that vGAF associates with transcription factors which are involved in DNA damage repair (LRF, ZNF281) and RNA processing (LRF, YY1) [45,55,64-67]. Altogether, these observations suggest that vGAF can influence all three major nuclear processes (Transcription, Splicing and DNA repair) and stands a central position in nuclear-proteome-wide interaction network where molecular circuits of transcription, splicing and DNA repair criss-cross each other.
5. Conclusion
We report a comprehensive protein interactome of vGAF in C2C12 myoblast cells. Analysis of the interactome suggests that vGAF interacts with a functionally diverse set of proteins belonging to transcriptional regulation, chromatin remodeling, RNA metabolism, viral life cycle and DNA repair. We unravel an important function of vGAF in DNA repair and cell survival after UV induced DNA damage. Our study shows that vGAF is downregulated in all the major stage of SCM and, therefore, can be used as an early diagnostic biomarker. Taken together, our study provides a molecular basis for the diverse function of vGAF and also indicates its unexplored functions in biological processes.
6. Acknowledgment
AS acknowledge GATE-SRF fellowship from CSIR towards the PhD program. Authors thank CCMB-Proteomics facility members for performing the mass-spectrometry.