ABSTRACT
The T-box transcription factor TBX1 has critical roles in the cardiopharyngeal lineage and the gene is haploinsufficient in DiGeorge syndrome, a typical developmental anomaly of the pharyngeal apparatus. Despite almost two decades of research, if and how TBX1 function triggers chromatin remodeling is not known.
Here, we explored genome-wide gene expression and chromatin remodeling in two independent cellular models of Tbx1 loss of function, mouse embryonic carcinoma cells P19Cl6, and mouse embryonic stem cells (mESCs). The results of our study revealed that the loss or knockdown of TBX1 caused extensive transcriptional changes, some of which were cell type-specific, some were in common between the two models. However, unexpectedly we observed only limited chromatin changes in both systems. In P19Cl6 cells, differentially accessible regions (DARs) were not enriched in T-BOX binding motifs; in contrast, in mESCs, 34% (n=47) of all DARs included a T-BOX binding motif and almost all of them gained accessibility in Tbx1-/- cells.
In conclusion, despite a clear transcriptional response of our cell models to loss of TBX1 in early cell differentiation, chromatin changes were relatively modest.
INTRODUCTION
TBX1 is a transcription factor encoded by a gene that is haploinsufficient in DiGeorge / 22q11.2 deletion syndromes and in the mouse (Baldini et al., 2017; Greulich et al., 2011; McDonald-McGinn et al., 2015). It is a critical player in the development of the pharyngeal apparatus, which gives rise to organs and structures that are affected by many birth defects. The mechanisms by which TBX1 regulates transcription are only now beginning to emerge, but many questions remain. It binds DNA to a typical T-BOX consensus motif (Castellanos et al., 2014; Fulcoli et al., 2016), and interacts with transcription regulators such as chromatin remodeling complexes, histone modifiers, as well as repressive co-factors (Chen et al., 2012; Fulcoli et al., 2016; Okubo et al., 2011; Stoller et al., 2010), and positively regulates H3K4 monomethylation (Fulcoli et al., 2016). In the mouse, Tbx1 is expressed early in development (from around E7.5) in the cardiopharyngeal mesoderm, the developing anterior foregut/pharyngeal endoderm and in the surface ectoderm. Timed-deletion of the gene has revealed a requirement as early as E7.5-E8.0 (Xu et al., 2005) for the development of the 4th pharyngeal arch artery that will form much later. While this phenomenon could be explained by a number of mechanisms, one possibility is that TBX1 primes enhancers for downstream activation or repression, thereby creating asynchrony between the time of requirement and the phenotypic consequences. To address this issue, we have used two cellular models that respond transcriptionally to Tbx1 gene dosage, mouse P19Cl6 and embryonic stem cells (mESCs), but that are at an early stage of differentiation, and we tested the effects of Tbx1 inactivation on transcription and on chromatin remodeling. mESCs (Tbx1+/+ and Tbx1-/-) were subjected to a widely used cardiac mesoderm differentiation protocol (Kattman et al., 2011) and selected using a fluorescence activated cell sorting (FACS) approach. We selected a subpopulation that expresses the highest level of Tbx1 and pursued it for ATAC-seq (Buenrostro et al., 2015) and RNA-seq. The results obtained from the two cellular models indicate that TBX1 inactivation does not have a strong effect on chromatin remodeling at the differentiation stages tested despite having significant effects on transcription. We discuss possible mechanisms to explain our results.
MATERIALS AND METHODS
P19Cl6 cells
We plated 5×105 cells in a 35-mm dish in Dulbecco-Modified Minimal Essential Medium supplemented (Sigma-Aldrich #M4526) with 10% fetal bovine serum (Gibco #10270106). After 24 hours, at confluence, we added 10uM 5-Azacytidine (5-Aza, Sigma-Aldrich #A2385) to induce differentiation. For ATAC-seq, cells were harvested after a further 24 hr. For quantitative ATAC experiments in time course, 24 hr after 5-Aza treatment we replaced the media with fresh media containing 1% DMSO. Samples for qATAC were harvested 13 hr after transfection (T1), 24 hr after 5-Aza induction (D1), and 24 hr after DMSO treatment (D2).
Mouse embryonic stem cells (mESCs)
E14-Tg2a mESCs were cultured without feeders and maintained undifferentiated on gelatin-coated dishes in GMEM (Sigma Cat# G5154) supplemented with 103U/ml ESGRO LIF (Millipore, Cat# ESG1107), 15% fetal bovine serum (ES Screened Fetal Bovine Serum, US Euroclone Cat# CHA30070L), 0.1 mM nonessential amino acids (Gibco, Cat# 11140-035), 0.1 mM 2-mercaptoethanol (Gibco, Cat# 31350-010), 0.1 mM L-glutamine (Gibco, Cat# 25030081), 0.1 mM Penicillin/Streptomycin (Gibco, Cat# 10378016), and 0.1 mM sodium pyruvate (Gibco, Cat# 11360-070). The cells were passaged every 2-3 days using 0.25% Trypsin-EDTA (1X) (Gibco, Cat# 25200056) as the dissociation buffer.
For differentiation, E14-Tg2a mESCs were dissociated with Trypsin-EDTA and cultured at 75,000 cells/ml in serum-free medium: 75% Iscove’s modified Dulbecco’s medium (Cellgro Cat# 15-016-CV) and 25% HAM F12 media (Cellgro #10-080-CV), supplemented with N2 (GIBCO #17502048) and B27 (GIBCO #12587010) supplements, penicillin/streptomycin (GIBCO #10378016), 0.05% BSA (Invitrogen Cat#. P2489), L-glutamine (GIBCO #25030081), 5mg/ml ascorbic acid (Sigma A4544) and 4.5 × 10−4 M monothioglycerol (Sigma M-6145). After 48 hr in culture, the EBs were dissociated with trypsin-EDTA and reaggregated for 40 hr in serum-free differentiation media with the addition of 8ng/ml human activin A (R&D Systems Cat#. 338-AC), 0.5 ng/ml human BMP4 (R & D Systems Cat# 314-BP), and 5ng/ml human VEGF (R&D Systems Cat#. 293-VE). The 2-day-old EBs were dissociated and 6×104 cells were seeded onto individual wells of a 24-well plate coated with 0.1% gelatin in StemPro-34 medium (Gibco #10639011), supplemented with SP34 supplement, L-glutamine, 5mg/ml ascorbic acid, 5 ng/ml human-VEGF, 10 ng/ml human bFGF (R&D Systems 233-FB-025), and 50 ng/ml human FGF10 (R&D Systems 338-FG-025). After 48 hr, we added new StemPro-34 media, supplemented with SP34 supplement, L-glutamine, 5mg/ml ascorbic acid and keep for 96 hr. We harvested cells for RNA-seq and ATAC-seq analysis at day 4 of differentiation (Fig. 1A).
CRISPR-Cas9-mediated targeting of mESCs
Tbx1 knockout was induced in E14-Tg2a using Alt-R™ CRISPR-Cas9 System (IDT) following the manufacturer’s specifications. This genome editing system is based on the use of a ribonucleoprotein (RNP) consisting of Alt-R S.p. Cas9 nuclease complexed with an Alt-R CRISPR-Cas9 guide RNA (crRNA:tracrRNA duplex). The crRNA is a custom synthesized sequence that is specific for the target (Tbx1KO: /AltR1/rUrG rGrCrC rGrArG rUrArC rArCrU rArCrC rArCrC rGrUrU rUrUrA rGrArG rCrUrA rUrGrC rU/AltR2/) and contains a 16 nt sequence that is complementary to the tracrRNA. Alt-R CRISPR-Cas9 tracrRNA-ATTO 550 (5nmol catalog n. 1075927) is a conserved 67 nt RNA sequence that is required for complexing to the crRNA so as to form the guide RNA that is recognized by S.p. Cas9 (Alt-R S.p. Cas9 Nuclease 3NLS, 100 µg catalog n. 1081058). The fluorescently labeled tracrRNA with ATTO™ 550 fluorescent dye is used to FACS-purify transfected cells. The protocol involves 3 steps: 1) annealing of the crRNA and tracrRNA, 2) assembly of the Cas9 protein with the annealed crRNA and tracrRNAs, and 3) delivery of the ribonucleoprotein (RNP) complex into mESC by reverse transfection. Briefly, we annealed equimolar amounts of resuspended crRNA and tracrRNA to a final concentration (duplex) of 1 μM by heating at 95°C for 5 min and then cooling to room temperature. The RNA duplexes were then complexed with Alt-R S.p. Cas9 enzyme in OptiMEM medium to form the RNP complex, which was then transfected into mESCs using the RNAiMAX transfection reagent (Invitrogen). After 48 hr incubation, cells were trypsinized and ATTO 550+ (transfected) cells were purified by FACS. Fluorescent cells (approx. 65% of the total cell population) were plated at very low density to facilitate colony picking. We picked and screened by PCR 96 clones. Positive clones were confirmed by DNA sequencing.
Fluorescence activated cell sorting (FACS)
For FACS, EBs were collected and allowed to settle by gravity. After washing with PBS, the cells were dissociated using the Embryoid Body dissociation kit (cod. 130-096-348 Miltenyi Biotec) according to the manufacturer’s protocol. Dissociated cells (1x 106 cells/100 μl) were incubated with primary antibodies (PDGFRα-APC, mouse cod.130-102-473; KDR VEGFR2-PE (KDR), mouse cod. 130-120-813 Miltenyi Biotec) directly conjugated (1:50) in PBS-BE solution (PBS, 0.5%BSA, 5mM EDTA) for 20 min on ice. Subsequently, cells were washed twice with 2ml of PBS-BE. Cells were sorted using the BD FACS ARIAIII™ cell sorter.
Total RNA was isolated using QIAzol lysis reagent (QIAGEN) and for qRT-PCR it was reverse-transcribed using the High Capacity cDNA Reverse Transcription kit (Applied Biosystem catalog. n. 4368814). Quantitative real-time PCR was performed using SYBR Green PCR master mix (Applied Biosystem catalog. n. 4309155). Relative gene expression was evaluated using the 2^-ΔCt method, and Gapdh expression as the normalizer. Primer sequences are listed on Tab. 2.
ATAC-seq assay
Differentiated P19Cl6 cells and mESCs were collected and then washed two times in PBS, harvested, counted using a haemacytometer chamber and pelleted. 50.000 cells/sample for P19Cl6 and 15.000 cells/sample for mESC were treated with Tagment DNA Buffer 2x reaction buffer with Tagment DNA Enzyme (Illumina) according to the manufacturer’s protocol. After washes in PBS, cells were suspended in 50 μL of cold lysis buffer (10mM Tris-HCl, pH 7.4, 10mM NaCl, 3mM MgCl2, 0.1% IGEPAL CA-630) and immediately spun down at 500 x g for 10 min at 4 °C. Fresh nuclei were treated with Transposition mix and Purification (Illumina #FC121-130), the nuclei were incubated at 37 °C in Transposition Reaction Mix (25 μL reaction buffer, 2.5 μL Transposase, 22.5 μL Nuclease free water), purified using Qiagen MinElute PCR Purification Kit (Cat No./ID: 28006) and eluted in 10 μL of nuclease free water. Sequencing libraries were prepared from linearly amplified tagmented DNA. Fragmentation size was evaluated using the Agilent 4200 TapeStation. We sequenced two biological replicates for each experimental point. Sequencing was performed with an Illumina NextSeq500 machine, in paired-end, 60bp reads.
Quantitative PCR ATAC (qATAC)
P19Cl6 cells were plated at a density of 5×105 cells per well on a 35-mm tissue culture dish containing 25 pmol of a pool of Silencer Select Pre-Designed Tbx1 SiRNA (Life Technology), non-targeted control (Life Technology) and 7.5 μl of RNAiMAX Reagent (Life Technology) diluted in 500 μl of Opti-MEM Medium (TermoFischer #31985062). We collected samples at 3 time points (see P19Cl6 paragraph above and scheme on Fig. 4A). For each time point we assayed two biological replicates. Cells were collected, washed, trypsinized and counted. Chromatin from 5×104 cells was then tagmented, purified and used for quantitative PCR evaluation. To this end, we have used loci bound by TBX1 (Fulcoli et al., 2016) and located in open chromatin. For real-time PCR, we used biological duplicates for each time point and each duplicate was divided into two technical replicates. Two different controls were used: Gapdh promoter (positive control) representing open chromatin, and a desert island locus (negative control), which does not contain any genes in a range of about 80 kb. Quantification was performed using 2^-ΔCt calculation relative to Gapdh promoter (positive control). The data are expressed as the average of two biological replicates and the standard deviation. Primer sequences are reported on Tab. 2.
RNA was also extracted from each sample and the expression of genes associated with the above loci was evaluated using real-time reverse transcription PCR (qRT-PCR). Quantitation was performed using relative quantification (RQ) and calculated with the standard error of the mean of two biological replicates.
RNA-seq
mESCs in dishes were washed with cold PBS to which 1 mL of Trizol was added. Lysates were harvested and vortexed in order to promote the lysis of cells. 200 μL of chloroform was added to 1 mL of lyste in order to separate the phases. The mixture was centrifuged at 12000g for 15 min. The upper phase was removed and transferred into a new tube containing 500 μL of isopropanol and the solution was incubated for 20 min. at room temperature. After 20 min. the solution was centrifuged for 10 min. at 12000g. Pelleted RNA was washed twice with Ethanol 80% and centrifuged for 5 min. at 7500g. Pellet were resuspended, and the concentration was estimated using a Nanodrop. Libraries were prepared according to the Illumina strand specific RNA-seq protocol. Libraries were sequenced on the Illumina platform NextSeq 500, in paired end, 75bp reads.
RNA-seq and ATAC-seq data analysis
Expressed and differentially expressed (DE) genes related to the analysis of the RNA-seq samples in the P19Cl6 cellular model were retrieved from published datasets (Fulcoli et al., 2016). Mouse ESCs RNA-seq raw sequences were first evaluated for quality using FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/), then mapped to the mouse genome (mm9) using TopHat2 2.0.14 (Kim et al 2013) with -r 170 --mate-std-dev 50 --transcriptome-index transcriptome --library-type fr-secondstrand -N 3 --read-edit-dist 5 and all other parameters as default. The transcriptome was built using the Mus_musculus.NCBIM37.67.gtf annotation downloaded from the ensemble database (http://www.ensemble.org). Only uniquely mappable sequences were retained for further analysis. For each sample, the gene expression was quantified in terms of raw counts using HTseq 0.7.1 (Anders et al., 2015) with -m intersection-nonempty -s reverse for all annotated genes. The next analysis was carried out using RNASeqGUI 1.2.1 (Russo et al., 2016), where the expressed genes were first selected using the proportion test, then the raw counts were normalized using the upper quartile method. Finally, the DESeq2 module was used to identify differentially expressed (DE) genes. Genes with adjusted p-values <0.05 were considered DE. Pathway analysis was carried out for both cellular models using gprofiler2 (Raudvere et al., 2019), with the expressed genes as background and a threshold of 0.05 for the FDR value.
For ATAC-seq analysis, FastQC quality check showed 10-20% contamination of Nextera Transposase Sequence primers (Turner, 2014) in the range 33 to 47 bp. We removed these sequences using cutadapt (Martin, 2011) with the following option -a CTGTCTCTTATACACATCTCCGAGCCCACGAGAC -A CTGTCTCTTATACACATCTGACGCTGCCGACGA. Sequences were then aligned to the mouse genome (mm9) using Bowtie2 2.3.4.3 (Langmead and Salzberg, 2012) with the default parameters. Only uniquely mappable reads were retained. A customized R script was used to remove reads with mates mapping to different chromosomes, or with discordant pairs orientation, or with a mate-pair distance >2 kb, or PCR duplicates (defined as when both mates are aligned to the same genomic coordinate). Reads mapping to the mitochondrial genome were also removed. Coverage heat-maps and average enrichment profiles (TSS +/- 10Kb) in each experimental condition were obtained using ngs.plot (Shen et al., 2014) and evaluated on the expressed genes of the cellular models.
ATAC peaks were identified using MACS2 2.1.2.1 (Feng et al., 2012) with the option -- nomodel --shif100 --extsize 200 that are the suggested parameters to handle the Tn5 transposase cut site. In particular, peak calling was performed independently on each ATAC-seq sample of the P19Cl6 cellular model. Then, for each condition a consensus list of enriched regions was obtained using the intersectBed function from the BedTools 2.29 (Quinlan and Hall, 2010), with the default minimum overlap and retaining only the peak regions common to both replicates. In contrast, for the mouse ES cellular model, due to the lower coverage, the two replicates were first merged into a single signal, after which MACS2 was applied to the pooled samples for each experimental condition.
For the P19Cl6 ATAC cellular model, unless otherwise specified, differentially enriched regions (DARs) were obtained using DEScan2 1.6.0 (Righelli et al., 2018) by loading all of the MACS2 peaks, and performing the peak consensus with the finalRegions function (zThreshold=1, minCarriers=2 parameters) and using edgeR with estimateDisp, glmQLFit (robust=TRUE parameter), glmQLFTest, in this order and with the defaults parameters. For the pooled mouse ES samples, the function sicer_df (Zang et al., 2009) was used to identify the DARs, setting 200, 400, 0.00001, 0.0001 as parameters for window size (bp), gap size (bp), and FDR_vs_Input, FDR, respectively.
Peaks, consensus peaks, and DARs were annotated with genes using ChIPseeker 1.22 (Yu et al., 2015) by associating to each peak/region the nearest gene, setting the TSS region [-3000, 3000] and downloading the annotation “may2012.archive.ensembl.org” from Biomart (using the makeTxDbFromBiomart function) and the org.Mm.eg.db database. Finally, the annotated genes were intersected with the DE genes.
For both cellular models, transcription factor binding motifs were obtained using the findMotifsGenome program of the HOMER suite (http://homer.ucsd.edu/homer/).
Volcano plots, pie-charts, and other data reshaping were performed using standard R-scripts. Overlaps among different regions were identified using the intersectBed function from the BedTools 2.29, with default minimum overlap and reporting each original entry once.
RESULTS
Cell differentiation models
P19Cl6 cells were subjected to a differentiation protocol that has been previously described (Mueller et al., 2010) (Fig. 1A). Cells were transfected with scrambled or Tbx1-targeted siRNAs, harvested at day 1 of differentiation and processed for ATAC-seq in 2 biological replicates.
Mouse ES cells were targeted using CRISPR-Cas9 in order to generate a homozygous Tbx1 loss of function mutation by inserting multiple stop codons and polyA signals into exon 5 by homologous recombination. We obtained 2 correctly targeted homozygous mutant clones and selected one of them (5H) for further experiments because it did not express any Tbx1 mRNA (Fig. 1D-E). Clone 5H and the parental cell line (WT) were subjected to a differentiation protocol (Kattman et al., 2011) according to the scheme shown in Fig. 1A. WT cells expressed Tbx1 at day 4, while no expression was detected in Tbx1-/- cells (Fig. 1F).
To enrich for Tbx1-expressing cells, we subdivided the population of mES cells by FACS using the standard markers PDGFRA and KDR (a.k.a. VEGFR2) at day 4 of differentiation (Fig. 1B). We extracted RNA from sorted populations, KDR+;PDGFRa-, KDR+;PDGFRa+, KDR-;PDGFRa+, and from the total, unsorted population. qRT-PCR showed that by far the highest expression of Tbx1 was in the KDR-;PDGFRa+ population (Fig. 1C). The same fractionation was performed on Tbx1-/- 5H cells and we found that the quantitative distribution of cells in the fractions was similar (Fig. S1, S2). Therefore, ATAC-seq and RNA-seq assays were performed using KDR-;PDGFRa+ day 4 cells from the WT and Tbx1-/- lines, in 2 biological replicates.
Chromatin accessibility assay
A) P19Cl6 cells
Control (scrambled siRNA treated) and Tbx1 depleted (Tbx1KD) cells exhibited a similar distribution and intensity of ATAC-seq signal, which was mostly localized to the promoter region of genes, as expected (Fig. 2A-C).
We next compared ATAC-seq data with TBX1 ChIP-seq data previously reported for this cell line, and under the same differentiation conditions (Fulcoli et al., 2016). Surprisingly, we found that only 80 ATAC peaks (out of a total of 23759 non-targeted siRNA peaks) overlapped with 72 TBX1 ChIP-seq peaks (3% of the 2388 TBX1 peaks), indicating that most TBX1 binding sites are located in closed chromatin, i.e. ATAC-negative.
We next compared chromatin accessibility profiles in control and Tbx1KD cells in order to identify differentially accessible regions (DARs) between the two conditions. We found a total of 177 DARs, of which, 72 (41%) had reduced accessibility and 105 (59%) had increased accessibility following Tbx1 knockdown (Fig. 3A). The 177 DARs were annotated with 165 distinct genes, according to the gene symbol (Supplementary Tab. 1). Comparison of this gene list with previously identified differentially expressed genes (DEGs) (Fulcoli et al., 2016) revealed that only 16 (9%) were differentially expressed (Fig. 3A, list in Supplementary Tab. 1); thus, in most cases, chromatin changes identified in our dataset were not associated with transcriptional changes measured by RNA-seq. We examined the distribution of DARs relative to gene features and found that compared to the distribution of all peaks in the WT population (Fig. 2C), there was a relatively low presence in the promoter regions (32.2% vs. 61.1%) and a relatively higher representation in distal intergenic regions (42.9% vs. 25.8%).
Analysis of DAR sequences identified a set of known motifs of transcription factors with homeodomains (OCT4, NANOG), high mobility group domains (SOX2, SOX17, SOX15), and zinc finger domains (ZIC1, ZIC2, ZIC3) (Fig. 3C). Interestingly, several of these proteins are pluripotency factors, but we did not detect any enrichment of T-BOX binding motifs. This is consistent with the finding that only one of all of the DARs identified here overlapped with TBX1 ChIP-seq peaks (indicated in Supplementary Tab. 1).
The finding that almost no TBX1 ChIP-seq peaks changed accessibility after loss of TBX1 was very surprising to us. In order to test whether accessibility changes might follow Tbx1 KD at a later time than the one tested here, we performed a time-course experiment on 5 ChIP-seq peaks located in open chromatin and associated with 5 target genes (Fulcoli et al., 2016). The experimental scheme is shown in Fig. 4A. At each time point, we carried out quantitative ATAC (qATAC) in control and Tbx1KD cells. At each point, we also measured the expression of the target genes. The results (Fig. 4B-C), which were normalized for the accessibility at the GAPDH promoter, confirmed that at D1 (the time tested by ATAC-seq) there was no change in chromatin accessibility. However, at D2, 4 out of 5 loci showed increased accessibility in the Tbx1KD condition. In all cases, gene expression changed at earlier time points (T1 or D1, Fig. 4C), suggesting that differential expression, for these genes, preceded chromatin changes. These results suggest that the effects of Tbx1 KD on chromatin accessibility are most likely to be indirect.
B) Mouse ES cells
We performed ATAC-seq assays on two biological replicates of differentiating WT and Tbx1-/- mES cells selected by FACS (PDGFRA+;KDR-). The distribution of ATAC-seq peaks was similar between control and mutant cells (Fig. 5A-C). We found a total of 138 DARs, of which 26 (19%) decreased accessibility, and 112 (81%) increased accessibility in Tbx1-/- cells. Their distribution, compared to WT peaks distribution (Fig. 5C) showed a relatively lower representation in the promoter regions (43.5% vs. 75.2%) and relatively higher representation in the distal intergenic regions (31.2% vs. 15%), similarly to what we found in P19Cl6 DARs. The 138 peaks were annotated with 133 distinct genes, according to the gene symbol (Supplementary Tab. 3), of which 12 (8.7%) were also differentially expressed (Supplementary Tab. 3). Sequence analysis of DARs revealed an enrichment of T-BOX binding motifs (Fig. 5D), suggesting that TBX1 or other T-BOX proteins might occupy some of these sites. The two T-BOX motifs shown in Fig. 5D are almost identical, and one or the other was found in 47 DARs (34% of all DARs) (Supplementary Tab. 3, T-BOX column). Interestingly, 43 out of 47 (91%) were more accessible in mutant cells, suggesting that in these cells, TBX1 may work to maintain the chromatin close at selected loci, consistent with results obtained in the time course experiment with P19Cl6 cells.
Transcriptional profiling
Transcriptional changes in response to Tbx1 knockdown have been reported for P19Cl6 cells (Fulcoli, 2016). Here, we performed RNA-seq analysis on WT and Tbx1-/- cells that derive from the same differentiation experiments (2 biological replicates) as the ATAC-seq experiments described above. Results revealed 642 genes to be differentially expressed; 412 down-regulated, 230 up regulated in Tbx1-/- cell line compared to the parental WT cell line in two biological replicates (Supplementary Tab. 2). A list of all genes expressed in these cells is shown in Supplementary Tab. 4. Next, we carried out functional profiling/gene ontology analyses with g:Profiler2 using DEGs from mES cells and from P19Cl6 cells (Fulcoli et al., 2016) using identical criteria. Results are shown side-by-side on Tab. 1. Results were very similar in the two models for the gene ontology category “biological process” because in both cases, DEGs were enriched in developmental genes. However, we noted substantial differences in the category of “cellular component” where mESC showed strong enrichment of genes related to the extracellular matrix (ECM), while in contrast, P19Cl6 cells showed enrichment for genes related to intracellular components. KEGG pathway analysis showed again a strong enrichment of ECM-related pathways but with some limited overlap with the P19Cl6 results, as both models showed focal adhesion to be among the enriched pathways. The enrichment of the KEGG pathways “ECM-receptor interaction” and “basal cell carcinoma” categories in the mESC model is also consistent with recent findings in the mouse (Alfano et al., 2019; Caprio et al., 2020).
DISCUSSION
Data analysis of two cell culture models provided a snapshot of the chromatin accessibility, as measured by ATAC-seq, with and without TBX1 function, or dosage reduction. The use of cell culture systems has limitations because they do not mimic complex developmental processes, but they also have some advantages because they are relatively homogeneous compared to whole-organ or whole-embryo material. This is particularly true for our mESC model in which we used a specific subpopulation at a specific differentiation point. In both models, loss of TBX1 led to significant transcriptional changes, as measured by RNA-seq, indicating that both models respond robustly to loss or reduced dosage of TBX1.
In differentiating P19Cl6 cells, the intersection of ATAC signals with a map of TBX1 binding sites, which was previously published for the same cell line and under the same experimental conditions, revealed that almost all of the binding sites were located in ATAC-negative regions. Thus, TBX1 binding does not require accessible chromatin, at least by ATAC assay. Unfortunately, we were not able to confirm this finding in differentiating mES cells because in our hands, currently available batches of commercial anti-TBX1 antibodies failed to perform in ChIP experiments. In future experiments it will be of interest to establish whether TBX1 can function as a pioneer factor. In a recent paper, it was shown that TBX20, a T-BOX transcription factor that belongs to the same sub-family as TBX1, mostly binds (2/3 of the cases) in closed chromatin regions in endocardial cells (Boogerd et al., 2016). Thus, T-BOX proteins may not need ATAC+ regions to bind chromatin. It is also possible that TBX1, at early stages of differentiation, contributes to maintaining the chromatin closed at selected loci. Indeed, in mESC experiments almost all the DARs with a T-BOX binding motif showed increased accessibility in Tbx1-/- cells.
Gene ontology (GO) analysis of DEGs in response to loss of TBX1 in the two models revealed broad differences, but also some similarities. In both cases, DEGs were enriched in genes involved in developmental processes, as expected; in both cases, the focal adhesion KEGG pathway was significantly enriched. The latter has been validated recently in different cultured cells and in mouse mutants (Alfano et al., 2019). However, in general, GO enrichment was more dispersed in P19Cl6 cells compared to differentiated mES cells, where there was higher enrichment of specific GO terms, perhaps reflecting a more differentiated state and/or a more homogeneous cell population. Particularly evident was the presence of ECM-related genes in the mESC-derived cells.
We selected PDGFRA+;KDR-cells for our studies on the basis of Tbx1 gene expression; data in the literature suggest that mESC-derived PDGFRA+;KDR-cells have a marker profile that is similar to paraxial mesoderm (Craft et al., 2013; Sakurai et al., 2006; Tanaka et al., 2009), which includes head mesenchyme, a tissue that expresses high levels of Tbx1. The mesenchymal nature of the PDGFRA+;KDR-cell population is also consistent with high expression of genes encoding collagens and vimentin. Tbx1-expressing mesenchymal cells contribute to various tissues of the neck and face, including some muscle, bones, and connective tissue (Adachi et al., 2020).
Despite a significant transcriptional response to loss of TBX1, we detected a very modest chromatin response in both models. We have previously proposed that TBX1 is a priming factor that regulates deposition of H3K4me1 in H3K27Ac-negative regions, presumed to be inactive enhancers (Fulcoli et al., 2016). This hypothesis is consistent with our finding of closed chromatin at most TBX1-binding sites. However, H3K4me1 triggers a number of mechanisms that eventually lead to the opening of the chromatin (reviewed in (Calo and Wysocka, 2013)). Thus, we would have expected a more extensive chromatin remodeling than the one that we observed. However, H3K4me1 deposition may also lead to long range chromatin changes, which were not tested in our experiments (Yan et al., 2018). In any case, it is possible that the putative priming activity of TBX1 leads to chromatin changes that are downstream of the differentiation time window tested here. A recent study tested the effect of FOXA2 loss of function on chromatin remodeling during ES-based endoderm differentiation into pancreatic cells (Lee et al., 2019). FOXA2 is a pioneer transcription factor that, like TBX1, regulates H3K4me1 deposition in early phases of differentiation (to definitive endoderm), but its loss did not cause significant ATAC-seq changes at this stage. It was only at later stages of differentiation, when H3K27 acetylation occurred, that ATAC-seq changes became significant (Lee et al., 2019). Our qATAC time course results, though limited to a small number of loci, is consistent with the hypothesis that chromatin remodeling may occur at later stages of differentiation. The development of optimized protocols that will allow the monitoring of Tbx1-expressing cells throughout their differentiation will help to address this hypothesis in the future.
Author Contributions
AC, IA, GL, RF, performed wet lab experiments. AC, MF, DR, CA, performed bioinformatics data analysis, EI, CA, AB provided funds, AB wrote the manuscript, EI, AC, IA, GL, CA contributed to writing and editing.
SUPPLEMENTARY MATERIAL
Supplementary Figure 1
Analysis of surface markers PDGFRA and KDR expression in Tbx1+/+ and Tbx1-/- mESC lines at day 4 of differentiation. A, D) Tbx1+/+ and Tbx1-/- cells labeled by primary antibodies PDGFRA (APC-A) and KDR (PE-A). 4 fractions were identified: Q1-2 = PDGFRA+/KDR-; Q2-2 = PDGFRA+/KDR+; Q4-2= PDGFRA-/KDR+; Q3-2= PDGFRA-/KDR-; B-C-E-F) Negative controls (NC) are cells not incubated with primary antibodies.
Supplementary Figure 2
Plots showing FACS controls. A) Tbx1+/+ cells labeled by PDGFRA-APC antibody and isotype control-PE (left); and by isotype control-APC and KDR-PE antibodies (right); B) Tbx1-/- cells labeled by PDGFRA-APC antibody; and isotype control-PE (left), by isotype control-APC and KDR-PE antibodies (right).
Supplementary Table 1
Differentially accessible regions in P19Cl6 cells with gene annotation, intersection with differentially expressed genes, and intersection with TBX1 ChIP-seq peaks.
Supplementary Table 2
Differentially expressed genes in differentiating mESC. Columns contain the Ensembl Identifier, the corresponding Gene Symbol, the log2 FoldChange of KO/WT and the Adjusted P-value (cut off =0.05).
Supplementary Table 3
Differentially accessible regions in differentiating mES cells with gene annotation, intersection with differentially expressed genes.
Supplementary Table 4
Genes expressed in PDGFRA+;KDR-mES cells at d4. Columns contain the Ensembl Identifier, the corresponding Gene Symbol and the normalized upper quartile gene expression counts for each sample.
ACKNOWLEDGMENTS
This work was funded in part by grants from the Italian Ministry of Research PRIN 20179J2P9J (to AB, EI, GL, and CA), the Campania SATIN Project (to AB), and the Fondation Leducq grant 15CVD01 (to AB and EI). We wish to thank the FACS core of the Institute of Genetics and Biophysics for their assistance with cell sorting.
Footnotes
The T-box transcription factor TBX1 has critical roles in the cardiopharyngeal lineage and the gene is haploinsufficient in DiGeorge syndrome, a typical developmental anomaly of the pharyngeal apparatus. Despite almost two decades of research, if and how TBX1 function triggers chromatin remodeling is not known. Here, we explored genome-wide gene expression and chromatin remodeling in two independent cellular models of Tbx1 loss of function, mouse embryonic carcinoma cells P19Cl6, and mouse embryonic stem cells (mESCs). The results of our study revealed that the loss or knockdown of TBX1 caused extensive transcriptional changes, some of which were cell type-specific, some were in common between the two models. However, unexpectedly we observed only limited chromatin changes in both systems. In P19Cl6 cells, differentially accessible regions (DARs) were not enriched in T-BOX binding motifs; in contrast, in mESCs, 34% (n=47) of all DARs included a T-BOX binding motif and almost all of them gained accessibility in Tbx1-/- cells. In conclusion, despite a clear transcriptional response of our cell models to loss of TBX1 in early cell differentiation, chromatin changes were relatively modest.