Abstract
Wiring a complex brain relies on cell-type and temporal-specific expression of genes encoding cell recognition molecules regulating circuit formation1–3. Though genetic programs driving cell-type specificity have been described4–6, how precise temporal control is achieved remains unknown. Here, we describe a global program for the temporal regulation of cell-type-specific wiring genes in the Drosophila visual system. We show that the Ecdysone Receptor induces a synchronous cascade of transcription factors in neurons throughout the visual system in a highly stereotyped fashion. Single-cell RNA-seq analysis of neurons lacking transcription factors in the cascade revealed that targets are cell-type dependent and these are enriched for wiring genes. Knock-down of different transcription factors in this cascade led to defects in sequential steps in wiring. Taken together, this work identifies a synchronous, global program for temporal control of different sets of wiring genes in different neurons. We speculate that this global program coordinates development across cell types to choreograph specific steps in circuit assembly.
MAIN
Animal behavior is dependent upon the formation of neuronal circuits with high fidelity. Many cell-surface proteins, particularly of the Immunoglobulin (Ig), Leucine-Rich Repeat and Cadherin families, promote interactions between neurons during circuit assembly1, 7. Several developmental strategies have evolved to meet the demands of cellular recognition specificity during circuit construction. These include the expression of combinations of different recognition molecules in different neurons, and reusing the same molecules to determine multiple, spatially and temporally separated neuron-neuron interactions2, 3, 8. Wiring specificity, thus, relies critically on genetic programs which regulate the cell-type and temporal specificity of genes encoding recognition molecules. Many programs that define cell-type specificity have been described4–6. Here we set out to assess how temporal specificity is determined.
Dynamic expression of wiring genes
To identify genetic programs regulating the temporal control of wiring genes, we profiled the transcriptomes of post-mitotic lamina neurons (L1-L5) in the Drosophila optic lobe (analogous to bipolar cells in the mammalian retina8), every 12h from 24hAPF (hours After Pupa Formation) to adults (Fig. 1a, b). These time points encompass all aspects of circuit formation for lamina neurons once their growth cones have reached their target neuropil9, 10. For transcriptional profiling, we modified a recently developed single-cell RNA-Seq-based approach that utilizes the natural genetic variation (i.e detected by single nucleotide polymorphisms (SNPs)) amongst different Drosophila wildtype lines (DGRP11) to determine gene expression in many different samples and different time points in one experiment (Kurmangaliyev et al. revision under review, Fig. 1b, Extended data Fig. 1, Supplementary Table 1). This approach eliminates issues due to batch effects that limit transcriptome analyses combining data from independent experiments. Using this approach, we acquired transcriptomic data for 8269 cells distributed between seven timepoints (24hAPF – Adult) (Fig. 1b, Extended data Fig. 1b, Supplementary Table 2). Using the SNPs within the transcripts encoded by the different DGRP strains, we assigned each cell to a specific time point. We integrated all cells and performed unsupervised clustering independent of global changes occurring over time yielding five clusters (see Methods). We used previously identified lamina neuron-type specific genes to assign each cluster to a cell-type12 (Extended data Fig. 1c). For all subsequent analyses, we used normalized gene expression values prior to integration. The expression patterns were similar to expression data from bulk RNA-seq (this study, Supplementary Table 3) and from a large single cell sequencing study of visual system neurons from our laboratory (Kurmangaliyev et al. revision under review) (Extended data Fig. 2).
Application of k-means clustering to group genes based on their expression dynamics revealed several groups that showed dynamic patterns of expression (e.g. gene groups 5, 7, 9 and 10 in Fig. 1c; also see Extended data Fig. 3 and Supplementary Table 4). Dynamically expressed genes (especially early-peaking gene groups) were enriched (over expected by chance) for genes encoding Immunoglobulin Superfamily (IgSF) proteins, as well as GO terms related to wiring (Fig. 1c, Extended data Fig. 3, Supplementary Table 5). Genes encoding IgSF proteins expressed in a dynamic temporal fashion (see Methods), were also expressed in a cell-type-dependent manner (Extended data Fig. 4a). Thus, as previously described13, 14, wiring genes are expressed in a cell-type and temporally dynamic fashion during circuit assembly.
Synchronous expression of the EcR Cascade
To identify transcription factors (TFs) that drive gene expression in a dynamic fashion, we looked at TFs that were differentially expressed between pairs of consecutive time points for each neuron-type across development (Extended data Fig. 4b, see Methods). Interestingly, for each neuron-type, dynamically expressed TFs were enriched for the Nuclear Receptor Transcription Factor Pathway (Reactome Pathway Enrichment Analysis15, p-value < 10-5, Extended data Fig. 4c). These TFs, including the Ecdysone Receptor (EcR) and others regulated by Ecdysone16, 17, exhibit synchronous patterns of expression in lamina neurons over development (red lines in Fig. 2a, Extended data Fig. 5a, b). In addition, EcR also undergoes a temporal-specific isoform switch due to the use of an alternative transcription start site, from predominantly EcR-B1 to EcR-A in these neurons (Fig. 2a, Extended data Fig. 5), as determined by staining with isoform-specific antibodies and bulk RNA sequencing. This temporal switch in isoform expression has been reported previously for other neurons in the central nervous system18.
Immunostaining for EcR-B1, EcR-A, Hr3 and Ftz-f1 (Extended data Fig. 5a), and our analysis of the whole optic lobe transcriptome (Kurmangaliyev et al. revision under review) revealed that this transcriptional cascade has near-synchronous dynamics in all optic lobe neurons throughout development (Fig. 2a). The timing and order of expression of these TFs correlates with a mid-pupal pulse of Ecdysone16, 17, 19–21 (Fig. 2a). A role for this transcriptional cascade in various developmental contexts, including neuronal remodeling of specific neuron-types during metamorphosis has been studied in detail22, 23. The coordinate and visual-system-wide expression of these transcription factors during pupal development raised the intriguing possibility that this cascade acts as a temporal regulator of circuit formation more broadly.
Temporal control of cell-type specific wiring genes
We next sought to address whether the EcR cascade drives the temporal expression of genes in lamina neurons. Analysis of putative cis-regulatory regions24 for dynamically expressed genes in lamina neurons showed a significant enrichment for binding sites for TFs in the EcR cascade (Supplementary Table 6, see Methods), consistent with their direct role in driving these patterns of gene expression. Additionally, ATAC-Seq25 at 40hAPF, 60hAPF and 72hAPF in L1 neurons (see Methods and Extended data Fig. 6, Supplementary Table 7) identified an enrichment for predicted binding sites for TFs in this cascade (e.g. EcR, Hr3, Hr4 and Ftz-f1) in regions with dynamic patterns of accessibility24 (Extended data Fig. 6d, Supplementary Table 6). This is consistent with these TFs controlling chromatin structure, transcription of specific genes or both26.
To identify genes regulated by the EcR cascade, we targeted expression of a dominant negative allele of EcR (EcRDN, EcR-B1 W650A27), EcR RNAi and Hr3 RNAi to lamina neurons. Transcriptional profiling was performed using a single-cell RNA-Seq-based assay (Fig. 2b) to compare transcriptomes of wildtype and mutant lamina neurons at four timepoints (24hAPF, 48hAPF, 72hAPF and Adult) in a pooled fashion using the DGRP strategy (Supplementary Table 1). By comparing the transcriptomes of 3606 wildtype and 1503 EcRDN-expressing lamina neurons, we identified 921 differentially expressed genes (fold change ≥ 2, p-value ≤ 0.05), out of a total of 3200 genes expressed in lamina neurons across development. These were distributed between the five lamina neuron-types and four time-points (Extended data Fig. 7, Supplementary Table. 8). Importantly, EcRDN-expressing lamina neurons continued to express their cell-type specific TFs, suggesting no changes in cell-fate (Extended date Fig. 7b). In contrast to L1 and L3-L5, EcRDN had modest effects on gene expression in L2. This is consistent with lower EcRDN expression in L2 neurons at 48hAPF using this GAL4 driver line (Extended data Fig. 7e). Analysis of the 921 differentially expressed transcripts led to four conclusions. First, EcR-dependent genes are enriched for those showing dynamic expression in wildtype cells (Extended data Fig. 7f, 11). Second, a large set of genes dependent upon this cascade is expressed in a cell-type-dependent fashion in wildtype lamina neurons (Fig. 2c, Extended data Fig. 8). Third. the cell-type dependent genomic targets of this cascade were highly enriched for IgSF genes (Hypergeometric test, p-value = 1.25 × 10-18, see Methods) as well as GO terms associated with wiring (Fig. 2d, e, Supplementary Table 9). And finally, it was not uncommon for EcRDN to have different effects on the same gene in different neuron types. This may reflect different levels of expression of EcRDN. In some cases, however, opposite effects in different lamina neurons were observed (Fig. 2d).
Three lines of evidence indicated that genes affected by the expression of EcRDN were largely bona fide targets of this transcriptional cascade. First, most genes affected by EcRDN were affected in a similar way by EcR RNAi, although the change was typically less pronounced (Extended data Fig. 9, 12a; Supplementary Table 10). Second, changes in gene expression in Hr3 RNAi and EcRDN were similar, especially at later time points, as expected from their genetic relationship (Fig. 2a; Extended data Fig. 10, 12b; Supplementary Table 11). And third, expression of several putative EcR-dependent genes in brains cultured ex vivo28 depended on the inclusion of Ecdysone in the medium (see Methods, Extended data Fig. 13, 16b). Thus, EcR, either directly or indirectly, controls the temporal expression of wiring genes in a cell-type-specific manner.
The cell-type-specific, temporal control of wiring genes by the EcR cascade suggests that it plays a role in circuit formation. Consistent with this notion, we observed a disorganization of lamina neuron arbors in the medulla neuropil upon pan-lamina expression of EcRDN, which was largely rescued by overexpression of wildtype EcR-B1 (Extended data Fig. 14a). A disorganized neuropil was also seen upon targeted expression of EcR RNAi. Disruption of the axon terminal morphology of T4/T5 neurons in the lobula plate neuropil, a more central visual system structure, was also seen upon targeted expression of EcRDN to these neurons (Extended data Fig. 14b). Together these data support the idea that the EcR transcriptional cascade regulates genetic programs governing the appropriate expression of cell surface and secreted proteins required for proper wiring of the visual system.
EcR regulates the L3-specific expression of Netrin
We next sought to assess whether loss of EcR function gave rise to similar phenotypes seen for loss-of-function mutations in cell surface or secreted proteins regulating wiring in L1-L5. Loss of EcR activity resulted in phenotypes similar to Netrin mutants in L329, 30 (Fig. 3a-d) and mutations in DIP-β, encoding an IgSF protein, in L431 (Extended data Fig. 15). Here we focus on the relationship between EcR and Netrin.
Netrin protein is required in L3 axon terminals in the M3 layer to promote adhesion of photoreceptor R8 growth cones to processes in the same layer. In Netrin mutants, R8 terminals extend into the developing M3 layer, but are unable to stably adhere to it and retract to the surface of the medulla30. At least one other, yet to be identified, signal is necessary for R8 to recognize the M3 layer30. From sequencing of mutants and anti-Netrin antibody staining, EcR activity is required for Netrin expression in L3 (Fig. 3a, b, Extended data Fig. 16b). Targeting expression of EcRDN in L3 leads to an R8 mis-targeting phenotype. Here most R8 terminals extend beyond the M3 layer and some remain at the surface of the medulla. The EcRDN phenotype is partially rescued by overexpression of Netrin in developing EcR-mutant L3 neurons (Fig. 3c). Differences between the Netrin mutant and EcRDN phenotypes are consistent with the notion that EcR regulates one or more signals in L3, in addition to Netrin, necessary for R8 targeting30 (Fig. 3d).
As EcR is expressed in the same way in all lamina neurons, how is the cell-type specificity of wiring genes determined? Interestingly, genes specifically expressed in L3, including Netrin, are enriched for targets of the L3-specific TF Erm identified in a previous study12 (Hypergeometric test, p-value = 2.7 × 10-18; Fig. 3e; Extended data Fig. 17). Erm is expressed selectively in L3 early in lamina development and remains expressed into the adult12, 32 (Extended data Fig. 1c). Thus, here the EcR pathway confers temporal specificity to a continuously expressed cell-type-specific TF to ensure the precise spatial and temporal expression of wiring genes.
Stepwise assembly of L5 connectivity
We next sought to assess whether the EcR cascade regulates sequential steps in wiring. For this we turned to L5. L5 axon morphology in the medulla forms in a stepwise manner33 (Fig. 4b). L5 targets to M5 and then arborizes in M1 and M2 (M1/2) (Fig. 1a, 4b). Arborization occurs in two steps: 1) initial branching in M1 (between 48hAPF and 60hAPF) and, 2) growth from M1 to M2 (between 75hAPF and 90hAPF) (Fig. 4b). EcRDN or EcR RNAi expression blocks step 1, while Hr3 and ftz-f1 RNAi each blocks step 2 (Fig. 4a, b). These findings are consistent with the notion that EcR regulates the expression of genes that promote extension of dendrites into M1 and that Hr3 and Ftz-f1 execute a subsequent developmental step, some 24 hours later (Fig. 4b). Hr3 directly regulates ftz-f1 expression17 (see Fig. 2a, Extended data Fig. 10d). Thus, Ftz-f1 may act alone or in combination with Hr3, to regulate the expression of a set of genes necessary to control targeting of dendrites to M2. As L5 makes synaptic connections with different partners in M1 and M234, EcR may control cell surface proteins regulating synaptic specificity in M1, whereas synaptic specificity in M2 may be controlled by cell recognition proteins regulated by Hr3, Ftz-f1, or both.
The phenotypes resulting from Hr4 RNAi are qualitatively different from knocking down levels of EcR, Hr3 and Ftz-f1. Loss of Hr4 results in promoting extension of L5 branches beyond M2 to M5 (Fig. 4a). Hr4 may regulate a stop signal, preventing extension beyond M2 to deeper layers. Alternatively, as Hr4 is expressed at a much earlier step in development (see Fig. 2a), it may repress a growth pathway acting in the initial extension into M1. As knock down experiments of EcR pathway components led to marked changes in the expression of genes encoding different cell surface and secreted proteins implicated in wiring (Fig. 4c, Supplementary Table 12), these or discrete combinations of them, may function at different steps or in specific spatial domains of L5 axons to establish specific patterns of synaptic connections.
Discussion
Wiring the brain requires precise cell-type and temporally restricted expression of genes encoding cell surface proteins2, 3, 12, 13, 35. Our studies uncovered a strategy that plays a widespread, yet specific role, in regulating this process. We describe a genetic program expressed in most, if not all, visual system neurons that controls the precise temporal patterns of expression of wiring genes (Fig. 4d). These have cell-type specific outputs which reflect the integration of a shared temporal program with diverse pathways controlling the development of different cell types. We show that this temporal program regulates discrete steps in wiring specificity.
The timing of expression of wiring genes results from the activation of a transcriptional cascade by the steroid hormone Ecdysone, comprising multiple transcription factors (such as EcR, Hr3 and Ftz-f1) including members of the nuclear receptor family. These produce sequential waves of gene expression16 (Fig. 2, 4d) which regulate different steps in wiring in different neurons. The control of a cell-type-specific set of wiring genes is made possible, at least in part, by a combinatorial mechanism in which many members of this transcriptional cascade work with different cell-type-specific TFs to produce different outputs36–38. Mechanisms that establish cellular transcriptional landscapes early in development have been identified39–41, including cell-type-specific TFs (e.g. Erm). These are expressed in postmitotic neurons and determine multiple cell-type-dependent features, including transcription of genes required for circuit formation4, 42, 43. By extrapolation of the dual control of Netrin by Erm and EcR in L3, we speculate that it is the coordinate activities of these two classes of TFs that regulates developmental programs specifying connectivity.
Dynamic expression of wiring genes, including cell-recognition molecules, poses an additional challenge for gene expression control. How are the transcriptomes of diverse neuron-types coordinated to ensure proper wiring? A global temporal signal, such as Ecdysone, is a possible mechanism to coordinate the development of multiple neuron types. For instance, the Ecdysone-dependent timing of Netrin expression in L3 growth cones matches the arrival of R8 growth cones in M3 (Fig. 3d; Extended data Fig. 16a). Further experiments will be necessary to determine whether Ecdysone not only regulates the timing of developmental steps, but also ensures that these are coordinated between different neuron-types in the complex choreography leading to circuit assembly.
Our studies rest on decades of genetic and biochemical experiments establishing the Ecdysone-pathway as a temporal regulator of major developmental transitions, including larval molts and metamorphosis, as well as controlling sequential divisions of postembryonic neuroblasts and neuronal re-modelling16, 23, 37, 38, 44. Here we show that this program has been co-opted for the temporal regulation of differentiation, and the coordinate development of different neuron types leading to the complex circuitry of the fly visual system. The ability of the EcR cascade to function as a flexible temporal regulatory module used in multiple developmental contexts may be similar to modules regulating spatial patterning (e.g. Hox genes and BMP/Wnt/Hh pathways45–47) which act in context-dependent ways to generate different morphological outcomes. These findings raise the intriguing possibility that similar timing mechanisms may also regulate the sequence of events regulating wiring in other systems48 including the maturational sequence in the assembly of neural circuits in the mammalian brain49.
Methods
Fly husbandry and stocks
Maintenance and rearing of fly lines, as well as staging of pupae was done as described in Tan et al.12 Fly lines used in this work are listed in Supplementary Table 1.
Multiplexed single cell transcriptomic analysis
For transcriptomic analysis of developing wildtype lamina neurons, w; UAS-H2A-GFP; 9B08-Gal4/Tm6B, tb females were crossed with males from different DGRP backgrounds (wildtype, see Supplementary Table 1 for list of DGRPs used). F1 prepupae were staged as in Kurmangaliyev et al.35 Males were only included in the analysis if no significant differences in gene expression were found with female pupae of the same genotype and developmental time point. Pupae corresponding to different developmental stages (see Fig. 1b) were all dissected, dissociated and processed at the same time. Each developmental time point was represented by ≥ 2 DGRP heterozygotes, and only one animal/ DGRP heterozygote was dissected. Tissue dissociation, FACS and preparation of single-cell libraries using 10X Genomics Chromium (v3) were carried out similar to Kurmangaliyev et al., except that H2A-GFP expression was used to enrich for lamina neurons (see Extended data Fig. 1a). All libraries were sequenced on a NextSeq500 platform (single-end 75bp).
For transcriptomic analysis involving UAS EcRDN (BDSC #6872), males carrying a single DGRP autonomous chromosome – w; (UAS EcRDN or UAS tdTom); DGRP/ Tm6b, tb were crossed with w; UAS-H2A-GFP; 9B08-Gal4/Tm6B females. For experiments involving UAS EcR RNAi (BDSC #9326) or UAS Hr3 RNAi (BDSC #27253), w; DGRP; (UAS wRNAi or UAS EcR RNAi or UAS Hr3 RNAi)/ Tm6B, tb males were crossed with UAS Dcr2; UAS H2A-GFP; 9B08 Gal4/ Tm6B, tb females (see Fig. 2b and Supplementary Table 1). UAS tdTom and UAS wRNAi were used as controls.
scRNA-seq data pre-processing
Raw fastq read files were processed using Cell Ranger (3.1.0) with default parameters. Seurat V3 was used for all preliminary analyses. FlyBase reference genome (release 6.2950) was used for alignment and annotation. Profiled single cells were identified as coming from a particular developmental time-point and genetic background based on the natural genomic variance within the different parent DGRP lines. This was done using the pipeline described in Kurmangalyev et al., with the following modifications: the count of minor allele that equals to 2 among the analyzed DGRP strains were used as genomic variants to assign the profiled single cells to different DGRP parent lines. Only single cells with: 1) number of genes between 200 and 3000, 2) number of mitochondrial transcripts < 20% of all transcripts, and 3) assignment to a unique DGRP parent line; were used for all downstream analyses.
To identify different lamina neuron subtypes, all cells from a particular experiment were integrated as described in Seurat v3 workflow and then subjected to unsupervised clustering, thus disregarding global temporal gene expression changes (similar to Kurmangaliyev et al.). Previously identified lamina neuron subtype specific genes12 were used to assign each cluster to a cell-type. Cells not assigned to a lamina neuron-type cluster were removed from subsequent analyses. Average expression for each gene for each cell-type at a particular timepoint and genetic background was calculated using normalized expression values prior to integration.
Bulk RNA-seq
Bulk RNA-Seq analysis of L1 neurons at 40hAPF, 60hAPF and 72hAPF was performed as in Tan et al.12 At least 2 replicates were generated for each timepoint. A minimum of 2000 cells were sorted for each experiment. cDNA libraries were created using the SMART-Seq2 protocol51, which were then sequenced together using the HiSeq4000 platform (paired-end 50bp).
Raw fastq reads files were mapped to FlyBase reference genome (release 6.29) using STaR (2.6.0) and only uniquely mapped reads were collected. Genes with counts per million (CPM) ≥ 4 in more than 2 samples were used for normalization with R package edgeR (3.26.8). Gene expression was quantified using RPKM units (Reads Per Kilobase of exon per Million reads mapped), calculated based on reads in the sum of exons using customized scripts. The correlation between biological replicates were calculated using Spearman correlation for top 500 genes with the highest variation across all samples.
Bulk ATAC-seq
At least 8000 L1 neurons were FACS purified at 40hAPF, 60hAPF and 72hAPF as described in Tan et al.12 Biological duplicates were generated for each timepoint. ATAC-Seq libraries were generated as per the following protocol, which was modified from the one described in Buenrostro et al.52: 1) FACS purified cells were collected in 300ul 1X PBS and spun at 800xg for 8’ at 4°C. 2) Cell pellet was then directly resuspended in 50µl of a tagmentation enzyme mix (25µl Nextera DNA library prep kit – Tagment DNA Buffer, 19µl Nuclease-free water, 5µl 1% IGEPAL CA630, 1µl Nextera DNA library prep kit - Tagment DNA Enzyme) and incubated at 37°C for 30’ with constant agitation at 400rpm. 3) DNA was purified from the tagmentation mixes using the Qiagen minElute Reaction Cleanup Kit as per the manufacturer’s protocol. 4) A 10μl qPCR reaction was setup to determine the optimum number of amplification cycles (similar to Buenrostro et al.). 5) A larger PCR reaction (40μl) was then setup and PCR products in the 200-500bp range were gel purified. 6) Steps 4 and 5 were repeated, this time with barcoded primers (see Buenrostro et al.) to allow multiplexing of all libraries onto a single lane of HiSeq4000 (paired-end 50bp).
Raw fastq reads files were mapped to FlyBase reference genome (release 6.29) using Bowtie2 (2.2.9) and uniquely mapped genes were kept. All samples were pooled together prior to peak calling. Read start positions were shifted +4 or -5bp and used for peak calling using MACS2 (2.1.1) with parameters “-q 0.01 --nomodel --shift -100 --extsize 200”. There were 26,122 peaks identified. Then bedtools multicov (2.27.1) was used to sum the total reads within each peak separately for each sample. Peaks with CPM ≥ 4 were used for counts normalization with edgeR. Correlations between biological replicates were calculated using Spearman correlation for the top 500 peak regions with the highest variation of peak levels across all samples. Distribution of the top 5000 peaks across different genomic features at 40hAPF, 60hAPF and 72hAPF was determined using the R package ChIPseeker (1.20.0). Differential peak analysis was performed using edgeR. Fold-change ≥ 2 and adjusted p-value ≤ 0.05 were chosen as the cutoffs to define peaks with differential accessibility between time points. Each ATAC peak was associated to the nearest gene based on proximity to the transcription start site using ChIPseeker. To compare change of ATAC-seq peak accessibility over time and change in expression of nearest gene, ln(fold change ATAC-seq peak read coverage) was plotted against ln(fold change expression of nearest gene) for each ATAC-Seq peak (Extended data Fig. 6). Bulk RNA-Seq data was used for this, as this transcriptome dataset had been generated for the same time points as for ATAC-Seq. This was done separately for peaks differentially accessible between 40hAPF and 60hAPF (Extended data Fig. 6b (left)) and between 60hAPF and 72hAPF (Extended data Fig. 6b (right)).
Comparison of scRNA-Seq with other datasets
For comparison of lamina neuron transcriptomes generated here with lamina neuron transcriptomes generated in Kurmangaliyev et al., 500 genes with the highest variance across all cell types and all timepoints were used to calculate Spearman correlation. Comparison of L1 transcriptome generated using scRNA-Seq and L1 transcriptome obtained from bulk RNA-Seq was done similarly.
k-means cluster analysis
Prior to clustering, all average gene expression from scRNA-seq transcriptomic analysis were normalized to maximum expression in each cell type across development. k-means clustering was performed using kmeans function in R (centers=10 and nstart=20).
Analysis of gene group traits
IgSF enrichment
A list of IgSF members was obtained from the InterPro database (IPR036179). Enrichment within each gene group over expected by chance was then calculated using hypergeometric distribution analysis. The set of genes expressed in a cell-type across development was taken as the universal set (N). Then number of genes in a group (n) and number of IgSFs expressed in a cell-type (m) were used as the sample size and number of successes in population respectively. Enrichment was calculated based on actual overlap between n and m, and the expected overlap between the two, defined by (n x m)/N. For enrichment of IgSF members within cell-type dependent targets of EcR, universal set was taken as all genes expressed in all lamina neurons over development. Sample size and number of successes in the population were taken as: the total number of IgSF genes expressed over development in all lamina neurons (n), and the total number of cell-type dependent targets of EcR (m) respectively. Enrichment was calculated as described above.
Gene Ontology enrichment analysis
Gene Ontology enrichment analysis for each gene group was performed using a R package TopGO (2.36.0) using GO terms obtained from FlyBase (GO-gene association file version 2.1). Weight algorithm and Fisher’s exact test were used to calculate p-value. Categories that have p-value ≤ 0.01 were considered as enriched.
Effect of perturbation of EcR and Hr3
For each perturbation, expression of each gene was normalized to maximum expression in each cell type across development and genetic background. The normalized gene expression of all genes was averaged separately for each genetic background to plot average behaviors for each gene group (Extended data Fig. 11).
Differential gene expression analysis
Wilcoxon rank-sum test (fold change ≥ 2, adjusted p-value ≤ 0.05) was used to identify genes differentially expressed between timepoints, cell-types or as a consequence of EcR/ Hr3 perturbations. Dynamic TFs and IgSF genes were defined as ones with differential expression between consecutive timepoints. Cell-type dependent genes were defined as ones with differential expression in at least one pairwise comparison amongst all lamina neurons. Cell-type specific genes were defined as ones that are differentially expressed in all pairwise comparison including one lamina neuron-type. Bonferroni adjustment was used for multiple testing correction.
Reactome pathway analysis
Dynamic transcription factors (separately for each cell-type) were entered into the ‘Analyze Data’ tool on reactome.org (https://reactome.org/PathwayBrowser/#TOOL=AT). Reactome pathway analysis for Drosophila melanogaster was then run with default parameters.
TF binding site analysis
i-cisTarget (https://gbiomed.kuleuven.be/apps/lcb/i-cisTarget/)24 was used (using default parameters) to identify TF binding sites enriched within putative cis-regulatory regions and ATAC-Seq peaks. dm6 version of the Drosophila genome, and version 5 of the i-cisTarget database were used for all analyses. All ATAC-Seq peaks overlapping with transcription start sites were excluded from TF binding site analyses to reduce the occurrence of promoter-enriched sequence motifs in i-cisTarget results.
All EcR-pathway TF binding motifs enriched within different sets of ATAC-Seq peaks were additionally verified using the Homer (v4.7) mergePeaks utility53. Briefly: first, the top 5000 occurrences of the motif across the genome were called using the homer scanMotifGenomeWide utility. Second, Homer suite mergePeaks was then used to evaluate if the extent of overlap between the top 5000 occurrences of the motif and the set of ATAC-Seq peaks is higher than expected by chance. p-value ≤ 0.01 was considered as significantly more overlap than expected by chance.
Immunohistochemistry and microscopy
Immunohistochemistry and microscopy were performed as described in Xu et al.54 with the following modifications: 1) For experiments involving staining of lamina neuron presynaptic sites, brains were fixed using glyoxal (3.12% glyoxal, 0.75% acetic acid and 20% ethanol, pH adjusted to 5.0) for 30’ at RT and then washed 3X with PBST. 2) All images were acquired using an LSM880 confocal microscope.
Primary antibodies used in this study were: mAb24B10 (1:20, DSHB), rabbit anti-NetB (1:500, gift from Akin Lab), rabbit anti-dsRed (1:400, Clontech 632496), chicken anti-GFP (1:1000, Abcam ab13970), rat anti-Flag (1:200, Novus Biologicals, NBP1-06712), mouse anti-EcR-B1 (1:20, DSHB AD4.4), mouse anti-EcR-A (1:10, DSHB 15G1a), rabbit anti-Hr3 (1:50, gift from Thummel Lab), mouse anti-svp (1:20, DSHB 5B11), rabbit anti-erm (1:100, gift from Wang Lab), rat anti-bab2 (1:500, gift from Laski Lab), mouse anti-V5 (1:200, BioRad MCA1360), guinea pig anti-Bsh (1:400, see Tan et al. 2015).
Secondary antibodies used in this study were: goat anti-mouse 488 (1:500, ThermoFisher A-32723), goat anti-mouse 568 (1:500, ThermoFisher A-11031), goat anti-mouse 647 (1:500, ThermoFisher A-21235), goat anti-chicken 488 (1:1000, ThermoFisher A-11039), goat anti-rabbit 568 (1:500, ThermoFisher A-11011), goat anti-guinea pig 647 (1:500, ThermoFisher A-21450), goat anti-rabbit 647 (1:200, ThermoFisher A-32733), goat anti-rat 568 (1:500, ThermoFisher A-11077).
Image analysis
All images for figures were created using ImageJ (v2.1.0). Details for quantification of phenotypes are given below.
R8 targeting
Imaris (v9.1.2) was used to measure the distance of R8 terminals from the top of the medulla (M0), as well as the distance of 6th medulla layer (M6) from the top of the medulla. The latter was estimated using mAb24B10 staining, which labels R7 neurons that terminate in M6. Values reported on the X-axis of Fig. 3c are (R8-M0)/ (M6-M0).
L5 morphological defects
Imaris was used to visualize individual neurons. Each neuron was then manually assigned to one of the stated categories.
Quantification of BRP puncta in the lamina
The distance between the proximal end of the lamina and the distal end of the lamina was measured using Imaris individually for all lamina cartridges. A line was drawn at the measured distance divided by 2. All BRP puncta more distal of this line were counted manually.
Ex vivo culture of dissected brains
We used a protocol similar to the one described in Özel et al.28 Briefly, brains were dissected from 22hAPF pupae in pre-warmed medium (ThermoFisher #2172004) with either 1:1000 dilution of 1mg/ml 20-HydroxyEcdysone (dissolved in 100% Ethanol, Sigma H5142) or equivalent volume of 100% ethanol. Brains were then incubated in ∼ 200ul of medium ± 20-HydroxyEcdysone in 96 well plates for 26h at 25°C in a humidified chamber. Thereafter, brains were fixed, stained and imaged using the aforementioned protocols for immunohistochemistry and microscopy.
In the presence of 20-HydroxyEcdysone in the media, strong expression of EcR-B1 and Hr3 is observed throughout the optic lobe, while no expression of Ftz-f1 is seen (Extended data Fig. 13a). When 20-HydroxyEcdysone is omitted from the culture medium, no expression of Hr3 is seen (as expected), however weak induction of Ftz-f1 is observed. In vivo, by 22hAPF (when brains are removed from pupae), Ecdysone levels have begun to rise and may be sufficient to activate EcR-B1 in neurons21. As Hr3 activation requires a subsequent drop in Ecdysone titers17, we expect the weak expression of Ftz-f1 may be due to the transfer of 22hAPF brains to an Ecdysone-free medium.
Other statistics
To compare the dynamics of genes affected by EcRDN (fold change ≥ 2, p-value ≤ 0.05 between WT and EcRDN at any time point) and genes unaffected by EcRDN, normalized wildtype expression for each gene was divided by the maximum expression at any time point throughout development. Variance was then calculated for each gene over time. Two-tailed Student’s t-test was then used to calculate the statistical difference between the distributions of variance for these two sets of genes. This was done separately for each cell-type.
Hypergeometric test was used to evaluate enrichment of targets of Erm within L3-specific targets of EcR. The set of genes expressed in a L3 neurons across development was taken as the universal set (N). Then number of targets of Erm from Peng et al.32 (m) and number of L3-specific targets of EcR (n) were used as the number of successes in population and sample size respectively.
For all box-plots, solid line depicts median, while the upper and lower bounds of the box depict the third and first quantile of the data spread respectively.
Kolmogorov-Smirnov (KS) test (Fig. 3c, 16c), two-tailed Student’s t-test (Extended data Fig. 7e, 7f, 11, 15a), two-tailed Fisher’s exact test (Fig. 3e, 4a) were performed using the following basic R functions (R 3.6.1) respectively: ks.test, t.test, fisher.test. Raw data for all analyses is available upon request.
Data and code availability
All raw sequencing data and codes will be provided upon request. They will be made publicly available prior to publication.
Supplementary Tables
Supplementary Table 1. Fly strains used in this study
Supplementary Table 2. Developmental transcriptome of lamina neurons. Normalized expression values are given for all expressed genes.
Supplementary Table 3. Transcriptome of L1 neurons at 40hAPF, 60hAPF and 72hAPF by bulk RNA-Seq. Numbers in the table are RPKM values.
Supplementary Table 4. Gene groups generated by k-means clustering. Relative expression values are given for all expressed genes.
Supplementary Table 5. Gene Ontology (GO) analysis of gene groups generated by k-means clustering.
Supplementary Table 6. TF binding site analysis using i-cisTarget (see Methods).
Supplementary Table 7. ATAC-Seq analysis of L1 neurons at 40hAPF, 60hAPF and 72hAPF. Measure of reads mapped to each peak are given as RPKM values. Peaks with differential accessibility between consecutive time points are also given.
Supplementary Table 8. Normalized expression in WT and EcRDN-expressing lamina neurons. Genes differentially expressed between WT and EcRDN are also given.
Supplementary Table 9. Gene Ontology (GO) analysis of common and cell-type dependent targets of EcR.
Supplementary Table 10. Normalized expression in wRNAi and EcR RNAi-expressing lamina neurons. Genes differentially expressed between wRNAi and EcR RNAi are also given.
Supplementary Table 11. Normalized expression in wRNAi and Hr3 RNAi-expressing lamina neurons. Genes differentially expressed between wRNAi and Hr3 RNAi are also given.
Supplementary Table 12. Lists of genes likely to be involved in step 1 or step 2 of L5 axonal morphogenesis (see Fig. 4c).
Author contributions
S.J., Y.L., Y.K. and S.L.Z. designed experiments. S.J., Y.L., P.M. and B.P. acquired data. S.J., Y.L. and Y.K. analyzed the data. S.J., Y.L. and S.L.Z. wrote the manuscript with input from all co-authors.
Competing interest declaration
The authors declare no competing interests.
Additional information
Supplementary Information is available for this paper.
Acknowledgements
We thank Dr. Pecot (Harvard), Dr. Yamanaka (UC Riverside), Dr. Black (UCLA), Dr. De Robertis (UCLA), Dr. Riddiford (Univ. of Washington) and Dr. Truman (Janelia Research Campus) for helpful discussions. We thank Dr. Schuldiner (Weizmann Institute of Science) and members of the Zipursky lab for feedback on the manuscript, and Dr. Diaz de la Loza for help with figure illustrations. We would like to specifically acknowledge Juyoun Yoo’s (Zipursky Lab) and Rachel Hodge’s (Jones Lab, UCLA) help with ATAC-Seq library prep and immunostaining. We also thank the BSCRC Sequencing Core (UCLA) and the TCGB core (UCLA) for help with library preparation and sequencing; the BSCRC FACS core (UCLA) and the Witte lab (UCLA) for assistance with FACS purification of lamina neurons; and the IDRE Statistics Consulting (UCLA) and Dr. Balliu (UCLA) for assistance with the statistical analysis of data. Reagents provided by Dr. Akin (UCLA), Dr. Laski (UCLA), Dr. Wang (Duke-NUS), Dr. Pecot (Harvard), Dr. Thummel (Univ. of Utah) and Dr. Bashaw (Univ. of Penn.) were critical for this work. This work was supported by NIH T32-NS048004 Neurobehavioral Genetics Training Grant (S.J.), Helen Hay Whitney Foundation (S.J.) and Whitcome Fellowship (Y.L.). S.L.Z is an investigator of the Howard Hughes Medical Institute.