Abstract
Limb development has long served as a model system for coordinated spatial patterning of progenitor cells. Here, we identify a population of naïve limb progenitors and show that they differentiate progressively to form the skeleton in a complex nonconsecutive three-dimensional pattern.
Single-cell RNA sequencing of the developing mouse forelimb revealed three progenitor states: naïve, proximal and autopodial, as well as Msx1 as a marker for the naïve progenitors. In vivo lineage tracing confirmed this role and localized the naïve progenitors to the outer margin of the limb, along the anterior-posterior axis. Sequential pulse-chase experiments showed that the progressive transition of Msx1+ naïve progenitors into proximal and autopodial progenitors coincides with their differentiation to Sox9+ chondroprogenitors, which occurs along all the forming skeletal segments.
Indeed, tracking the spatiotemporal sequence of differentiation showed that the skeleton forms progressively in a complex pattern. These findings suggest a new model for limb skeleton development.
Introduction
Limb development has long served as a central model system for studying organ formation. Over the years, extensive research has identified key components in the genetic program that controls both patterning and differentiation of the different tissues composing the limb, as well as the complex signaling involved in regulating this genetic program (Delgado and Torres, 2017; Huang, 2017; Johnson and Tabin, 1997; Nassari et al., 2017a; Niswander, 2003; Tickle, 2005; Zuniga, 2015). These studies have produced the basic concepts of how progenitor cells pattern and differentiate along the three axes to form a complex functional organ (Petit et al., 2017; Tabin and Wolpert, 2007a; Zeller et al., 2009).
The mouse forelimb starts to develop around E9.5 as a small outgrowth from the body wall. Initially, the limb bud comprises seemingly homogeneous undifferentiated mesenchymal cells covered by a layer of ectoderm. At the distal side, along the anterior-posterior border, the ectoderm thickens to form the apical ectodermal ridge (AER). As development proceeds, the limb is elongated and the hand plate is formed. Concurrently, the development of the limb skeleton is initiated, as mesenchymal cells form condensations that prefigure the future skeletal elements.
The vertebrate limb skeleton is organized in three segments: stylopod, containing humerus in the forelimb or femur in the hindlimb, zeugopod, comprising radius and ulna or tibia and fibula and autopod, comprising the wrist or ankle and digits. Surgical removal of the AER during early wing-bud development resulted in severe truncation of the distal elements. Moreover, the later the AER was removed, the more distal elements were formed. These findings led to the perception that the skeletal elements of the limb form in a proximal-to-distal order under the regulation of the AER (Saunders, 1948; Summerbell et al., 1973). Several models have attempted to explain this mode of development (Tabin and Wolpert, 2007a). The progress zone model is named after a distal domain under the AER, where limb progenitor cells are postulated to be located (Saunders, 1948; Summerbell et al., 1973). According to this model, the longer these progenitors spend in the progress zone, the more distal their progeny become. Once the cells exit this domain, their fate is determined. The result is that the first cells to leave the progress zone form the stylopod, the next to leave form the zeugopod, and the cells that exit last form the autopod (Saunders, 1948; Summerbell et al., 1973; Wolpert, 2002). An alternative model posits that progenitors of the limb segments are specified early in development, organized in three parallel stripes, and then expand progressively in a proximodistal order (Dudley et al., 2002; Sun et al., 2002). The two-signal model suggests that proximal and autopod progenitors are specified by two opposing signals deriving from the flank and AER, respectively. Later on, as the limb bud grows, a third domain of the zeugopod is formed in the middle (Mariani et al., 2008; Mercader et al., 1999a, 2000).
Recently, several works have studied mouse and chick limb development at single-cell resolution (Desanlis et al., 2020a; Feregrino et al., 2019; Kelly et al., 2020). Despite their findings, fundamental aspects of this process are still missing. For example, the identity of limb progenitors and their spatial distribution are unclear, as we lack marker genes to identify them. The temporal changes the transcriptome of these progenitors undergo during development and the sequence by which they differentiate to form the limb skeleton have yet to be uncovered, too.
In this work, we establish a single-cell atlas of the developing limb and characterize limb progenitors. We identified three progenitor populations, namely naïve, proximal and autopodial limb progenitors. We established Msx1 as a marker for the naïve progenitors and their location in the outer margin of the developing limb, along the anterior-posterior axis. We then showed that these Msx1+ naïve progenitors transit progressively and simultaneously into either proximal or autopodial progenitors. Moreover, the progressive contribution of these progenitors to the forming skeleton occurs simultaneously all along the proximal-distal axis of the limb. Finally, temporal analysis of the differentiation of Msx1 lineage cells revealed that the skeleton forms in a complex nonconsecutive three-dimensional pattern, which extends to the level of the single element.
Results
Single-cell RNA sequencing provides a comprehensive cellular and molecular atlas of the major mesenchyme-derived cells types of the developing limb
To date, the identity of limb mesenchymal progenitor cells and their differentiation paths are only partially understood. To obtain a deeper and unbiased molecular characterization of mesenchymal progenitor cells, we generated transcriptional maps of mesenchymal lineages in the developing limb between E10.5 and E14.5 by applying a massively parallel single-cell RNA-seq (MARS-seq). During this time window, mesenchymal cells undergo major patterning and differentiation to form the different tissues of the limb, including muscle connective tissue, tendons, ligaments and skeleton. To ensure representation of the different cell types and differentiation states, including rare subpopulations, we combined lineage and reporter-based single-cell analysis using Sox9 and Scx, the earliest known markers for skeletal and tendon cells, respectively (Akiyama et al., 2002a; Bi et al., 1999; Schweitzer et al., 2001).
At E10.5, we sampled a Sox9-GFP transgenic mouse line and collected both Sox9+ and negative cell populations. Because Sox9 marks multiple cell types (Akiyama‡ et al., 2005; Nagakura et al.; Soeda et al., 2010; Sugimoto et al., 2013), to follow the dynamics of different lineages and to ensure the representation of tendons we generated a compound mouse model containing Sox9-CreERT2 (Soeda et al., 2010), tdTomato (Madisen et al., 2010) and Scx-GFP (Pryce et al., 2007). Tamoxifen was administered at different developmental time points between E9.5 and E12.5 and samples were harvested 48 h later. Thus, we sampled four cell populations: tdTomato-positive (Sox9+ skeletal lineage), GFP- positive (Scx+ tendon cells), tdTomato-GFP double-positive cells, and double-negative cells (other progenitors). Then, 32,000 quality-filtered cells (see Methods) were subjected to MARS-seq to generate a limb cellular atlas. We used the MetaCell algorithm (Baran et al., 2019) to identify homogeneous and robust groups of cells, referred to as meta-cells (MCs; see Methods). To focus on mesenchymal cell lineages, MCs of non-mesenchymal origin, such as red blood cells, muscle cells, ectoderm-derived cells and Schwann cells, were excluded from the analysis (Table S1; Methods). The remaining 250 MCs were grouped into 12 molecularly distinct populations (Fig. 1B-F; Fig. S1).
To annotate clusters as chondrocytes or connective tissue fibroblasts, we performed differential gene expression analysis. We used the established markers Sox9, Col2a, Wwp2, and Col9a2 for the former and Col1a1, Col3a1, Scx, Osr1, Dcn, and Lum for the latter (Akiyama et al., 2002b; Bell et al., 1997; Bi et al., 1999; Lefebvre et al., 1997; Liu et al., 2015; Nakamura et al., 2011; Schweitzer et al., 2001; Stricker et al., 2012; Vallecillo-García et al., 2017). Results showed that subsets C1-6 represent distinct chondrocyte subpopulations and groups CT0-CT2 represent distinct connective tissue fibroblast subpopulations (Fig. 1C,D,F; Fig. S1). Cells in subsets P1 and P3 expressed neither fibroblast nor chondrocyte markers and were therefore annotated as mesenchymal progenitors (Fig. 1C-F). A large gene signature defined these progenitor types including Hist1h2ao (Table S2), which was expressed by P1-P3 cells but not by any of the differentiated cell types (Fig. 1E,F). P1 cluster was characterized by co-expression of Hmga2, Asb4, Igdcc3, Msx1 and Lhx9. Msx1 and Lhx9 are transcription factors (TFs) that are expressed in limb mesenchyme and play a key role in limb patterning (Bensoussan-Trigano et al., 2011; Lallemand et al., 2005; Tzchori et al., 2009), whereas Hmga2 is widely expressed in undifferentiated cells during embryogenesis (Ulrike Hirning-Folz, Monika Wilda, Volkhard Rippe, Jorn Bullerdiek, 1998; Xianjin Zhou, Kathleen F. Benson, 1995). Asb4 is a ubiquitin ligase and Igdcc3 is a member of the immunoglobulin superfamily, with a suggested role in early embryogenesis (Salbaum, 1998). Interestingly, these progenitors lacked a spatial signature.
P2 cells expressed a gene module that largely overlapped with that of P1; however, P2 cells lacked the expression of Msx1 and Lxh9. Instead, this cluster was characterized by high expression of Shox2, Dlk1, Zfhx3-4, Scx, and Col3a1. Shox2 is a TF that is expressed at the proximal limb bud, where it acts as a patterning gene (Sun et al., 2013). The P3 subset was characterized by high expression of Hoxd13, Msx1, Lhx9 and Aldh1a2. The known role of these genes in regulating autopod patterning suggests that this cluster represents progenitors of the autopod (Fromental-Ramain et al., 1996; Scotti et al., 2015a). Overall, we identified three progenitor populations, two of them carrying a signature of either proximal or autopodial markers, whereas the third population lacked a typical signature. Subsets CT0-CT2 were characterized by the expression of Col5a1, Col3a1, Col1a1 and Osr1, all of which are markers of connective tissue/tendon. In addition, these cells also expressed Lgals1, which was implicated in modulating cell-cell and cell-matrix interactions. Other identified markers were Kctd12, encoding for a potassium channel, and tropomyosin (Tpm1). Interestingly, CT0 cells did not express the key tendon marker Scx. Instead, this cluster was characterized by expression of the markers Msx1, Hoxd13, and Aldh1a2, similar to the P3 signature. The transcriptional similarity between CT0 and P3 suggests that some P3 autopod progenitors give rise to CT0 cells. CT1 and CT2 were both characterized by the expression of tendon markers Scx, Dcn and Lum (Liu et al., 2015; Schweitzer et al., 2001). In addition, cells of both clusters expressed the TF Zfhx4, Zfhx3 marked CT1 cells, whereas CT2 was marked by high expression of Dlk1, Igfbp5 and Sparc. In the chondrocyte compartment, subsets C1 and C2 displayed low expression levels of cartilage-specific ECM genes (cECM), such as Col2a1, Col9a1-3, Col11a1-2 and Hapln1, indicating the early differentiation stage of these clusters. In line with this, C1 displayed high expression levels of Asb4, Shox2, Hmga2 and Igdcc3, which marked cluster P2, suggesting that cluster C1 originates from P2 cells. Cluster C2 displayed high expression of the TFs Ebf1 and Sfrp2, a soluble modulator of Wnt signaling, and of the autopod marker Hoxd13, suggesting that this cluster represents early autopodial chondrocytes. Subset C3 displayed expression of Hoxd13, intermediate levels of cECM genes, and high Tgfbi expression, suggesting that it represents more mature autopodial chondrocytes. Subset C4 displayed intermediate expression levels of cECM genes in combination with several fibroblast markers, specifically high expression of Col1a1 and low expression of Scx, Col3a1 and Lgals1. These results suggest a chondro-tendinous identity of these cells (Blitz et al., 2013; Sugimoto et al., 2013). Subset C5 displayed high expression levels of cECM genes along with the autopod marker Hoxd13. Therefore, it represents the most mature state of autopodial chondrocytes. Subset C6 displayed the highest expression levels of all cECM genes, thus likely to represent the most differentiated chondrocytes.
To elucidate the temporal dynamics of identified cell populations, we annotated the enrichment of each population over time. As shown in Figure 1G, dramatic changes in cell type composition were observed during development. While P1 was the most abundant cell population at E10.5 (52%), it gradually decreased until it was completely diminished by E14.5. Proximal progenitors (P2) also constantly decreased from 31% at E10.5 to 3% by E14.5. In contrast, autopodial progenitors (P3) were rare at E10.5 (4%), increased up to 21% by E12.5, and later on started to decline reaching 6% by E14.5. Among the differentiated cell types, early proximal chondrocytes (C1) were rare at E10.5 (8%). By E11.5, C1 increased to 20% and early autopodial chondrocytes (C2) appeared (5%), concurrently with the appearance of tendon fibroblasts CT1 (4%) and CT2 (3%). At E12.5, C1 dramatically decreased (6%), while C2, CT1 and CT2 increased (16%, 7%, and 6%, respectively). Additionally, at E12.5 we saw the appearance of more mature chondrocytes C3-C6 (8%, 3%, 2%, and 6%, respectively) and CT0 (5%). At E13.5, C1 continued to decrease (2%) along with a decrease in C2 (10%), whereas more mature chondrocytes (C3-C6), CT1, CT2 and CT0 increased (chondrocytes, 10%, 5%, 6%, and 7%, respectively; CT, 13%, 12%, and 10%, respectively). By E14.5, C1 was diminished, C2 continued to decrease (7%), C3 and CT1 displayed a slight reduction (8%, 12%), while C4-C6, CT2 and CT0 further increased (to 10%, 9%, 14%, 13%, and 15%, respectively).
Overall, these data reveal the main mesenchyme-derived cells types in the developing limb, as well as valuable cell type-specific markers for studying their differentiation trajectories and dynamics. We identified three populations of progenitors, including autopodial and proximal progenitors and a third progenitor population that lacks spatial signature. Interestingly, we failed to identify a zeugopodial progenitor population. Finally, the gradual reduction in progenitor cells and increase in differentiated cells suggests a progressive differentiation process in the developing limb.
Characterization of limb progenitors
Our meta-cell analysis identified three transcriptionally distinct populations of progenitors, namely P1, proximal P2 and autopodial P3 (Fig. 2A). To gain insight into transcriptional mechanisms and molecular pathways regulating these cells, we computationally extracted annotated MCs from these three subpopulations and computed the Pearson’s correlation coefficients for each pair of genes across all cells (see Methods). Hierarchical clustering of the correlation matrix (Fig. 2B; Table S3) revealed four gene modules. Module 1 was enriched for components of signaling pathways such as TGF-β/activin and BMP (Bmp2, Gdf5, Dlx5, Dlx6, Inhba, Bambi), Wnt (Wnt5a) and Fgf (Fgf12, Sp9), as well as for retinoic acid synthesis enzymes (Aldh1a2, Rdh10). This module also contained several TFs that regulate limb patterning (Msx1, Msx2, Lhx2 and Lxh9) as well as TFs that are essential specifically for autopod patterning, such as Hoxa13, Hoxd13, Hoxd12, Tfap2a, and Tfap2b (Shen et al., 1997; Zhao et al., 2011), thus representing an autopodial genetic program. Module 2 was enriched with genes involved in matrix formation (ccdc80, Lox, Eln) and calcium binding proteins (Egfl6, Sned1, Sparc, Piezo2) as well as with Wnt signaling components Dkk2 and Fzd8. These results suggest that this module represents a progressive stage of cell differentiation.
Module 3 was enriched for several signaling pathways, such as Wnt (Rspo4) and BMP (Grem1), and a subset of homeobox genes including Hoxd4, Hoxd8, Hoxd9, Hoxa11, and Shox2. Interestingly, this module was also enriched with genes associated with the maintenance of pluripotency and stem cell function, including Igdcc3, Sall4, Lin28b, Tfapc2 and Trim71, suggesting that this module represents a stemness genetic program (Chang et al., 2012; Melton et al., 2010; Pastor et al., 2018; Patterson et al., 2011; Rybak et al., 2009; Wang et al., 2006; Worringer et al., 2014; Yu et al., 2007; Zhang et al., 2006, 2016; Zhao et al., 2011). Module 4 was enriched for signaling pathways such as Igf (Igf2, Igfbp5, Igfbp3, Igf1) and Wnt (Ror1, Dact1). Additionally, it contained proximally expressed genes such as Meis1, Meis2, Pkdcc, Meox1, Pitx2, Emx2, and Irx3, thus representing a proximal gene program (Campbell et al., 2012; Delgado et al., 2020; Li et al., 2014; Mercader et al., 1999b, 2009; Pellegrini et al., 2001; Probst et al., 2011; Reijntjes et al., 2007; Vickerman et al., 2011). Interestingly, module 4 also contained genes associated with tendon and connective tissue formation, such as Scx, Tcf15, Osr2 and Cxcl12 (Nassari et al., 2017b; Wilson-Rawls et al., 2004).
We next examined the temporal activity of these four modules (Fig. 2C). Results showed that 8% of the autopodial module 1 gene expression came from E10.5 cells, 12% from E11.5 cells and 25-27% from E12.5-E14.5 cells. These results suggest that autopodial gene program is detectable already at E10.5 cells and becomes more prominent at E12.5-E14.5. In module 2, 8% of the gene expression was associated to E10.5 cells, with constantly increasing contribution at later developmental stages (15% at E11.5, 19% at E12.5, 24% at E13.5, and 34% at E14.5), suggesting that this module represents a late genetic program. In the stemness module 3, 45% of gene expression was associated to E10.5 cells, followed by a decline in contribution at later stages (28% at E11.5, 12% at E12.5, 8% at E13.5, and 6% at E14.5). These results indicate that cells turn off the expression of stemness genes as development proceeds. In the proximal module 4, gene expression was associated mostly to E10.5 and E11.5 cells (27% and 26%, respectively), whereas E12.5-E14.5 cells contributed 16%, 14%, and 16%, respectively. These results suggest that proximal genes are expressed by cells throughout this period.
Next, we utilized our gene module information to classify the three identified progenitor populations. To determine how these functional gene modules are distributed across cell types, we calculated scores of each module for each MC. Examination of proximal (4) vs autopodial (1) module scores showed that MCs of proximal P2 and autopodial P3 were completely separated (Fig. 2D). P2 was enriched with proximal module genes, whereas P3 was enriched with autopodial module genes, further confirming our annotation. Interestingly, P1 MCs had low levels of both proximal and autopodial scores, with some MCs overlapping with proximal or autopodial MCs. The location of P1 MCs between P2 and P3 with some overlap suggests that P1 MCs transit into proximal and autopodial progenitor states.
P1 MCs had higher levels of expression of the stemness module (3) scores, as compared to P2 and P3, which expressed intermediate and high expression of the late module (1) scores, respectively (Fig. 2E), further supporting the naïve state of P1. Finally, examination of proximal and stemness scores showed a clear separation between the three progenitor groups. P1 was characterized by high-to-intermediate stemness score combined with low proximal scores, whereas P2 was characterized by intermediate stemness scores combined with high proximal scores. P3 displayed the lowest levels of both proximal and stemness scores (Fig. S2A,B).
Overall, these results classify limb progenitors as proximal, autopodial and naïve, and uncover TFs and signaling circuits that regulate these populations. Temporally, we show that proximal and autopodial gene modules are active throughout the developmental process. Additionally, a transition from stemness to late genetic program was observed. To reveal the spatial distribution of these progenitors in vivo, we searched for specific markers for the three progenitor populations. For that, we first analyzed the most differentially expressed genes between proximal and autopodial progenitors (Table S4). As shown in Fig. 2F,G and Table S4, Shox2 and the known proximal markers Meis1 and Meis2 were upregulated in proximal progenitors, while Tfap2b, Hoxd13, Hoxa13 and Msx1 were among the most differentially expressed genes in the autopodial progenitors. We selected Shox2 and Hoxd13 as markers for our in situ experiments due to the higher expression levels of these two genes. Naïve progenitors displayed significantly lower levels of Hoxd13 (1.3 fold, p = 10.7 –log10; Table S5). Comparison of gene expression between naïve and proximal subpopulations showed that Msx1 was among the most differentially expressed genes by P1 cells (Fig. 2H; Table S6). Because Msx1 was also expressed by P3 cells, we examined the combination of Msx1 and Hoxd13. As seen in Fig. 2I, this combination clearly separated between P1 and P3 MCs. Thus, we defined Msx1+/Hoxd13- cells as naïve progenitors, Shox2+ cells as proximal progenitors and Msx1+/Hoxd13+ as autopodial progenitors.
To study the spatiotemporal distribution of the three progenitor populations during limb development, we conducted whole-limb triple in situ hybridization chain reaction (HCR) using Msx1, Shox2 and Hoxd13 probes. As seen in Fig. 2J, at E10.5, Msx1 was expressed in the outer margin of the limb forming an arc-like pattern along the anterior-posterior axis. The arc extended both dorsally and ventrally from the AP midline, more so dorsally (Fig. S2C-E). At the anterior-proximal side of the arc, Msx1 expression domain was the widest (Fig. 2J, Fig. S2F,G). At the dorsal side, the most posterior Msx1 expression domain overlapped with Hoxd13 expression domain, demarcating the location of the autopodial progenitors (Fig. 2J, Fig. S2H,I). Shox2 proximal progenitors were found at the core of the limb bud, encircled by the Msx1 expression domain.
At E11.5 (Fig. 2K, Fig. S2J-P), the arc-like pattern of Msx1 expression along the AP axis was maintained, as were the size asymmetries along the AP and DV axes. The overlap between Msx1 and Hoxd13 expression domains expanded dorsally and ventrally as well as anteriorly, occupying the most distal front of the limb. Shox2 expression domain extended throughout most of the proximal limb segment.
At E12.5 (Fig. 2L), Msx1 arc-like expression domain was still visible. Msx1 and Hoxd13 expression domains occupied the interdigital space and most of the outer margin of the autopod, with the exception of the anterior region of the developing thumb and a small posterior region, which were positive only for Msx1 (Fig. S2Q-R). Shox2 expression domain occupied most of the proximal limb segment. Areas of overlap between Shox2 and Msx1 and between Msx1 and Hoxd13 were observed for several days, supporting the transition of P1 cells into either P2 or P3 cells, as suggested by our analysis.
Together, these data suggest that P1 represents naïve multipotent progenitor cells, which are marked by Msx1 and are located in the outer margin of the forming limb. These cells differentiate into P2 proximal progenitors, marked by Shox2, and P3 autopodial progenitors, which are co-marked by Msx1 and Hoxd13. Another important finding is the co-existence of P2 and P3 cells at E10.5-E12.5, which suggests that during this time window, P1 cells transit progressively and simultaneously into P2 and P3 cells.
Msx1 marks the naïve progenitors of the limb
A central hypothesis raised by our single-cell data is that the transcription factor Msx1 marks the most naïve limb mesenchymal progenitors. If indeed this is the case, this TF should be expressed at the onset of limb development and its lineage should give rise to all mesenchyme-derived tissues, including cartilage, tendon and muscle connective tissue. To test this prediction, we first studied the expression of Msx1 at E9.5, the onset of limb development. As seen in Figure 3A and in agreement with previous studies (Coudert et al., 2005), Msx1 expression was observed in the cells of the forming forelimb. To examine directly the contribution of the Msx1 lineage to the different mesenchymal limb tissues, we utilized the previously described Msx1-CreERT2 knock-in mouse line (Lallemand et al., 2013) crossed with Rosa26-tdTomato (Madisen et al., 2010) and Scx-GFP (Pryce et al., 2007) mice. As seen in Figure 3B-F, a single dose of tamoxifen at E9.5 marked cells of the entire skeleton, tendons and muscle connective tissue in the E14.5 forelimb. These results confirm our single-cell data showing that Msx1 marks the naïve multipotent mesenchymal progenitors. Moreover, they confirm the high efficiency of the Msx1-CreERT2 knock-in allele in activating the Rosa26-tdTomato reporter.
Our single-cell data suggested that during developmental, Msx1 naïve progenitors progressively transition into proximal and autopodial progenitors. To demonstrate this process in vivo, we combined sequential pulse-chase genetic lineage tracing, using the Msx1-CreERT2; Rosa26-tdTomato mice, with whole-mount in situ HCR for Msx1. By applying sequential short (30 h) chase periods, we could follow the temporal dynamics of Msx1 cell differentiation, whereas in situ HCR provided the position of the Msx1 naïve progenitors at the end of the chase. Differences in position between Msx1+ naïve progenitors and their descendants would provide spatiotemporal information on the progression of this process. To demonstrate the transition of Msx1+ naïve progenitors to proximal progenitors we performed HCR for Shox2, which marks the latter cells.
At E10.5, while restricted Msx1 expression was observed at the outer margin of the limb along the anterior-posterior axis, tdTomato signal was detected throughout the limb, including in Msx1+ cells. Shox2 expression was observed in tdTomato+Msx1- cells, which were surrounded by tdTomato+Msx1+ cells (Figure 4A). These results indicate that at that stage, some of the pulsed cells maintained their naïve state, mostly at the margin of the limb, whereas the center was occupied by Msx1 lineage cells that had lost their naïve state and adopted a new fate, some as proximal progenitors. At E11-E11.5, Msx1 expression in the outer margin of the limb was maintained, whereas tdTomato-expressing cells were found in the anterior proximal domain but not in the proximal posterior domain. Shox2 expression was observed in the center of the limb, overlapping the tdTomato signal except at the posterior side. These results demonstrate the transition of Msx1 naïve progenitors into proximal progenitors between E10.5 and E11.5 (Fig.4B-C). At E12.5, Msx1 expression was observed in the autopod margin and interdigital space, demarcating metacarpals 2-5. tdTomato-expressing cells were found in the autopod margin and a few in the anterior proximal domain. Shox2 expression was observed in the proximal limb, stylopod and zeugopod, where the anterior and posterior sides adjacent to the autopod overlapped partially with the tdTomato signal (Fig. 3D), demonstrating that the transition from Msx1 naïve progenitors into proximal progenitors was still taking place between E11.5 and E12.5. Expression of Msx1 also by autopodial P3 progenitors raised the possibility that proximal P2 cells are derived from P3 as well. However, P3 cells also expressed Hoxa13, whose lineage was previous shown to contribute only to the autopod (Scotti et al., 2015b), negating this possibility.
Finally, we examined the level of cell fate stability of the Msx1+ naïve progenitors during development. For that, we compared between Msx1+ naïve progenitors from different time points for differentially expressed genes. As seen in Fig. S3A-F, Igdcc3, Asb4, and Hmga2 were found to be highly expressed by E10.5 and E11.5 P1 cells as compared to later stages, whereas E12.5 P1 cells displayed higher expression of the autopodial marker Hoxd13. Comparison between E12.5 and E13.5 P1 cells did not reveal differentially expressed genes. Overall, this analysis indicates that the naïve progenitors undergo a mild transcriptional change over time, as they largely maintain their transcriptional program.
Together, these results confirm and expand our single-cell data and show that Msx1+ progenitors are the most naïve multipotent mesenchymal progenitors, which give rise to all mesenchyme-derived tissues of the limb. We demonstrated the progressive transition of Msx1+ naïve progenitors to proximal progenitors, which takes place for several days, concomitantly with autopod development. Moreover, we showed that the addition of new proximal progenitors took place mainly on the anterior side and, to a lesser extent, on the posterior side.
Differentiation of Msx1 lineage cells to Sox9+ chondroprogenitors occurs progressively and simultaneously along the different skeletal segments
Having found progressive differentiation of Msx1+ naïve progenitors, we proceeded to study the dynamics of the differentiation of this lineage into chondroprogenitors by comparing the expression of Msx1 to that of Sox9, the earliest known chondro-osteogenic marker (Akiyama‡ et al., 2005; Bi et al., 1999; Lefebvre et al., 1997). For that, we established a chondrogenic gene module anchored to Col2a1, a bona fide chondrogenic marker (Benoit de CrombruggheU, Veronique Lefebvre and Weimin Bi, Shunichi Murakami, 2000), and used it to compute a chondrogenic score. Cells from each day were ordered by chondrogenic score and binned into 65 bins. The mean expression of Sox9 and Msx1 was calculated for each bin, and trend line and confidence interval were calculated (see Methods). This analysis revealed that at all sampling time points, cells with low chondrogenic score expressed high levels of Msx1 and low levels of Sox9. As cells progressed through differentiation, Sox9 expression was upregulated as expected, while Msx1 expression was downregulated (Fig. 5A,B). To validate this, we performed in situ HCR for Sox9 and Msx1 on E10.5-E12.5 forelimbs. As seen in Figure S4A-C and in agreement with the single-cell results, the expression domains of Msx1 and Sox9 were mutually exclusive, with slight overlap at the borders, which likely represents the transitional stage.
Our data analysis indicated that the transition from Msx1 to Sox9 expressing cells takes place in each of the examined days, suggesting that the differentiation of Msx1 naïve progenitors into chondroprogenitors is a progressive process. To demonstrate in vivo the spatiotemporal dynamics of this process, we combined sequential short (30 h) pulse-chase experiments using the Msx1-CreERT2; Rosa26-tdTomato mice with whole-mount in situ HCR for Msx1 and Sox9. At E10.5, Msx1 lineage cells populated the entire limb. Sox9 expression was observed in the center of the limb, within a tdTomato+Msx1- domain, which was surrounded by tdTomato+Msx1+ cells at the margin (Fig. 4C). Examination at E11.5 showed that Msx1 lineage cells populated most of the limb, excluding the most proximal posterior domain. Sox9 expression domain was observed in the center of the limb. Interestingly, the Sox9 expression domain overlapped with tdTomato+Msx1- cells along the entire forming skeleton, mostly on the anterior side (Fig. 5D,D’). tdTomato+Msx1+ cells were located at the autopod margin, surrounding the distal Sox9 expression domain. At E12.5, Msx1 lineage cells were observed at the autopod margin, with a small proximal extension on the anterior side. Sox9 expression demarcated the humerus, radius, ulna, and five metacarpals. Sox9 expression overlapped with tdTomato+Msx1- cells at the distal anterior radius, metacarpals 1 and 5 and tips of metacarpals 2-4. Areas of overlap between Sox9 expression and tdTomato+Msx1+ cells were detected in the lateral side of metacarpals 2-4 and at all metacarpal tips (Fig. 5E,E’).
Overall, these results support the temporally progressive differentiation of Msx1 lineage cells not only to proximal and autopodial progenitors, but also into Sox9+ chondroprogenitors. Moreover, this process is not restricted spatially, but rather occurs simultaneously along all segments of the developing skeleton.
The skeleton forms progressively and nonconsecutively in a complex three-dimensional pattern
Our observation that the progressive differentiation of Msx1+ naïve progenitors to chondroprogenitors occurs simultaneously in all segments of the developing limb skeleton prompted us to reexamine the order by which these segments form. The finding that Msx1 expression is lost once the naïve progenitors differentiate provided us with a unique opportunity to address this question. The rationale behind our approach was that the first element to form would be composed of descendants of progenitors that lost Msx1 expression first, whereas the last element to form would be composed of descendants of progenitors that were last to lose Msx1 expression. To follow temporally the loss of Msx1 expression by naïve progenitors, we performed consecutive pulse-chase lineage tracing experiments by administering single doses of tamoxifen to Msx1-CreERT2; Rosa26-tdTomato mice at E9.5, E10.5, E11.5 or E12.5. To determine the exact 3D spatial distribution of tdTomato-positive cells in the forming skeleton, we cleared limbs of E13.5 embryos and imaged them using light sheet microscopy.
As seen in Figure 6A, tamoxifen administration at E9.5 resulted in tdTomato labeling of the entire skeleton. However, pulsing at E10.5 (Fig. 6B) resulted in loss of tdTomato signal in most of the scapula, ventral humerus (Fig. S5A) and radius. tdTomato signal was observed in the acromion, humeral head and deltoid tuberosity, dorsal humerus, ulna, and in the entire autopod (Fig. 6B, Fig. S5B). Pulsing at E11.5 (Fig. 6C) resulted in loss of tdTomato signal in acromion and humeral shaft and metacarpals of digits 3 and 4 (Fig. S5C). Still, tdTomato signal was detected at the humeral head and deltoid tuberosity, radius and most of the digits. Finally, following pulsing at E12.5 (Fig. 6D), tdTomato signal was lost in the radius and metacarpals, but remained at the tips of the growing digits (Fig. S5D). These results indicated that the skeleton forms progressively in a complex pattern and not linearly along the proximal-distal axis.
To better understand this process, we examined in greater detail the tdTomato signal in the humerus and radius. As seen in Figures 6E,M’’’, in the humerus, pulsing at E10.5 led to the loss of tdTomato signal at the ventral side from head to medial epicondyle, whereas the dorsal side was tdTomato-positive from head to lateral epicondyle. Following pulsing at E11.5, the areas of the humerus that lost tdTomato signal were in the dorsal shaft and lateral epicondyle, whereas the deltoid tuberosity and the dorsal side of humeral head were still tdTomato-positive (Fig. 6J-N’’’).
In the radius, tamoxifen administration at E10.5 resulted in tdTomato labeling throughout the bone (Fig. 6O-S’’’). However, pulsing at E11.5 resulted in loss of tdTomato signal in almost the entire ventral side, with few labeled cells on its distal tip (Fig. 5T). On the dorsal side, the proximal posterior side of the radius was mostly tdTomato-negative, while the anterior side still displayed extensive labeling (Fig. 6U-X’’’). These results indicate that the ventral side of the radius forms first, followed by a diagonal AP direction of dorsal radius formation. These results demonstrate that the skeleton form in a complex pattern that extends to the level of the single element.
To validate the results of the Msx1 lineage experiments, we examined the spatiotemporal dynamics of chondroprogenitor differentiation in the limb by following the induction of Sox9 expression. For that, we used the Sox9-CreERT2 mice, which were previously shown to efficiently drive the expression of Rosa26-lacZ reporter in the developing skeleton (Soeda et al., 2010). Because both our single-cell analysis and lineage studies showed that at E10.5, only part of the Sox9 chondroprogenitors were differentiated (Fig. 6B), we decided to activate Cre activity at this time point. Thus, we administered a single dose of tamoxifen at E10.5 and harvested the limbs 30 h and 72 h afterwards. E11.5 whole-limbs were stained for Sox9 mRNA using in situ HCR, whereas E13.5 whole limbs were stained for SOX9 protein and imaged using light sheet microscopy. As shown in Figure 7A-E’’’, tdTomato signal was observed in most of the scapula, and only on the ventral-posterior side of the humerus and ulna, whereas the radius and autopod were tdTomato-negative. At 72 h post-induction (Fig.67F-J’’), tdTomato labeling was seen in most of the scapula, the entire humeral shaft and ulna; however, the acromion, humeral head, deltoid tuberosity, lateral and medial epicondyles, elbow and most of the radius and digits were tdTomato-negative. These results further support the data obtained using the Msx1-CreERT2 line, demonstrating that the skeleton forms nonconsecutively in a complex pattern that involves not only the PD axis, but also the DV and AP axes, extending to the level of the single element.
DISCUSSION
In this work, we revisit the long-standing question of the spatiotemporal sequence of limb development using modern molecular tools. We generated a comprehensive cellular atlas of the limb mesenchymal cell lineages during development. Using this atlas, we identified a population of naïve progenitors and their progressive and simultaneous transition into proximal and autopodial progenitors. We establish Msx1 as a marker of naïve progenitors and localize them to the outer margin of the developing limb, along the anterior-posterior axis. We then showed that the descendants of these progenitors progressively contribute to the entire forming skeleton. Finally, temporal analysis of the differentiation of naïve progenitors revealed that the skeleton forms progressively in a complex 3D pattern, which extends to the single element level (Fig. 8).
Our systematic single-cell analysis of limb mesenchymal cells revealed a pool of naïve progenitors. These cells transit initially to proximal progenitors and, shortly afterwards, also to autopodial progenitors. The finding that the pool of Msx1 naïve progenitors is maintained for several days suggests that this transition occurs progressively. These finding correspond with some aspects of previously suggested models of limb development. The existence of naïve limb progenitors and their progressive transition into progenitors of the different limb segments was suggested by the progress zone model (Saunders, 1948; Summerbell et al., 1973; Wolpert, 2002). The coexistence of the different progenitors of these segments was suggested by the early specification model, whereas the progressive and concurrent specification of proximal and distal fates is consistent with the two-signal model (Mariani et al., 2008; Mercader et al., 1999a, 2000). In this respect, our findings integrate the three models.
Because the limb skeleton comprises three segments, it was expected to originate from three different pools of progenitors. However, in line with previous studies that failed to identify zeugopod-specific markers (Tabin and Wolpert, 2007b), we also were unable to subdivide the proximal gene program into stylopod- and zeugopod-specific. This finding raises the question of the mechanism underlying the formation of the stylopod and zeugopod as two separate segments.
Recently, several works studying limb development at single-cell resolution have been published (Desanlis et al., 2020a; Feregrino et al., 2019; Kelly et al., 2020). Our work is unique in that we sampled cell from the entire limb daily from E10.5 up to E14.5, providing a complete and continuous representation of cell states and transcriptional changes that take place in the limb during this critical stage of limb patterning and differentiation.
Our single-cell data revealed a set of markers for the three progenitor populations we identified. While Msx1, Lhx9, and Lhx2 marked the naïve progenitors, Msx1 lineage cells gave rise to all mesenchyme-derived limb tissues. These TFs are involved in FGF, BMP and Shh signaling, major pathways that regulate patterning along the PD, AP and DV axes (ALAPPAT et al., 2003; Bensoussan-Trigano et al., 2011; Lallemand et al., 2005, 2009; Tzchori et al., 2009; Watson et al., 2018; Yang and Wilson, 2015). Interestingly, we found that the naïve progenitors largely maintain their transcriptional program during limb development. For the proximal progenitors, we identified a set of markers that includes Meis1 and Meis2, two well-known proximal markers (Capdevila et al., 1999; Delgado et al., 2020; Mercader et al., 1999b, 2009), Shox2, Pkdcc and many other genes. The validity of Shox2 as a marker for proximal cells is supported by lineage studies showing that Shox2 lineage gives rise to the proximal part of the limb, ending at the wrist (Sun et al., 2013). For the autopod progenitors, we identified a set of markers that included Hoxd13, Hoxa13 and Hoxd12. These genes are expressed specifically in the autopod and play an essential role in digit identity and patterning (Desanlis et al., 2020b; Fabre et al., 2018; Fromental-Ramain et al., 1996; Knezevic et al., 1997; Scotti et al., 2015a; Sheth et al., 2007; Zákány et al., 1997). These markers were co-expressed with Msx1, Msx2, Lhx9, and Lhx2, which we show to be important for autopod patterning, suggesting a functional link between these two groups of genes.
The identification of markers for the three progenitor populations allowed us to study their spatiotemporal distribution during limb development. As suggested by the single-cell analysis results, we found that the naïve progenitors marker Msx1 was expressed throughout the process in an arc-like pattern along the anterior-posterior axis, as well as dorsally and ventrally away from the AER. This finding suggests that the naïve progenitors maintain their location through development. Moreover, it suggests that their progressive transition to the different lineages may not be restricted to the limb apex, but can occur along the anterior-posterior axis. Indeed, sequential pulse-chase experiments clearly showed that between E9.5 and E11.5, Msx1 lineage cells populated extensive areas in the proximal side of the limb. Moreover, they overlapped with both proximal and autopodial progenitors. Proximal progenitors were initially located in the center and later expanded proximally, whereas autopodial progenitors were initially located distally in the posterior-dorsal side and later expanded anteriorly. We therefore suggest that the progressive transition of naïve progenitors into proximal and autopodial fates occurs along the length of the limb, allowing simultaneous transition into these identities.
The establishment of limb progenitor identities and their differentiation to chondroprogenitors can follow two scenarios. One possibility is that the two processes are separated temporally, such that differentiation starts only after all progenitor identities have been established. Alternatively, a progressive process of identity establishment may coincide with the differentiation of the two progenitor pools into chondroprogenitors. Our findings indicate that the differentiation of naïve Msx1+ progenitors into chondroprogenitors is progressive and that the transition of these naïve progenitors into proximal and autopodial progenitors coincides with the differentiation of these two progenitor pools into chondroprogenitors, suggesting that these processes overlap temporally. Strong support for this possibility is our observation that at E10.5-E11.5, both Msx1, which marks the naïve progenitors, and Sox9, which marks chondroprogenitors, were expressed in the developing limb. Other pieces of evidence that are consistent with this scenario came from genetic lineage tracing analyses in mice. We previously showed that different skeletal elements form progressively by continuous addition of Sox9+ cells (Blitz et al., 2013; Eyal et al., 2019)
In this work, we provide several pieces of evidence to support the conclusion that Msx1 is a marker for the naïve mesenchymal progenitors. These include our single-cell analysis and lineage tracing using Msx1-CreERT2 mice. This knock-in allele was previously shown to drive an identical pattern of Cre expression as the endogenous Msx1 gene (Lallemand et al., 2013). The combination of a reliable marker, mouse line and the finding that Msx1 expression by naïve mesenchymal progenitors is lost once they differentiate provided us with a unique opportunity to study the order by which the skeleton forms. If the common view is correct and the skeleton forms in a proximal-to-distal direction, then progenitors of the proximal stylopod should be first to lose Msx1 expression and differentiate, followed by zeugopod progenitors and, lastly, by autopod progenitors. However, the pattern that we observed was much more complex and nonconsecutive, as skeletogenesis occurred simultaneously progressing from multiple foci along the limb. This finding indicates that in addition to the formation of skeletal elements along the PD axis, there is also strong contribution along the AP and DV axes. An example for the complexity of the process is our finding that the posterior half of the humerus formed first together with ulna, whereas the anterior side of the humerus formed later together with the radius. Further support for this notion is the similar results we obtained studying the order by which the skeleton forms using the spatiotemporal elevation of Sox9 expression in chondroprogenitors.
In summary, our findings suggest a new model for limb and skeleton development. At its core is the principle that limb development involves progressive and simultaneous transition of naïve limb progenitors into either proximal or autopodial progenitors, which then progressively differentiate into Sox9+ chondroprogenitors. This process occurs simultaneously at different locations along the limb, suggesting that the skeleton forms progressively from multiple foci in a complex 3D pattern.
METHODS
Animals
All experiments involving mice were approved by the Institutional Animal Care and Use Committee (IACUC) of the Weizmann Institute. The generation of Sox9-CreERT2 (Soeda et al., 2010), Scx-GFP (Pryce et al., 2007), Msx1-CreERT2 (Lallemand et al., 2013), Sox9-GFP (Chan et al., 2011) and Rosa26-tdTomato (Madisen et al., 2010) mice has been described previously. For fluorescence-activated cell sorting (FACS) experiments, Sox9-CreERT2-tdTomato;Scx-GFP or Sox9-GFP mice were crossed with Rosa26-tdTomato;Scx-GFP or C57BL/6 mice, respectively. For lineage tracing experiments, Msx1-CreERT2 were crossed with Rosa26-tdTomato or with Rosa26-tdTomato;Scx-GFP reporter mice. Plug date was defined as E0.5.
Induction of Cre recombinase was performed at indicated pregnancy stages by administration of 5 mg/ml tamoxifen in corn oil X5 body weight by oral gavage. For harvesting of embryos, timed-pregnant females were euthanized by cervical dislocation.
Cell isolation and flow cytometry
Single-cell experiments were performed on forelimbs from E10.5, E11.5, E12.5, E13.5 and E14.5 mouse embryos. For collection of E10.5 cells, Sox9-GFP and Sox9-CreERT2-tdTomato;ScxGFP (without tamoxifen induction) mice were used. For collection of E11.5-E14.5 cells, Sox9-CreERT2-tdTomato;ScxGFP mice were used 48 h after Cre induction. Forelimbs were dissected and minced in cold PBS using small scissors. For each biological replicate, forelimbs of embryos from the same litter were pooled together (six forelimbs at E10.5 and E11.5, four forelimbs at E12.5 and E13.5, and two forelimbs from one E14.5 embryo). Forelimb tissues were disassociated using enzymatic digestion. E10.5-E11.5 forelimbs were digested with pre-heated 0.25% trypsin in DMEM medium (ThermoFisher) and incubated for 10 min at 37°C, gently pipetting every 3 min. E12.5-E14.5 forelimbs were digested with 1.5 mg/ml collagenase type V (Sigma-Aldrich) in DMEM at 37°C for 10-15 min, gently pipetting every 5 min until the tissue completely dissolved. The digestion reaction was stopped by addition of DMEM supplemented with 10% FBS and 1% Pen-Strep. Cell suspensions were filtered through a 40-µm nylon mesh and collected by centrifugation at 1000 rpm for 7 min at 4°C. Supernatant was removed and cells were resuspended in 500 µl ice-cold MACS buffer (with 0.5% BSA and 2 mM EDTA in PBS) and used immediately for FACS.
Flow cytometry analysis and sorting were performed using an AriaFusion instrument (BD Biosciences, San Jose, CA) equipped with 488, 407, 561 and 633 nm lasers, using a 100-µm nozzle. Sorting gates and fluorescence compensation were defined based on GFP, tdTomato single-stained and unstained control cells. Live cells were gated using DAPI staining (1 μg/ml) and by size and granularity using FSC-A versus SSC-A. FSC-W versus FSC-A was used to further distinguish single cells. Unstained, GFP-stained only and tdTomato-stained only cells were mixed in various combinations to verify that the analysis excluded false-positive doublets. GFP was detected by excitation at 488 nm and collection of emission using 502 longpass (LP) and 530/30 bandpass (BP) filters. tdTomato was detected by excitation at 561 nm and collection of emission using a 582/15 BP filter. DAPI was detected by excitation at 407 nm and collection of emission using a 450/40 BP filter. Data were collected and analyzed using BD FACSDiva software v8.0.1 (BD Biosciences). For single-cell RNA-seq (Jaitin et al., 2014), live cells were sorted into 384-well cell capture plates containing 2 μL of lysis solution and barcoded poly(T) reverse- transcription primers. In each plate, four empty wells were used as a control. Immediately after sorting, each plate was spun down to ensure cell immersion into the lysis solution, snap frozen on dry ice and stored at −80°C until processed.
Massively parallel single-cell RNA sequencing (MARS-Seq)
FACS-sorted cells were used for single-cell library preparation according to MARS-seq protocol, as described in (Jaitin et al., 2014). Briefly, mRNA from cells sorted into capture plates was barcoded, converted into cDNA and pooled using an automated pipeline. The pooled sample was then linearly amplified by T7 in vitro transcription and the resulting RNA was fragmented and converted into sequencing-ready library by tagging the samples with pool barcodes and Illumina sequences during ligation, reverse transcription and PCR. Each pool of cells was tested for library quality and concentration was assessed as described in (Jaitin et al., 2014).
Low-level processing and filtering
RNA-seq libraries were sequenced by Illumina NextSeq500 at a median sequencing depth of 58,585 reads per single cell. Sequences were mapped to mouse reference genome (mm10), demultiplexed, and filtered as previously described (Jaitin et al., 2014), with the following adaptations. Mapping of reads was done using HISAT version 0.1.6 (Kim et al., 2015) and reads with multiple mapping positions were excluded. Reads were associated with genes if they were mapped to an exon defined by a reference set obtained from the UCSC genome browser extended by up to 2 kb for complete 3’ peak acquire. Noise level was estimated statistically on empty MARS-seq wells; median estimated noise over all experiments was 2%. Cells with less than 600 UMIs were discarded from the analysis. After filtering, cells contained a median of 2,800 unique molecules per cell. All downstream analysis was performed in R.
Metacells modeling
We used the metacells pipeline (Baran et al., 2019) with the following specific parameters (complete script reproducing all analyses from raw data is available in GEO: GSE185940). We removed mitochondrial genes, genes linked with poorly supported transcriptional models (annotated with the prefix “RP-“) and cell cycle genes, which were identified by correlation coefficient of at least 0.1 for one of the anchor genes Mki67, Hist1h1d, Pcna, Smc4, or Mcm3. We then filtered cells with total fraction of mitochondrial gene expression exceeding 30% and cells with high (> 64) expression of hemoglobin genes (Hba-a2, Hba-a1, Hbb-b2, Hba-x, Hbb-b1). Gene features were selected for analysis using the parameter Tvm=0.2.
The gene selection strategy produced 425 marker gene features for the computation of the metacells balanced similarity graph. We used K = 150, 500 bootstrap iterations and otherwise standard parameters (500 iterations; resampling 70% of the cells in each iteration, and clustering the co-cluster matrix with minimal cluster size set to 20). We applied outlier filtering.
The resulting metacells model was annotated using the metacells confusion matrix and analysis of known marker genes. Muscle, epidermal, Schwann and immune metacells were excluded from further analysis. Next, we applied again the metacells pipeline on the remaining cells with the above-mentioned parameters.
To annotate the resulting metacells into cell types, we used the metric FPgene,mc, which signifies for each gene and metacells the fold change between the geometric mean of this gene within the metacells and the median geometric mean across all metacells, thus highlighting for each metacells genes that are highly overexpressed as compared to the background. Finally, we hierarchically clustered the FP most significantly changing gene table along with a set of known marker genes to identify the major cell populations.
Defining progenitor gene module signatures and scores
To define the gene signatures of progenitor cells, we first identified modules of co-expressed genes by Pearson’s correlation across the metacells log2 FPgene,mc expression of the 175 most variable genes. The signature genes for each progenitor state were defined by module scores. Scores were calculated for each metacell by averaging the log enrichment scores (lfp values) of the genes in the module. This approach limited the contribution of highly expressed genes to the score.
Calculation of a chondrogenic score
To define a chondrogenic gene signature, we identified a list of genes correlated with Col2a1 using linear correlation over metacells log enrichment scores. To avoid over-fitting of the modeling, TFs were excluded from the list. We used this chondrogenic signature for calculation of chondrogenic score across all cells. Chondrogenic scores were calculated for each cell by averaging the metacells log enrichment scores (lfp values) of the genes in the signature. Finally, to study gene expression during chondrogenic differentiation, cells were arranged by increasing ranges of signature scores and binned into 65 bins. To validate our approach, we tested differential expression of TFs that were not part of the signature gene set and are upregulated during chondrogenesis.
Statistical Analyses
Differential gene expression analysis was performed on log2 sum of UMIs normalized by reads per cell, divided by cell number. P-values were calculated using Wilcoxon test to compare between mean expressions of metacells (for Fig. 2) or cells (for Figs. 3,4).
PACT clearing
For sample preparation, E9.5-E14.5 embryos were harvested from Bl6, Msx1-CreER; Rosa26-tdTomato, Scx-GFP and Msx1-CreERT2; Rosa26-tdTomato timed-pregnant females and fixed in ice-cold 4% PFA in 1x PBS overnight. PFA-fixed embryos were dissected and forelimbs were cleared using PACT method (Treweek et al., 2015; Yang et al., 2014). Briefly, samples were washed in PBS, then incubated in hydrogel solution containing 4% (wt/vol) acrylamide in 1x PBS with 0.25% thermal initiator 2,2’- azobis[2-(2-imidazolin-2-yl)propane]dihydrochloride (Wako, cat. no. VA-044) at 4°C overnight. The next day, hydrogel was polymerized at 37°C for 3 hours. The samples were removed from the hydrogel, washed in PBS, and moved to 10% SDS with 0.01% sodium azide, shaking (60 rpm) at 37°C for 1-5 days, changing the SDS solution each day. Cleared samples were washed three times for 5 min with 1× PBST (PBS + 0.1% Triton X-100 + 0.01% sodium azide) at room temperature and then subjected to whole-mount in situ HCR or whole-mount SOX9 immunostaining.
Whole-mount immunostaining
To detect SOX9, samples were first incubated with proteinase K (Millipore Sigma, P9290) for 10 min at room temperature, washed and post-fixed again in 4% PFA. Then, samples were incubated with 5% goat serum, 1% BSA dissolved in PBST at 4°C overnight in order to block non-specific binding of immunoglobulin. Next, samples were incubated with primary anti-SOX9 antibodies (1:100, AB5535 Millipore Sigma) in 5% goat serum, 1% BSA dissolved in PBST shaking at 37°C for 5 days. Samples were washed four times for 2 h with 1× PBST at room temperature. Next, samples were incubated with secondary Cy5 antibodies (1:100, 715-165-150, Jackson ImmunoResearch) and 1:100 DAPI (1 mg/ml) in 5% goat serum, 1% BSA dissolved in PBST shaking at 37°C for 2 days. Samples were washed four times for 2 h with 1× PBST at room temperature and then prepared for light sheet imaging. To bring the refractive index (RI) of the sample to 1.45, it was submersed in a refractive index matching solution (RIMS) prepared by dissolving 35 g of Histodenz (Millipore Sigma, D2158) in 30 ml 0.02 M phosphate buffer, shaking gently at room temperature for 1-2 days. Finally, samples were embedded in 1% low gelling Agarose (Millipore Sigma, A9414) in PBS in a glass capillary, submerged in RIMS and stored protected from light at room temperature until imaging.
Whole-mount in situ hybridization chain reaction (HCR)
The Msx1 (NM_010835.2), Shox2 (NM_001302358.1), Hoxd13 (NM_008275.4) and Sox9 (NM_011448.4) probes and DNA HCR amplifiers, hybridization, wash and amplification buffers were purchased from Molecular Instruments. In situ HCR v3.0 was performed using the protocols detailed in www.molecularinstruments.com. Briefly, PACT-cleared samples were pre-incubated with hybridization buffer and incubated overnight at 37⁰C, 60 rpm with probe solution containing 1 µL of each probe in 250 µL of pre-heated probe hybridization buffer. The next day, probes were washed four times for 15 min at 60 rpm with pre-heated wash buffer, followed by two 5-min washes at room temperature with 5xSSCT. Next, samples were pre-amplified with 250 µL of amplification buffer for 5 min at room temperature and incubated with 250 µL of hairpin mixture (5 µL of hairpin h1 and hairpin h2 from 3 µM stock for each probe) overnight in the dark at room temperature. The following day, samples were washed with 5xSSCT two times for 5 min, two times for 30 min and once for 5 min at room temperature, gently shaking. For nuclear staining, samples were incubated with 1:100 DAPI/PBS solution (DAPI stock, 1mg/ml) overnight at 4⁰C, gently shaking. Finally, samples were washed twice with 2XSSC for 5 min at room temperature gently shaking and prepared for light sheet imaging as described above for SOX9-immunostained samples.
Light-sheet fluorescence microscopy
Samples were imaged with a Zeiss Lightsheet Z.1 microscope. For each limb, a low-resolution image of the entire limb was taken with the 20× Clarity lens at a zoom of 0.36. Light-sheet fusion of images was done if necessary in Zen software (Zeiss). Tile stitching and 3D image reconstruction were performed using Imaris software (Bitplane).
Contributions
S.M. designed, performed, and analyzed experiments; performed flow cytometry sorting experiments, annotated and interpreted the single-cell data, performed and analyzed imaging experiments. M.Z. performed flow cytometry sorting experiments and library preparations. E.D and A.G performed single-cell bioinformatics analyses. S.M, M.Z, E.D, A.G, I.A and E.Z generated the figures. S.M, E.Z and I.A wrote the manuscript. E.Z and I.A supervised the project. All authors discussed the results and commented on the manuscript at all stages.
Ethics declarations
Competing interests
The authors declare no competing interests.
Acknowledgements
We thank Nitzan Konstantin for expert editorial assistance, Neria Sharabi from the Department of Veterinary Resources, Weizmann Institute for his help with mouse maintenance, Dr. Yoseph Addadi and Ofra Golani from the MICC Cell Observatory unit at Life Sciences Core Facilities, Weizmann Institute, for their guidance and assistance with light sheet imaging experiments and analyzes, and Dr. Tomer Meir Salame from the flow cytometry unit at Life Sciences Core Facilities, Weizmann Institute, for his assistance with FACS. We thank Efrat Davidson from the Weizmann Institute Department of Design, Photography and Printing for designing the graphic model. Special thanks to all members of the Zelzer and Amit laboratories for encouragement and advice. I.A. is an Eden and Steven Romick Professorial Chair, supported by the Merck KGaA, Darmstadt, Germany, the Chan Zuckerberg Initiative (CZI), the HHMI International Scholar award, the European Research Council Consolidator Grant (ERC-COG) 724471- HemTree2.0, an SCA award of the Wolfson Foundation and Family Charitable Trust, the Thompson Family Foundation, an MRA Established Investigator Award (509044), the Helen and Martin Kimmel award for innovative investigation, the NeuroMac DFG/Transregional Collaborative Research Center Grant.
This study was supported by grants from the David and Fela Shapell Family Center for Genetic Disorders and by The Estate of Mr. and Mrs. van Adelsbergen (to E.Z).