Abstract
The transcription factor SOX9 is a key regulator of multiple developmental processes, and is frequently re-expressed in non-small cell lung cancer (NSCLC). Its precise role in the progression of NSCLC histopathologies has however remained elusive. We show that SOX9 expression relates to poor outcome and invasive histopathology in human adenocarcinomas, and is absent in murine early minimally invasive and human in situ adenocarcinoma. Interestingly, despite wide SOX9 expression across advanced NSCLC histotypes, its genetic deletion in the murine KrasG12D;Lkb1-/- model selectively disrupted only the growth of papillary NSCLC, without affecting the initiation of precursor lesions or growth of mucinous or squamous tissue. Spatial tissue phenotyping indicated a requirement of SOX9 expression for the progression of surfactant protein C-expressing progenitor cells, which gave rise to papillary tumours. Intriguingly, while SOX9 expression was dispensable for squamous tissue formation, its loss in fact led to enhanced squamous tumour metastasis, which was associated with altered collagen IV deposition in the basement membrane. Our work therefore demonstrates histopathology-selective roles for SOX9 in NSCLC progression, namely a requirement for papillary adenocarcinoma progression, but opposing metastasis-suppressing function in squamous histotype tissue. This attests to a pleiotropic SOX9 function, linked to the cell of origin and microenvironmental tissue contexts.
Introduction
Lung cancer remains the leading cause of cancer-related mortality, the majority of which represent non-small cell lung cancer (NSCLC; 85%). NSCLC is subdivided via its two major histotypes, adenocarcinoma (AC) and squamous cell carcinoma (SCC), of which the AC group is most heterogeneous [1–4]. ACs are further subdivided based on the tissue’s predominant histotype structure, particularly papillary AC (PAC), mucinous AC (MAC), and acinar AC (AAC) [5]. Certain driver mutations are selectively enriched in these AC subtypes. For example, KEAP1 mutation or NRF2 overexpression occurs more frequently in PACs when compared to total NSCLCs, and KRAS, EGFR or ALK mutations have respectively been associated with mucinous, lepidic growth, or signet ring structure [6, 7]. Molecular and genetic functions therefore need to be interpreted within these histopathology-specific tissue contexts.
The transcription factor SOX9 is a member of the SOX (sex-determining region Y-related) family [8], and regulates a variety of developmental processes [9–11]. During murine lung development, SOX9 is expressed in a timed manner to coordinate lung branching morphogenesis and alveolar differentiation [12, 13]. SOX9 is silenced in adult tissues but frequently re-expressed in cancer, where it has been associated with increased cell proliferation, transcriptional reprogramming and self-renewal of stem cells, and increased cell invasiveness and drug resistance [14–17]. The SOX9 gene resides on the long arm of chromosome 17 (17q24.3), which is frequently gained in non-smokers with lung AC [18]. In patients, elevated SOX9 expression has been associated with poor survival and advanced clinical NSCLC stage, and its expression was shown to promote the proliferation of AC cells in vitro through affecting the CDKN1A and CDK4 cell cycle regulators [19, 20]. However, while many studies find oncogenic roles for SOX9, supporting its therapeutic targeting, also the downregulation of SOX9 has been linked to relapse of prostate cancer [21] and stage II colon cancer [22], indicating that SOX9 has pleiotropic functions. Despite NSCLC constituting a heterogeneous set of diseases, the potential histopathology-selective role of SOX9 in NSCLC progression has thus far not been studied.
Distinct sets of developmental signals and transcription factors are expressed in different lung progenitor cells, and thereby exert lineage-specific functions [23]. In recent years, the ways in which different cells of origin influence NSCLC progression have been addressed by applying genetically engineered mouse models [24]. Using adenovirus-mediated Cre-recombination, we previously showed that conditional KrasG12D expression combined with loss of Lkb1 in bronchial progenitors expressing Clara cell antigen 10 (CC10) gives rise to a wide NSCLC histotype spectrum encompassing minimally invasive AC (IAC), MAC, AAC, PAC as well as adenosquamous carcinoma (ASC) [25], a rare NSCLC subtype exhibiting both SCC and AC features [26]. To the contrary, in alveolar type II (AT2) progenitors expressing surfactant protein C (SPC), the same genetic alterations mainly lead to IAC and PAC formation [25]. To study the histopathology-selective roles of SOX9 expression in NSCLC, we established heterozygous and homozygous conditional Sox9 null KrasG12D;Lkb1-/- cohorts, and examined how Sox9 loss influenced the progression of lesions of distinct histotypes. We describe opposing histopathology-selective roles of SOX9 in NSCLC progression and metastasis, and link these to the cell of origin and tumour microenvironment contexts, with ramifications for future directions in therapeutic intervention.
Materials and Methods
Human and murine tumour tissues
Breeding of KrasG12D/+;Lkb1flox/flox (KrasG12D;Lkb1-/-) mice, genotyping, and lung tumour initiation by intranasal infection with progenitor cell-directed Ad5-Cre viruses were performed as previously described [23]. KrasG12D;Lkb1-/-;Sox9+/- and KrasG12D;Lkb1-/-;Sox9-/- cohorts were generated by crossing KrasG12D;Lkb1-/- and Sox9flox mice (B6.129S7-Sox9tm2Crm/J, The Jackson Laboratory). Sox9 genotypes were determined using the supplied protocol 29713 (standard PCR Assay - Sox9<tm1Crm>, The Jackson Laboratory). Animal handlings and studies were performed according to the guidelines from the Finnish National Board of Animal Experimentation, approved by the Experimental Animal Committee of the University of Helsinki and the State Provincial Office of Southern Finland (ESAVI/6365/2019). Formalin fixed paraffin embedded (FFPE) samples of human NSCLC SCC, ASC and PAC lesions were collected from the Helsinki Biobank’s pathology sample archive (Helsinki Biobank project number HBP2016002), and a tissue microarray (TMA-1) was constructed as previously described [27]. Tumour stages were determined by a certified thoracic pathologist, following American Joint Committee on Cancer (AJCC) criteria. Detailed information and raw data from the murine and human NSCLC sample analyses are provided in the supplementary material, Table S2 and S3. The large human adenocarcinoma TMA collection (TMA-2; 970 samples) was obtained from a continuous retrospective cohort of archival human lung adenocarcinoma tissue from University Hospital Leicester, under research ethics committee approval. Clinicopathological and follow-up data was obtained from regional and national databases, and all morphological assessments were performed by a subspecialty consultant diagnostic histopathologist.
Tissue processing, immunohistochemistry (IHC) and image acquisition
Murine lung tumours were processed and embedded in paraffin as previously described [23]. FFPE tissue blocks containing whole murine lung or TMA blocks were cut as 4 μm thick sections. Two sequential sections were placed on one glass slide (SUPERFROST® PLUS, Thermo scientific, Ref J1800AMNZ), and labelled to mark the sectioning order. Adjacent tissue sections were each stained with one antibody. Standard IHC was performed with the primary antibodies listed in supplementary material, Table S1. For routine signal development, BrightVision poly-HRP goat anti-rabbit IgG (Immunologic B.V., no dilution) was incubated at room temperature for 30 min. For developing the SOX2 signal, DaG-HRP (Santa Cruz Biotechnology sc2020, 1:100) was incubated at room temperature for 30 min. For signal detection, Bright-DAB Substrate Kit (Immunologic B.V.) was used. Whole slide bright-field scans of stained murine and clinical tissue sections were acquired by a digital slide scanner (Pannoramic 250, 3DHISTECH Ltd) using a 20x objective. 3 μm sections from the TMA-2 human adenocarcinoma cohort were duplex stained using a Ventana autostainer with antibodies for cytokeratin AE1/AE3 (1:250, #NCL-L-AE1/AE3, Leica detected with Roche discovery yellow chromogen) and SOX9 (1:10000, AP5535, Millipore detected with Roche discovery purple chromogen). Up to three 1 mm cores per tumour were analysed. Images of haematoxylin-counterstained images were acquired using a Hamamatsu nanozoomer.
Region-of-interest (ROI)-specific image analysis
Histopathology classification was performed in conjunction with an expert pathologist as previously described [23, 24]. The sizes of individual tumours and lung tissues were measured using Pannoramic CaseViewer (3DHISTECH Ltd). For image analysis, TIFF images of murine lungs with hematoxylin and eosin (H&E) and biomarker stainings on whole slide scans were exported using the Pannoramic CaseViewer (3DHISTECH Ltd) at a magnification of 1:2. Individual tumours on H&E-stained images were then outlined and annotated by their histotypes using ImageJ/Fiji [28] (‘Analyze’-> ‘Tools’-> ‘ROI manager’) to generate a ROI library for each murine sample. Images of serial sections stained with SOX9, Ki67, pAKT and pERK were registered to the nearby H&E images using our previously reported open-source software Spa-R [29], which allows ROI libraries to be applied to marker-stained images (with slight adjustments). For quantifying Ki-67 and SOX9-positive nuclei, individual tumours were exported as inverted 8-bit grayscale images and processed with a CellProfiler pipeline (www.cellprofiler.org) [30] built with nuclei segmentation, intensity measurement, thresholdbased filtration, and percentage calculation modules. The parameters in the pipeline were specified for each staining. To measure pERK and pAKT staining in individual tumours, a workflow using Spa-R and ImageJ/Fiji, and thresholds decided by three independent immunohistochemistry experts, were implemented as before [29]. All raw results can be found in the supplementary material, Table S3. To mark Collagen IV fibres with red pseudo-colour, our in-house open-source Spa-Q software [29] was applied, using identical thresholds for all samples.
Image analysis of human adenocarcinoma cohort (TMA-2)
SOX9 duplex core images were quantified with a purpose-made Visiopharm app. In brief, images were segmented into epithelial and stromal elements using the yellow channel with manual assistance where necessary. Tumour cell nuclei were categorised as weak, negative, moderate or strong for purple SOX9 staining using human-defined cut-offs, and an H-score was calculated for each core (%weak + 2x%moderate + 3x%strong nuclei, yielding a score from 0-300). Ki-67 staining was performed with the Ventana pre-diluted ready-to-use kit, and assessed by a trained histopathologist.
Statistical analysis
For clinical sample (TMA-1) and murine sample analysis, data visualisation and statistical analyses were performed with GraphPad Prism 8 (GraphPad Software, Inc. San Diego, USA). For statistical comparisons, one-way ANOVA multiple comparison (Kruskal-Wallis test, nonparametric test), unpaired t test (non-parametric Mann-Whitney test), or Fisher’s exact test was used. Statistical significance of survival was assessed with a Log-rank (Mantel-Cox) test. P values < 0.05 were considered significant. For the TMA-2 human adenocarcinoma cohort all statistics were performed in R. The cut-off for binarisation of SOX9 was obtained by a published method that has a greater power and lower bias than logrank optimisation in large data sets [31]. The proportional hazards assumption was tested by examination of log-log plots.
Results
The correlation between SOX9 expression and NSCLC staging is histopathology-selective
To evaluate SOX9 expression in NSCLC histotypes according to their stages, we performed IHC analyses of TMAs encompassing 28 human lung SCCs, 13 human lung ASCs and 25 human lung PACs. SOX9 expression was not seen in normal lung tissues but observed in the tumour regions in most of all samples (SCC: 96%, ASC: 100%, PAC: 88%). This significantly contrasts with the expression of SOX2, a well-established genetic driver of SCC, which was more commonly detected in SCCs and ASCs than PACs (SCC: 85%, ASC: 100%, PAC: 8%) (Figure 1A). The SOX9 protein was mainly located in the nucleoplasm, but cytoplasmic/membranous expression was also observed, particularly in PACs (Figure 1A-B).
(A) Schematic summary of SOX9 and SOX2 expression, evaluated by visual scoring, in TMA panels representing clinical SCCs, ASCs and PACs. (B) Representative images illustrating SOX9 expression in normal and malignant epithelial lung tissues. Scale bar: 20 μm. (C) Contingency bar plots show percentages of SOX9-negative, -moderate and -high cases in each histotype, and for NSCLC staging categories determined following American Joint Committee on Cancer (AJCC) criteria. Sample sizes are indicated above each bar. (D) Representative images of duplex-stained in situ or invasive adenocarcinoma tumour tissues in TMA-2. Yellow = cytokeratin, purple = SOX9, blue = haematoxylin. Cores are 1 mm across. Scale bars: 200 μm for original or 500 μm for magnified images. (E) Violin plots showing the relationship between SOX9 and mucinous phenotype and local invasiveness in stained tissue cores (left), and between SOX9 status and tumour cell proliferation quantified by nuclear Ki-67 expression (right).
Based on visual scoring, we assigned each sample to a ‘SOX9-negative’, ‘SOX9-moderate’ or ‘SOX9-high’ category and grouped them by their clinical stages (Figure 1B-C). Among the three NSCLC histotypes, PACs contained the most SOX9-negative or-moderate cases, particularly in the lowest stage tumours (6/6 cases in stage I and 5/7 cases in stage II), whereas in SCCs and ASCs absence of SOX9 was only detected in one sample and at least half of all stage I-II cases were SOX9-high. Thus, a positive correlation between SOX9 expression and clinical stage was detected in PACs, but not in ASCs or SCCs.
To more deeply explore the relationship between SOX9 and the phenotype of human lung adenocarcinoma, we quantified the expression of SOX9 expression in epithelial tumour cell nuclei in a large cohort of 970 resected primary lung adenocarcinomas in tissue microarrays (Figure 1D). Interestingly, nuclear SOX9 levels were higher in mucinous than non-mucinous tumours (p<0.001), and cores with invasive growth patterns showed a non-significant trend towards higher levels of SOX9 expression selectively in non-mucinous samples (p=0.09 in non-mucinous; p=0.58 in mucinous) (Fig 1D-E). Finally, corroborating previous studies [19–20, 32], epithelial tumour cell proliferation assessed by the percentage of tumour nuclei expressing Ki-67 was significantly related to SOX9 expression (p=0.003) (Fig 1E), and in analysis of patient survival time after surgery higher median levels of SOX9 related to relatively poor outcome (Cox model HR=1.2, p=0.044) (supplementary material, Figure S1).
Higher median nuclear SOX9 expression is significantly associated with relatively poor overall survival in patients with lung AC after surgery.
SOX9 is commonly expressed in advanced murine KrasG12D;Lkb1-/- tumours
We subsequently applied animal models to study the role of SOX9 in NSCLC histotype formation. As a first step, we examined SOX9 expression in tumours from KrasG12D;Lkb1-/- mice (KL mice characterised in [25], in this study named KLSwt). Most KLSwt histotypes, including ASC, MAC and PAC, displayed high SOX9 expression, but its expression was fully absent or low in IACs which are small and represent the least aggressive lesion type with low proliferation and oncogenic signalling activities (Figure 2; supplementary material, Figure S2A). Therefore, SOX9 expression levels in murine NSCLC histotypes were comparable to those in clinical NSCLC, showing absent or low expression in IAC or in situ non-mucinous adenocarcinomas and wide expression across all advanced NSCLC histotypes.
(A) Three scatter dot plots summarising the size (left), proliferation rate (centre) and MAPK/AKT oncogenic signalling activities (right) in murine NSCLC histotypes. Stainings were performed on serial tissue sections and analysed by histopathology-guided analyses of individual tumours. Statistical significance was assessed with One-way Anova (non-parametric Kruskal-Wallis test): *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001. (B) Representative images depicting marker stainings for murine ASC and SCC-like histotypes. NKX2-1: AC marker; p63: SCC marker. Scale bars: 100 μm. (C) Representative images depicting murine NSCLC histotype tissues from KLSwt and KLSnull cohorts that express or lack detectable SOX9. Scale bars: 10 μm. The red dashed line separates the AC and SCC regions within an ASC. PACs or ASCs with papillary AC cores were not detected in the absence of SOX9 expression.
(A) Scatter dot plot showing SOX9 expression in individual tumours, grouped by histotype and calculated as percentages of SOX9-postive cells in total number of tumour cells, from six KLSwt-CC10 and eight KLSwt-SPC mice. Numbers above scatter plots indicate total tumour numbers. Statistical significance was assessed with One-way Anova (nonparametric Kruskal-Wallis test): ****p < 0.0001. (B) Representative H&E images and SOX9 stainings, illustrating SOX9 expression in each histotype. Areas depicted in higher magnification are outlined with black squares in the H&E images. Red dashed lines outline the whole tumour regions, or separate SCC and AC regions within an ASC. Scale bars: 500 μm for original or 10 μm for magnified images.
SOX9 expression is essential for papillary structure formation in KrasG12D;Lkb1-/- murine tumours
To investigate the impact of Sox9 loss on histotype formation, we established Sox9-deficient KrasG12D;Lkb1-/- cohorts, including KLShet (losing one floxed Sox9 allele) and KLSnull (losing two floxed alleles), following infection with either Ad5-CC10-Cre or Ad5-SPC-Cre viruses (Figure 3A).
(A) Murine Sox9 loss-of-function and control cohorts established for this study. (B) Kaplan-Meier survival curves for the indicated cohorts. Statistical significance was assessed with Log-rank (Mantel-Cox) test: *p < 0.05, **p < 0.01. (C) Left: representative images illustrating whole lung sections of individual mice from the indicated Ad5-SPC-Cre cohorts, depicting H&E staining and corresponding SOX9 expression. Scale bars: 1000 μm. Right: bar chart showing the histotype-selective tumour burden (% total histotype-specific tumour region per lung) in the indicated cohorts. Numbers of mice that carry the specified histotype are listed above each bar. In the KLSnull-SPC cohort, only SOX9-negative tumours were plotted. Statistical significance was assessed with One-way Anova and non-parametric Kruskal-Wallis test: *p < 0.05, ***p < 0.001. (D) Left: representative images illustrating whole lung sections of individual mice from the indicated Ad5-CC10-Cre cohorts, depicting H&E staining and corresponding SOX9 expression. Scale bars: 1000 μm. Right: bar chart showing the histotype-selective tumour burden (% total histotype-specified tumour region per lung) in the indicated cohorts. Numbers of mice that carry the specified histotype are listed above each bar. In the KLSnull-CC10 cohort, only SOX9-negative tumours were plotted.
Kaplan-Meier survival analysis showed significantly longer survival of KLShet-SPC and KLSnull-SPC mice, as well as KLSnull-CC10 mice, compared to the relative KLSwt controls (Figure 3B). The increased life span of KLShet-SPC and KLSnull-SPC cohorts corresponded to a reduced PAC burden (Figure 3C), and no papillary tissue structure was observed in SOX9-negative tumours in these mice (Figure 3C; supplementary material, Figure S2C). In Ad5-CC10-Cre cohorts, Sox9 loss was associated with a shift in the major tumour burden from ASC to SCC tissue with large necrotic regions (Figure 3D). These SCC tumours were positive for the AC marker NKX2-1 and lacked SOX2 expression (supplementary material, Figure S2B), suggesting that these were derived from ASCs, and hence called SCC-like from here onwards. Interestingly, only the formation of ASC with papillary AC cores, but not those with mucinous AC cores, was disrupted (Figure 3D and supplementary material, Figure S2C). Additionally, PAC was the only AC subtype normally present in KLSwt-CC10 mice that was absent following Sox9 loss (Figure 3D and supplementary material, Figure S2C).
Together, this data suggested that SOX9 expression was selectively required for the growth of papillary histotype tissue. In support of this conclusion, SOX9-positive tumours in both the Sox9null-SPC and -CC10 cohorts, explained by recombination escape known to arise in three-allelic compound mice [33], always showed a papillary tissue structure (supplementary material, Figure S3). As expected, based on the obligate role of LKB1 as a tumour suppressor in this model, we found no evidence for escape of Lkb1 recombination (IHC staining; data not shown). Importantly, in both the CC10- and SPC-infected cohorts, KLShet resembled KLSnull mice in terms of survival, histopathology spectra and tumour burden (Figure 3B-D; supplementary material, Figure S3C-D), together indicating that both heterozygous and homozygous Sox9 loss selectively disrupted papillary NSCLC histotype formation.
(A) Representative images of PAC tumours that developed following escape from Sox9 recombination in a KLSnull-SPC mouse. The red dashed line separates the papillary histotype region from its precursor IAC lesion. Scale bars: 1000 μm for full images and 20 μm for magnified images. (B) Representative images showing the appearance of an ASC with papillary AC core (left) and a PAC (right) that developed following escape from Sox9 recombination in two KLSnull-CC10 mice. Scale bars: 1000 μm for full images and 20 μm for magnified images. (C) Contingency bar plot showing histotype-specific tumour burdens (% area of each histotype perlung) in eight KLSwt-SPC, 14 KLShet-SPC, and 13 KLSnull-SPC mice. All tumours, including those that escaped Sox9 recombination, are plotted. PACs found in KLSnull-SPC mice were always SOX9-positive. (D) Contingency bar plot showing histotype-specific tumour burdens (% area of each histotype per lung) in six KLSwt-CC10, 11 KLShet-CC10, and 12 KLSnull-CC10 mice. All tumours, including those that escaped Sox9 recombination, were plotted. PACs and ASCs with papillary AC cores detected in KLSnull-CC10 mice were always SOX9-positive.
SOX9 expression is essential for the oncogenic transformation of cancer-initiating SPC+ cells
To understand why SOX9 selectively regulated the formation of papillary tissue structure, we studied the spatial expression of cell lineage markers and SOX9 in different tumour histotypes. In the Ad5-SPC-Cre cohorts, SPC, which marks AT2 progenitor cells and differentiated AT2 cells, was highly expressed in IACs, while SPC was absent or expressed at low levels in PACs (Figure 4A-B). Also in the Ad5-CC10-Cre cohorts, IACs were positive for SPC whereas PACs and papillary AC cores of ASCs showed negative to low SPC expression (Figure 4C-D; supplementary material, Figure S4A), indicating that also CC10-Cre induced the transformation of SPC+ cells. Other tumour histotypes found in Ad5-CC10-Cre cohorts that were not affected by Sox9 loss (Figure 3D), particularly mucinous and squamous tissue, were always SPC-negative (Figure 4D and supplementary material, Figure S4A). Interestingly, we further found that SPC expression was mutually exclusive with SOX9 expression in PACs as well as papillary AC cores of larger ASC tumours, and marked tumour subregions that lacked prominent papillary structure (Figure 4B and D). The concomitant appearance of papillary structure together with SPC loss upon SOX9 expression, together with the absence of PAC and papillary AC cores following Sox9 loss as shown in the section above, suggested that SOX9 was a driving force for papillary tumour progression from SPC+ progenitor cells (illustrated in Figure 4E).
(A) Left: representative images of a KLSwt lung lobe collected six weeks post Ad5-CC10-Cre infection. One small lesion was SOX9+/SPC- and had papillary structure, while another was SOX9-/SPC+ with IAC histotype features. Right: representative images of a KLSnull lung lobe collected six weeks post Ad5-CC10-Cre infection, where the highlighted small lesion was SOX9-/SPC- and SCC-like. Scale bars: 1000 μm for the lung overview image and 100 μm for the magnified images. (B) Representative H&E images showing resting and reactive mediastinal LNs (outlined with a red dashed line) and the macroscopic location of LNs collected in this study. The tissue under the resting LN represents the thymus. No metastasis was found in resting and reactive LNs, confirmed by staining with the AC marker NKX2-1 and SCC marker p63. Scale bars: 200 μm. (C) Representative images illustrating AC metastasis in a KLSwt-SPC mouse. The metastatic tissue is NKX2-1+, SOX9+ and p63-in epithelial regions. Scale bars: 200 μm. (D) Representative images showing a mucinous minimal metastasis positive for the SCC marker p63, and an AC minimal metastasis positive for the AC marker NKX2-1 observed in KLShet-CC10 mice. SOX9 staining was positive in both cases. Outlined regions are represented in the magnifications. Scale bars: 500 μm for images showing entire LNs and 100 μm for magnifications.
(A) Representative images illustrating SOX9 and SPC stainings in the lung of a KLSwt-SPC and a KLSnull-SPC mouse. More IACs were observed in the KLSnull-SPC mouse, which all expressed SPC. Scale bars: 1000 μm. (B) Upper: representative images of a tumour from a KLSnull-SPC mouse that had partially escaped Sox9 recombination. The papillary area outlined by the red dashed line contained SOX9-positive regions that were SPC-negative; the remaining IAC-like region was SPC-positive. Lower: representative images of a PAC from a KLSwt mouse, depicting mutual exclusive expression of SOX9 and SPC. Scale bars: 100 μm. (C) Representative images illustrating SOX9 and SPC stainings in the lung of a KLSwt-CC10 and a KLSnull-CC10 mouse. Arrowheads point to SPC-positive tumour regions. Scale bars: 1000 μm. (D) Representative images showing spatial SOX9 and SPC expression in tumours of the indicated histotypes detected in KLSwt-CC10 mice. The red dashed line separates the SCC and AC regions within an ASC tumour. Arrowheads point to SOX9-negative, SPC-positive regions in a PAC and a papillary AC core region of an ASC. Scale bars: 100 μm. (E) Schematic explaining how SOX9 regulates murine NSCLC progression in a histotype-selective manner. Left: in KLSwt mice, SPC+ AT2 cells, targeted by Ad5-SPC-Cre virus, give rise to SPC+ IACs. These IAC lesions gradually start to express SOX9 and this is essential for papillary tumour progression; the SPC+ identity is gradually lost when papillary tissue structure appears. Upon Ad5-CC10-cre infection, SPC+ IACs again develop, either from AT2 cells that are targeted by the CC10-cre virus or from trans-differentiated Clara or dual positive SPC+/CC10+ cells. SOX9 expression in papillary tissue corresponds with loss of SPC expression, giving rise to PACs or the papillary AC core of ASC. Lastly, transformation of CC10+ progenitor cells, which are mostly Clara cells, gives rise to MAC, AAC and ASC with mucinous AC core. Right: Sox9 loss selectively disrupts papillary histotype tissue formation in which transformed cells exhibit progression from a SPC+ to a SPC-status.
SOX9 expression differently regulates AC and SCC tissue metastases
Given the previous links between SOX9 expression and invasion, we next asked how Sox9 loss affected the metastatic potential of NSCLC histotypes by analysing the mediastinal lymph nodes (supplementary material, Figure S4B-C). Metastases in KLSwt-SPC (observed in 3/8 KLSwt-SPC mice) were similar to those previously described [32] and represented AC histotype tissue with papillary structure (supplementary material, Figure S4C). Metastases that eventually developed in KLSnull-SPC mice (4/19 mice, including 2 minimal metastases) were always SOX9-positive, suggesting that they had originated from PACs that had escaped Sox9 recombination (Figure 5A and C). Also, the KLShet-SPC mice developed frequent metastasis (12/17 mice, including 6 minimal metastasis), which could however not be linked to SOX9 status due to the heterozygous SOX9 expression. Importantly, as previously reported [32], squamous metastases were never detected in KLSwt-CC10 mice, but observed in 2/11 KLShet-CC10 and 2/13 KLSnull-CC10 mice (Figure 5B and D; supplementary material, Figure S4D). Three of the four mice carrying squamous metastases succumbed to their disease relatively early and within the survival range of control KLSwt mice. This analysis therefore suggested an overall reduction in AC metastases following Sox9 loss, linked to a block in PAC progression, mirrored by a notable increase in SCC-like metastases following Sox9 loss in murine NSCLC.
(A) Bar chart illustrating the proportion of total animals exhibiting no metastasis, minimal metastasis, or overt metastasis in the mediastinal lymph nodes (LNs) in the indicated Ad5-SPC-Cre cohorts. Total numbers of mice are listed above each bar. Metastases were all SOX9-positive, and all were adenocarcinoma. (B) Bar chart illustrating the proportion of total animals exhibiting no, minimal, or overt metastasis in the mediastinal LNs, subgrouped by SOX9 status, in the indicated Ad5-CC10-Cre cohorts. Total numbers of mice are listed above the bars. (C) Representative images illustrating minimal metastasis (only a small LN region infiltrated with tumour cells) and overt metastasis (LN structure severely disrupted) in Ad5-SPC-Cre cohorts; NKX2-1 indicates AC histotype tissue. Areas depicted in higher magnification are outlined with red dashed squares. Scale bars: 500 μm for original images or 100 μm for magnified images. (D) Representative images illustrating SCC-like metastases in Ad5-CC10-Cre cohorts; p63 indicates SCC histotype tissue. Areas depicted in higher magnification are outlined with red dashed squares. Scale bars: 500 μm for original images or 100 μm for magnified images.
As SOX9 is known to regulate extracellular matrix (ECM) deposition, we next examined ECM features following Sox9 loss by staining of collagen IV (COLIV), a central component of the tumour basement membrane. COLIV was expressed in both IAC and PAC, but only in the latter it formed a linear structure, which was also observed in papillary SOX9-expressing tissue at metastatic sites (Figure 6A-B). This suggested that a precise alignment of COLIV, regulated by SOX9 expression, might be required for the actual growth and later invasiveness of PACs. On the other hand, in Sox9wt ASCs, COLIV formed continuous boundaries around squamous tumour cell pockets (Figure 6C), while Sox9null SCC-like tumours and the SCC regions of ASCs with mucinous AC cores showed thinner parallel COLIV fibres as well as granular and focal depositions (Figure 6D). A similar alteration in COLIV rearrangement was also observed in the SCC-like LN metastases (Figure 6D). Taken together, in murine PACs, Sox9 loss reduced primary papillary tumour progression and consequently their invasion, possibly related to an alteration in ECM deposition, while in squamous histotype the altered COLIV deposition following Sox9 loss in fact associated with an increase in tumour invasiveness and metastasis (illustrated in Figure 6E).
(A) Representative images showing SOX9 and collagen IV (COLIV) expression in a KLSnull-SPC tumour with both PAC and IAC features. The red dashed line marks the papillary region. Squares mark the areas depicted in higher magnification. Scale bars: 100 μm for original images or 20 μm for magnified images. (B) Representative images showing SOX9 and COLIV expression in a LN metastasis with PAC features. Squares mark the areas depicted in higher magnification. Scale bars: 100 μm for original images or 20 μm for magnified images. (C) Representative images depicting the COLIV arrangement in a Sox9wtASC with papillary AC core and a Sox9wtASC with mucinous AC core. Squares mark the areas depicted in higher magnification. Scale bars: 1000 μm for original images or 20 μm for magnified images. (D) Representative images depicting the COLIV arrangement in a Sox9nullASC with mucinous AC core, a Sox9null SCC-like tumour, and a Sox9null SCC metastasis in the mediastinal LN (p63 staining of the same LN metastasis is shown in Fig. 5D). Altered COLIV deposition is detected selectively in SCC of ASC with mucinous AC core and SCC-like tissue. Squares mark the areas depicted in higher magnification. Scale bars: 1000 μm for original images or 20 μm for magnified images. (E) Schematic illustrating how SOX9 may regulate histotype-specific metastasis by influencing ECM (marked by COLIV) secretion/deposition. Linearised COLIV backbones form during IAC to PAC progression, which requires SOX9 expression. Sox9 loss disrupts PAC formation, and hence their subsequent metastasis. To the contrary, while COLIV fibres form continuous boundaries around squamous tumour cell pockets in Sox9wt SCC of ASC that may confine tumour invasiveness, they are re-oriented, or become granular/discontinuous following Sox9 loss, in SCC-like tumours or SCCs of ASCs with mucinous AC cores. This associates with an increase in the incidence of SCC-like metastasis, and these metastases similarly show altered COLIV deposition.
Discussion
Re-expression of the lung developmental transcription factor SOX9 specifically in malignant lung tissues, in conjunction with the reported positive correlation of SOX9 expression levels with clinical NSCLC staging, renders SOX9 a good candidate for prognostic and therapeutic interrogation. We here studied the previously unexplored expression and role of SOX9 in NSCLC histotypes. We identified a trend towards elevated SOX9 expression in non-mucinous clinical invasive AC compared to in situ samples, as well as in murine PAC compared to IAC, with IAC constituting an early form of PAC. SOX9 was widely expressed across multiple invasive stage murine and clinical NSCLC histotypes, yet murine Sox9 loss only affected papillary structure formation, not the initiation or progression of other histotype lesions. The overarching conclusion of our work is therefore that the role of SOX9 in NSCLC is to be interpreted in a histotype-selective manner.
The expression of KrasG12D combined with Lkb1 loss in alveolar AT2 or bronchial Clara lung progenitor cells gives rise to a range of NSCLC histotype tumours. While the alveolar differentiation marker SPC was uniformly expressed in SOX9-IAC, its downregulation coincided with the appearance of papillary tissue structure, and SPC was completely absent in mature SOX9-expressing papillary tumour regions. A transition from alveolar dedifferentiation to gradual SOX9 expression was similarly seen following oncogenic KRAS expression in organoids [34], and corroborates data showing that ectopic expression of SOX9 in embryonic lungs prevented AT2 cell differentiation [12,13], overall indicating that SOX9 expression intrinsically associates with AT2 lineage exit. Use of a conditional Sox9 null model to address the role of SOX9 in NSCLC progression showed that Sox9 loss disrupted the formation of PAC and papillary AC cores of ASC lesions, but not IAC or histotypes derived from bronchial progenitors. This therefore suggests that SOX9 expression is specifically required to drive the progression of lesions developed from a SPC+ cell of origin, leading to papillary NSCLC formation. Precisely how SOX9 drives AT2 lineage exit remains unclear, and an intriguing future direction would hence be to identify the lineage-specific SOX9 transcriptional network, and particularly its interplay with transcription factors known to regulate alveolar differentiation, such as FOXA2 [35–37]. Our study thus reveals a role of SOX9 in NSCLC progression that is selectively linked to the SPC+/AT2 lineage of cancerinitiating lung progenitor cells.
We showed that Sox9 loss decreased AC metastasis, which is consistent with a reported prometastatic role of SOX9 in multiple cancer types [38–40]. But surprisingly, the incidence of SCC metastasis increased following Sox9 loss. One mechanism in which SOX9 is involved in tumour metastasis is through regulating the expression and deposition of ECM components, including multiple collagens and laminins [14]. Our findings importantly suggest that SOX9-regulated ECM re-arrangements affect tumour invasiveness in different and microenvironmental context-dependent manners. We identified differential COLIV deposition, which marks tumour basement membranes, across NSCLC histotypes. Linear COLIV was observed in SOX9-expressing primary PACs and papillary metastatic LNs, but not in nonmetastatic SOX9-negative IACs, suggesting that it is part of a supportive structure for papillary growth and metastasis upon SOX9 expression. On the other hand, in the SOX9-expressing squamous regions COLIV formed pocket structures surrounding the tumour islets, and these became discontinuous and granular in SOX-negative squamous region. Similar COLIV discontinuity has been previously linked to the progression of oral SCC [41], suggesting that disrupted squamous pocket structure is commonly associated with increased squamous invasiveness. Our study therefore depicts functional heterogeneity of SOX9 in NSCLC metastasis, namely a requirement for SOX9-associated matrix deposition in papillary histotype metastasis, but an opposing metastasis-suppressing function in squamous histotype tissue.
Analysis of Sox9 heterozygous null mice showed that loss of one Sox9 allele already disrupted papillary tissue formation. While Sox9 haploinsufficiency has not been previously described in cancer studies, loss of one Sox9 allele has been associated with defects in beta-cell development during murine pancreas organogenesis [42], as well as with defects in murine chondrocyte differentiation [43]. In humans, Sox9 haploinsufficiency caused by de novo heterozygous Sox9 mutation leads to a rare but severe form of skeletal dysplasia called campomelic dysplasia (CD) [44], and CD mutations can act in a dominant negative manner by disrupting the capacity of SOX9 to dimerise [45]. While our study demonstrates a similar allelic dosage effect of Sox9 in regulating NSCLC progression and metastasis, further studies are needed to understand whether this is associated with defective SOX9 dimerisation, and which are the target genes that essentially regulate histotype-selective effects during NSCLC progression.
Targeting transcription factors in cancer has been a long-standing challenge, yet the tumour cell-specific SOX9 expression renders it an attractive therapeutic target in NSCLC. Our results indicate that the inhibition of SOX9 could hamper the progression of early-stage ACs with SPC+/AT2 identity. Further study should address the effect of Sox9 loss subsequent to PAC formation, for example using a flippase-FRT and Cre-loxP dual-recombinase system [46], to learn whether SOX9 is similarly required for the progression or metastasis of already established PACs. If so, then this would provide additional support favouring development of SOX9 inhibitors to treat PAC progression. Small molecules that disrupt the protein-interaction function of the SOX18 vascular development regulator have recently become available [47]. A similar strategy could possibly be applied to identify small molecules that disrupt SOX9 dimerisation or its binding to other lineage-specific transcription factors. However, while our study implies SOX9 as a potential anti-cancer target in papillary histotype NSCLC, Sox9 loss also associated with enhanced squamous tissue metastasis, emphasising that therapeutic considerations ought to consider the tumour’s histotype context and cannot simply adopt SOX9 positivity as a biomarker for response.
In conclusion, we demonstrate opposing requirements for SOX9 in different NSCLC histotypes, and shed light on a pleiotropic SOX9 function that links to different tumour progenitor cells and extracellular microenvironmental features. This provides a strong motivation to evaluate the prognostic value and therapeutic application of SOX9 in the context of NSCLC histopathology types.
Author contributions
JB, KN and EWV designed the study and planned the experiments. KN established the animal cohorts and started the experiments. JB conducted further experiments and performed histopathology-selective analyses. AT performed immunohistochemistry and quantification on the large clinical adenocarcinoma cohort. AH and NML assisted with animal health checks, genotyping, and tissue processing. AH performed image scanning. KS provided histopathological review of the murine tumours. MIM provided the grade information of clinical NSCLC samples. JLQ supervised the large clinical adenocarcinoma cohort analysis. JB and EWV wrote the manuscript, and EWV provided supervision.
Data availability statement
The data that support the findings of this study, including murine tumour and TMA-1 analyses, are available as supplementary materials, Table S2–3. Data generated with TMA-2 is accessible by contacting the authors.
Acknowledgements
We thank the Digital and Molecular Pathology Unit supported by the University of Helsinki and Biocenter Finland, and the Laboratory Animal Centre for husbandry support. We thank Zhangyi He for performing statistical analysis on the TMA-2 dataset. Katja Välimäki is thanked for material support, Ashwini Nagaraj for advice, and Iris Lähdeniemi for reviewing the manuscript. Samples/data used for the research were obtained from the Helsinki Biobank, and we warmly thank all patients for consenting to archive their samples in the Biobank. Research was supported by the University of Helsinki Doctoral Programme in Biomedicine (JB); The Finnish Medical Foundation (MIM); MRC Toxicology Unit core funding (JLQ); Innovative Medicines Initiative Joint Undertaking grant agreement no. 115188, the resources of which are composed of a financial contribution from the European Union’s Seventh Framework Programme (FP7/2007-2013) and EFPIA companies’ in-kind contribution (EWV); and the Academy of Finland grant 307111 (EWV).
Footnotes
No conflicts of interest were declared