ABSTRACT
SULF1 and SULF2 are oncogenic in a number of human malignancies, including head and neck squamous cell carcinoma (HNSC). The function of these two heparan sulfate editing enzymes was previously considered largely redundant but the biology of cancer suggests differences that we explore in our RNAseq and RNAScope studies of HNSC and in a pan cancer analysis using the TCGA and CPTAC (proteomics) data. Our studies document a consistent upregulation of SULF1 and SULF2 in HNSC which is associated with poor survival outcomes. SULF2 expression increases in multiple malignancies but less consistently than SULF1, which uniformly increases in the tumor tissues and negatively impacts survival in several types of cancer. Meanwhile, SULF1 showed low expression in cancer cell lines and a scRNAseq study of HNSC shows that SULF1 is not supplied by epithelial tumor cells, like SULF2, but is secreted by cancer associated fibroblasts. Our RNAScope and PDX analysis of the HNSC tissues fully confirm the stromal source of SULF1 and explain the uniform impact of this enzyme on the biology of multiple malignancies. In summary, the SULF1 enzyme, supplied by a subset of cancer associated fibroblasts, is upregulated and negatively impacts HNSC survival at an early stage of the disease progression while the SULF2 enzyme, supplied by tumor cells, impacts survival at later stages of HNSC. This paradigm is common to multiple malignancies and suggests a potential for diagnostic and therapeutic targeting of the heparan sulfatases in cancer diseases.
INTRODUCTION
Eukaryotic sulfatases are primarily lysosomal enzymes that hydrolyze sulfate esters during the degradation of macromolecules, often in conjunction with glycosidases1. The sulfatase family of proteins shares a common catalytic mechanism using a formyl glycine residue in the active site for catalysis2. However, human 6-O-endosulfatases SULF1 and SULF2 are distinct from all other sulfatases in that they are neutral pH extracellular enzymes that edit the sulfation of heparan sulfate proteoglycans (HSPG) instead of degrading them3. The HSPG family of core proteins carries one or several serine/threonine residues covalently attached to a heparan sulfate glycosaminoglycan chain further modified by N-deacetylation and sulfation, epimerization, and variable O-sulfation4. Four sulfation sites at the N-, 3-O-, and 6-O-positions of the glucosamine and at the 2-O-position of the glucuronic acid4 regulate protein interactions and the 6-O-sulfate-dependent interactions are critical regulators of the pathophysiology of multicellular organisms5–8. The SULF1 and SULF2 enzymes are the only post-synthetic editors of the 6-O-sulfation at the internal glucosamines of highly sulfated HSPG domains and their activity defines many critical interactions at the cell-surface and in the extracellular matrix (ECM).
HSPG exquisitely regulate embryogenesis, organogenesis, and physiology of nearly all organs by adjusting gradients of at least 600 proteins including growth factors, cytokines, chemokines, proteases, or collagens6,9. The highly specific SULF activities liberate sequestered HS-binding proteins which regulate matrix remodeling, immune infiltration, or signaling of the respective cognate receptors7,10,11. The determinants of ligand binding are under intense investigation12–14 and systemic rules need to be further elucidated. We know, however, that heparan 6-O-sulfation is essential for binding of many ligands including VEGF, FGF-1, FGF-10, IL8, HGF, Wnt ligands or L- and P- selectins4,10. SULFs represent, therefore, an essential regulatory element that controls the HS-dependent developmental and pathophysiological processes including cancer progression15–17.
Human SULF1 and SULF2 are 65% identical and there is no clear difference in the substrate-specificity of the two enzymes18. However, the impact of the two enzymes on cancer diseases is distinct. SULF2 is upregulated and oncogenic in various cancers7 and a recent study documented that anti-SULF2 antibodies prevent tumor growth in a mouse model of cholangiocarcinoma19. The reported impact of SULF1 on cancer progression is less consistent. Widespread low expression of the SULF1 transcript is observed in cancer cell lines 20,21 and prior studies suggested a tumor suppressor function of SULF1 in ovarian, breast, and liver cancers16. In contrast, increased SULF1 expression is observed in a wide range of human tumors7,16 and high SULF1 expression is associated with advanced primary tumor status, higher histological grade, and worse survival in urothelial carcinoma22. We have shown that SULF1 and SULF2 enzymes increase in tumor tissues of patients with HNSC and that the increase is associated with poor survival23. In this study, we therefore examined available datasets to find which cancers are affected by the 6-O-endosulfatases and we carried out experiments that verify the unifying concepts in their impact on HNSC and other cancers.
METHODS
Differential expression of SULF1 and SULF2 mRNAs in 32 TCGA studies
RNA-seq data and clinical information of 9,160 patients enrolled in 32 cancer studies conducted by the Cancer Genome Atlas (TCGA) consortium (portal.gdc.cancer.gov) and corresponding non-disease tissues from the Genotype-Tissue Expression (GTEx) project (gtexportal.org/home) were downloaded from UCSC-Xena on 02-26-2021 (xenabrowser.net/datapages); SULF1 and SULF2 mRNA was quantified as log2(RSEM counts+1) in both datasets. For the differential expression analysis, we selected 14 TCGA cancer studies with n>10 of paired tumor and normal tissues (Table 1) and we used a Wilcoxon rank sum test to compare paired tumor and non-tumor tissues, where the log2-fold change (|Log2FC|) >1 and the false discovery rate (FDR) <0.05 across studies were considered as statistically significant. SULF1 and SULF2 mRNA expression was further compared by Wilcoxon rank sum test between unpaired tumor tissues of 24 cancer studies and corresponding non-disease tissues of the same organs reported in GTEx (Supplemental Table 1A).
Differential expression of SULF proteins in 10 CPTAC studies
Proteomics data and clinical information of 1,247 patients enrolled in 10 cancer studies with matched tumor and adjacent non-tumor tissue-pairs conducted by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) were downloaded from the Proteomics Data Commons (proteomic.datacommons.cancer.gov/pdc) and the CPTAC Data Portal24 on 05-18-2020. Protein abundance, as determined by the CPTAC Common Data Analysis Pipeline25 quantified the log2 ratio of individual proteins to an internal control of each study, using only peptides not shared between quantified proteins. We analyzed the differential expression between tumor and paired normal tissues by paired t-test; we compared SULF expression at different cancer stages by one-way ANOVA and computed the corresponding FDRs (Table 2).
Pan-cancer survival analysis based on SULF1 and SULF2 mRNA
The impact of SULF mRNA expression on time to event endpoints as cancer driven progression-free interval (PFI), defined by the TCGA research network26, was evaluated by the Kaplan-Meier method in 20 TCGA cancer studies with >100 patients and >40 PFI events (Supplemental Table 1B). We identified optimal SULF cutoff values yielding the smallest p-value from the log-rank test when both SULF low and high groups, in each study, have at least 25% patients. We used the multivariable Cox-proportional hazard model (MCPH) adjusting for age and gender and summarized hazard ratios (HR) with 95% confidence intervals (CI). We used FDR<0.05 to call statistical significance. We further evaluated the HNSC TCGA dataset for an association between SULF1 or SULF2 mRNA expression and PFI at different pathological stages of the disease. We analyzed separately the early stage (stage I and II, n=96) and the late stage (stage III and IV, n=346) tumors. In each subset, we used the log-rank test as above.
A confirmatory survival analysis was completed on HNSC patients (n=88) enrolled at the Princess Margaret Cancer Centre, University of Toronto in line with IRB approved protocols. Majority of the patients have stage IV oral cancer (Supplemental Table 2). RNA was extracted from the snap frozen tumor tissues using a Qiagen RNeasy mini-kit and sample library preparation was done using the Illumina TruSeq stranded total RNA sample preparation kit. Sequencing used a 100-cycle paired read protocol and multiplexing to obtain ~75 million reads/sample on a Novaseq S4 flow cell using XP mode. Transcript abundances in transcripts per million (TPM) were generated from trimmed reads using Kallisto (v. 0.46.1) and the gencodev33 human transcriptome or a combined human-mouse transcriptome with reads aligning to mouse removed prior to analysis. Gene-level abundances (TPM) were calculated using the Bioconductor package tximport (v. 1.24.0). Survival analyses for disease-free interval were performed using methods described above.
Cell-specific expression of SULF1 and SULF2 in HNSC
We evaluated a published HNSC single cell RNA-seq dataset27 that profiled transcriptomes of 5,578 cells from tumor tissues of 18 HNSC patients and identified the type of individual cells based on copy-number variations, karyotypes and expression signatures. We downloaded the data from the UCSC Cell Browser (cells.ucsc.edu/?ds=head-neck) and we quantified the SULF1 and SULF2 mRNAs as log2(TPM+1). Each cell-type expressing SULF1 or SULF2 was defined as a percentage of cells with non-zero expression values (Supplemental Table 3). Differential expression of SULF1 and SULF2 between the cell types was calculated by non-paired t-test.
Correlation studies of the SULF enzymes with CAF and other proteins
We computed Pearson’s correlations between SULF1 and all proteins/corresponding mRNAs in the CPTAC study of HNSC28. To explore magnitude of the correlations between SULF1 and cancer associated fibroblasts (CAF), the averaged Pearson’s correlation between CAF1 proteins (n=206) and SULF1 was compared with the averaged SULF1 correlation with all proteins (n=10,073) where p-value was obtained by performing 10,000 permutations using a randomly selected subset of proteins (n=206). We adopted the CAF1 and CAF2 definitions by Puram et al27. In addition, we adopted the CAF definition of a COL11A1-related CAF subset associated with pre-metastatic locally invasive tumors proposed by Anastassiou29,30 (Supplemental Table 4).
RNAscope analysis of SULF1 and SULF2 in OSCC tissues
We selected tumor tissues of patients (n=20) with carcinoma of the oral cavity (OSCC) for the analysis of SULF1 and SULF2 expression based on in situ hybridization. The patients were either node positive (n=10) or node negative (n=10) and the node positive group had, in general, poor survival outcomes (Supplemental Table 5). FFPE sections (5 μm) of the patient’s tumors were baked at 60°C, deparaffinized, and dehydrated. The RNAScope assay (RNAscope Multiplex Fluorescent Reagent Kit v2 #323100) was done according to the manufacturers protocol with probes for Sulf1 and Sulf2 (ACD 403581-C3 and ACD 502241) paired with OPALs 650 and 570 (Akoya FP1496001KT and FP1488001KT, respectively). After the final wash, slides were prepped for IHC and incubated for 60min with anti-panCK antibody (M3515 DAKO), anti-mouse HRP secondary (DAKO K400111-2), OPAL TSA 520 (Akoya # FP1487001KT), and DAPI (Akoya # FP1490).
Slides were scanned at 10× magnification using the Vectra 3.0 Automated Quantitative Pathology Imaging System (Akoya). Whole slide scans were viewed with Phenochart (Akoya) and high-powered images at 20× (resolution of 0.5 μm per pixel) were selected for multispectral image capture. Three to 20 multispectral image regions of interest (ROIs; 669 μm × 500 μm) were captured in the tumor and normal adjacent regions on each slide. A selection of 10-15 representative multispectral images spanning all tissue sections was used to train the inForm software (tissue/cell segmentation and phenotyping tools). All the settings applied to the training images were saved within an algorithm for batch analysis of all the multispectral images for the project. The analysts were blinded to the patient status and all the raw data were consolidated in PhenoptrReports (Akoya). We quantified the total number of cells in the panCK+ tumor area and the adjacent panCK-area (stroma); these compartments were analyzed for the SULF1+ and SULF2+ cells (Supplemental Table 5).
Expression of SULF1 and SULF2 in a PDX model of OSCC
PDX models were generated as described31. RNAseq was carried out on snap-frozen tissues from early passage (passage 1 to 3) PDX models, as described above. Expression of SULF1 and SULF2 was compared between patient and xenograft samples using a paired Wilcoxon rank sum test and visualized using the R package ggplot2 (v. 3.3.6).
Study Approval
The studies involving human participants were reviewed and approved by the Georgetown University-MedStar Health Institutional Review Board and the University Health Network Research Ethics Board. The patients/participants provided their written informed consent to participate in this study.
RESULTS
SULF1 and SULF2 mRNA expression in different cancer types
We extended our analysis of HNSC23 by retrieving the RNA-seq data of 9,160 patients in 32 cancer studies from the TCGA database to develop a systematic evaluation of SULF1 and SULF2 expression. We compared tumor and paired normal tissues in 14 cancer studies with >10 available paired samples (Table 1). SULF1 is significantly upregulated in tumor tissues of 10 of the 14 studies of which 9 show >2-fold upregulation. The highest SULF1 log2FC is observed in LUAD (log2FC =2.78, FDR<0.001). SULF2 is overexpressed in 8 cancer types of which 5 increase >2-fold. The highest SULF2 FC is observed in ESCA (log2FC=2.76, FDR<0.001). Besides the wide-scale overexpression in tumor tissues, SULF1 is significantly downregulated in KICH and THCA and SULF2 in PRAD but none of the studies reaches a 2-fold decrease.
Our comparison of SULF mRNA in 32 TCGA tumors with the corresponding non-disease tissues from GTEx (Supplemental Table 1A) shows that SULF1 is significantly increased in 18 cancers of which 16 show >2-fold upregulation. SULF2 is significantly higher in 16 cancer studies, of which 11 show >2-fold increase. All the TCGA studies with significant SULF1 or SULF2 upregulation in paired tissues retain the trend in our GTEx analysis. However, the slight decreases observed for SULF1 in KICH and THCA or for SULF2 in PRAD (Table 1) are not confirmed in the GTEX comparison. The decrease in SULF1 is not significant for any of the GTEX comparisons; SULF2 expression is significantly lower in OV (log2FC=−2.27, FDR<0.001) and UCEC (log2FC=−1.76, FDR<0.001).
SULF1 and SULF2 proteins in 10 CPTAC studies
We analyzed differential expression of SULF proteins in paired tumor and normal tissues of 10 proteomics studies from the CPTAC consortium (Table 2). Seven of the cancer types (LUSC, LUAD, HNSC, KIRC, BRCA, COAD, and LIHC/HCC-HBV) overlap with the TCGA datasets, which enables comparison of the expression at the transcriptional and translational levels. Similar to the pervasive upregulation of SULF1 mRNA, SULF1 protein is significantly upregulated in tumor compared to paired normal tissues in 9 of the 10 studies. Four studies showed >2-fold increase of SULF1 protein in tumor tissues (Table 2) with the highest fold-change observed in HNSC (log2FC=1.59, FDR<0.001). Only the smaller size (n=12) study of OSC did not show any difference in SULF1 protein. SULF2 protein is significantly upregulated in the tumors of 6 studies but a >2-fold increase is only observed in PDAC (log2FC=1.14, FDR<0.001). In addition, we saw a significant downregulation of SULF2 protein in tumor tissues of HBV-related HCC (log2FC=−0.33, FDR<0.001) and UCEC (log2FC =−0.43, FDR=0.01). This is consistent with the reduced expression of SULF2 mRNA in tumor tissues of LIHC (log2FC=−0.481, FDR=0.116, Table 1) and UCEC (log2FC=−1.76, FDR<0.001, Supplemental Table 1A). Based on the SULF expression in the two independent datasets (TCGA and CPTAC), we conclude that SULF1 is commonly upregulated across different cancer types while SULF2 overexpression is more restricted to certain cancer pathologies.
We observed >2-fold upregulation of both SULF1 and SULF2 mRNA in four cancer studies from the TCGA (HNSC, ESCA, LUSC, and STAD) and all these cancers remain significantly upregulated compared to the GTEX normal tissues (Figure 1A, 1B). At the same time, SULF1 and SULF2 proteins are significantly elevated in HNSC and LUSC (Table 1) while the STAD and ESCA studies were not reported at the time of our analysis. Pancreatic cancer (PDAC) is the only CPTAC study with both SULF1 and SULF2 protein significantly upregulated >2-fold in tumor compared to paired normal tissues (Figure 1C). Limited size (n=4 paired samples) of the PDAC study prevented our analysis of paired mRNA expression in the TCGA dataset. However, SULF1 and SULF2 mRNAs are >20-fold higher in the PDAC tumor tissues than in the non-disease pancreatic tissue from the GTEx (SULF1 log2FC=6.81, SULF2 log2FC=4.85, both p<0.001, Figure 1D), which is consistent with the large difference in the protein expression (Table 2). The survival analyses presented below further support the impact of SULF enzymes on HNSC and PDAC and warrant additional study.
Association of SULF1 and SULF2 mRNA expression with survival outcomes
The association of SULF1 or SULF2 mRNA expression and PFI was analyzed by univariate log-rank tests and compared to published studies (Supplemental Table 1B). Our literature search found five studies (bladder22, breast32, lung16, gastric33, and liver11) showing that high SULF1 expression is associated with poor survival outcomes. We observed an adverse prognostic trend for these cancers in the TCGA studies but the associations did not reach significance (Supplemental Table 1B). In addition, high SULF1 is a significant (FDR<0.05) prognostic factor in KIRP (HR=2.693), PAAD (HR=2.365), CESC (HR=2.227), COAD (HR=1.867), and LGG (HR=1.479); we are not aware of studies reporting these associations. We note that SULF1 is significantly increased at the mRNA (Table 1) and protein (Table 2) levels in COAD and negatively impacts survival; such associations deserve further attention. The association of high SULF1 with poor survival (Figure 2) in many cancers is quite remarkable especially in view of the fact that the SULF1 transcript is low in most cancer cell lines (Supplemental Figure 1) 20,21.
We found studies of 8 cancers (bladder22, esophagus34, head and neck23, kidney35, liver36, lung37, and pancreatic38) showing significant association of SULF2 with survival but the impact is less uniform. Our analysis confirms the reported associations (all FDR<0.05) of high SULF2 expression with poor PFI of patients with HNSC (HR=1.687), LIHC (HR =1.587), and PAAD (HR =1.724) (Figure 2). An adverse association of SULF2 with PFI in ESCA (HR=1.352, p=0.158) did not reach statistical significance as reported34. In addition, high SULF2 expression is significantly (FDR<0.05) associated with better PFI in LGG (HR=0.354, p<0.001) and UCEC (HR=0.415, p=0.004) (Supplemental Table 1B) and a favorable prognostic impact of high SULF2 was reported in clear cell renal carcinoma35 (HR=0.07, p=0.015, n=49) and lung squamous cell carcinoma37 (HR=0.11, p=0.02, n=51). These observations were, however, not corroborated in the larger TCGA studies (Supplemental Table 1B).
Survival impact of SULF1 and SULF2 in HNSC differs by pathological tumor stage
We have shown that high SULF1 or SULF2 expression in HNSC is associated with poor survival outcomes23. This association was further confirmed in our study of 88 HNSC patients enrolled at the University of Toronto; we used an optimized cutoff of SULF1 (52 high and 36 low expressors) or SULF2 (24 high and 64 low expressors) to show that high expression of either gene is associated with poor disease-free interval (p<0.001) (Supplemental Figure 2). The study has limited size but provides an important independent verification of the results.
SULF1 is associated with poor survival in univariate analysis23 but, contrary to SULF2, loses a significant impact when analyzed in a multivariable model together with SULF2, age, gender, smoking history, tumor stage, and radiation therapy. However, the TCGA sample-set used in the analysis is dominated by tumors of stage 3 and 4. To further evaluate the impact of SULF1, we analyzed separately HNSC patients with early (stage 1 and 2) or late (stage 3 and 4) tumors. We observe that the survival outcomes differ by stage even though SULF1 and SULF2 expression does not differ between early- and late-stage tumors23. High SULF1 mRNA expression in tumor is associated with poor PFI in early-stage patients (HR=2.327, p=0.023, Figure 3A) but not in late-stage patients (HR=1.034, p=0.842). On the contrary, SULF2 mRNA overexpression is significantly associated with poor PFI outcomes in late-stage patients (HR=1.794, p<0.001, Figure 3B) but not in early-stage patients (HR=1.457, p=0.866). The high impact of SULF1 in the early tumors is even more pronounced in a multivariable model23 that includes SULF1, SULF2, age, gender, smoking, and radiation therapy. While SULF2 is insignificant in the 97 stage 1 and 2 patients (HR=1.32 (95% CI, 0.58-3.0), p=0.59), high SULF1 remains an independent predictor of poor PFI (HR=4.61 (1.88-11.3) p<0.001). We speculate that this pattern of adverse prognostic impact is associated with a SULF1 function in the local spread of the disease at an early stage which is complemented by SULF2 activity at later stages of the HNSC progression.
Cell-specific expression of SULF enzymes in HNSC
Analysis of a single cell RNA-seq dataset of HNSC27 showed that the percentage of SULF1 and SULF2 positive cells varies substantially across 9 cell types (Supplemental Table 3). SULF2 is expressed in 63% of all tumor cells (n=1,389 of 2215) which is the highest representation among all the cell types. The positivity of SULF1 is the highest in fibroblasts (48%, n=691 of 1,440) compared to <20% in any other cell type (Supplemental Table 3, Figure 4A). SULF1 is expressed in only 14% of tumor cells (n=309 of 2,215).
SULF1 expression is significantly higher in fibroblasts compared to tumor epithelial cells in terms of both percent positivity and expression; in contrast, SULF2 expression has the opposite trend (Figure 4). The mean SULF1 mRNA, represented as log2(TPM+1), is 0.172 in tumor cells compared to 1.099 in fibroblasts (p<0.001); the mean value of SULF2 mRNA is 1.145 in tumor cells compared to 0.386 in fibroblasts (p<0.001) (Figure 4A). The percentage of positive cells among individual patients ranges from 2.4-54.5% for SULF1 and 43.1-91.7% for SULF2 in tumor cells, and from 31.3-71.0% for SULF1 and 8.7-43.8% for SULF2 in fibroblasts (Figure 4B). Interestingly, the mean expression of SULF1 in SULF1-positive cells remains significantly higher in fibroblasts than in tumor cells (2.291 vs 1.232, p<0.001); however, the mean expression of SULF2 in SULF2-positive fibroblasts and tumor cells is the same (1.826 vs 1.812, p=0.8, Figure 4C). These results suggest that SULF1 is expressed to a high degree by a large sub-population of fibroblasts; SULF2 is expressed to the same degree in the fibroblasts and tumor cells but the population of tumor cells expressing SULF2 is much bigger than that of the fibroblasts. We conclude that SULF1 and SULF2 in HNSC derive primarily from the fibroblasts and tumor cells, respectively. The results show that the expression of SULF1 and SULF2 in HNSC is regulated by different mechanisms which leads to an independent regulation of the HS-dependent signaling activities by the two enzymes.
To further strengthen the observation that SULF1 is expressed in fibroblasts and SULF2 in tumor cells, we analyzed the RNA-seq data from the Cancer Cell Line Encyclopedia (CCLE). SULF1 expression in fibroblast cells is distinctly higher compared to all the cancer cell lines (Supplemental Figure 1A) but SULF2 expression is the highest in cell lines from neuroblastoma, HNSC, and other cancers (Supplemental Figure 1B). This suggests that SULF1 expression in fibroblasts of the tumor tissues is not unique for HNSC but more likely a pan-cancer event.
A final demonstration of the expression of SULF1 in fibroblasts comes from our PDX studies of 42 HNSC patients (Figure 5) showing that in all but one case the expression of SULF1 decreases in the PDX compared to the primary tumor (median primary tumor TPM=52.6, median PDX TPM = 1.7, p<0.001; Wilcoxon rank-sum test). This is in line with the expansion of tumor cells and loss of the transplanted stroma commonly observed in the PDX models. In contrast, SULF2 expression in the PDX increases (median primary tumor TPM = 50, median PDX TPM = 103, p<0.01; Wilcoxon rank-sum test) which confirms that the tumor cell is the major source of this enzyme.
SULF1 expression in cancer-associated fibroblasts
Dominant expression of SULF1 in fibroblasts and its increase in HNSC tissues, in spite of low expression in the HNSC cell lines, prompted us to analyze its connection with cancer-associated fibroblasts (CAF). We examined the correlation of genes (n=206) of CAF1, a HNSC CAF defined previouslyl27, with SULF1 in the CPTAC HNSC study. Analysis of the Pearson’s correlation coefficients shows that 122 proteins in the CPTAC dataset are correlated with SULF1 with r>0.55, of which 44 belong to the CAF1 cluster (Supplemental Table 4). The distribution of the correlation coefficients of the CAF1 genes is shifted to significantly higher values compared to other proteins (n=10,073, p<0.001) (Figure 4D). The correlations of SULF2 protein with the CAF1 genes is weaker, as expected, and we observed a similar trend in the RNAseq data from the CPTAC HNSC study (Supplemental Table 4). The expression signature of the HNSC CAF130 overlaps substantially with a COL11A1-expressing CAF which is defined by an invasive pre-metastatic phenotype29,30. This subset of CAF was observed in multiple cancers (HNSC, ovarian, pancreatic, colorectal) which strongly suggests that SULF1 is impactful in multiple cancers, in addition to HNSC.
The association of SULF1 with CAF is further supported by our RNAscope analysis of SULF1 and SULF2 in 20 OSCC patients (Figure 6). The in-situ hybridization clearly shows that SULF1 expressing cells are more common in the stroma (mean 24.9% stroma vs 8.4% tumor, p<0.001) while SULF2 expression is higher in the cancer cells (mean 22.7% stroma vs 52.5% tumor, p<0.001) (Supplemental Table 5). In an exploratory analysis, we separated the OSCC patients into node positive (n=10) and node-negative (n=10) groups and we observe a trend for higher SULF1 and SULF2 expression in the node-positive cases (Figure 6). The results support that SULF1-positive CAF accumulate at the invasive front of the HNSC at a pre-metastatic stage and facilitate local invasion.
DISCUSSION
Previous studies showed that SULF1 and SULF2 are upregulated in several cancers4,16,22,23,34,37,38. SULF2 is considered oncogenic7, the SULF1 impact is more controversial7,39,40. Are there unifying trends in the cancer biology of the heparan 6-O-endosulfatases supporting their impact on HNSC and other cancers? Proteogenomic analysis of the TCGA and CPTAC datasets conclusively documents a significant elevation of SULF1 and SULF2 in multiple cancer tissues compared to adjacent (Table 1) or normal (Supplemental Table 1) counterparts. This is uniformly corroborated by protein increases (Table 2) and at least six cancers (BRCA, HNSC, KIRC, LUSC, LUAD, PAAD) consistently upregulate both SULF1 and SULF2. PDAC mRNA was not reported in the TCGA adjacent normal but both mRNA (compared to normal) and protein show some of the highest increases overall. Other cancers upregulate one of the SULFs (e.g. SULF1 in COAD) or the SULFs remain unchanged, but decreases, like SULF2 in UCEC, are rare.
At the same time, high expression of SULF1 or SULF2 is typically associated with poor survival (Supplemental Table 1B). Among all the cancers examined, we observe most consistent impact of SULF1 and SULF2 in PAAD and HNSC (Figure 1 and 2) but other cancers are affected as well. HNSC is an interesting case because SULF2 negatively affects survival of stage III and IV patients while the impact of SULF1 is more prominent in stage I and II cancer patients (Figure 3). Associations of high expression with improved outcomes are restricted to SULF2 (e.g. in UCEC) but remain exceptional. High SULF1 predicts significantly lower PFI in at least 5 cancers and other malignancies, reported previously, follow a similar trend (Supplemental Table 1B). The consistent increases of SULF1 in cancer tissues and their uniform association with poor survival outcomes are quite remarkable because SULF1 expression in cancer cell lines is typically low20,21 (Supplemental Figure 1). However, our analyses of HNSC (Figure 6) and analyses of other cancers41 show that SULF1 is high in cancer tissues and supplied by the CAF, a cell type associated with cancer invasion, metastasis, and the escape from immune surveillance30,42,43 and so far overlooked in the cancer biology of the SULF enzymes.
Our PDX study of HNSC shows that SULF1, contrary to SULF2, disappears in the transplanted tumors, as expected for a gene expressed by the stromal cells (Figure 5). The scRNAseq data27 show a strong correlation of SULF1 with CAF1 genes (Figure 4) and the genes of the COL11A1-expressing CAF associated with locally invasive pre-metastatic cancer disease29 (Supplemental Table 4). SULF1 is a gene typical of this subtype of CAF in several cancers30 which supports that the CAF supply SULF1 not only in HNSC but in general; the function of SULF1 in the biology of the CAF deserves further attention. Finally, our RNAScope study shows that SULF1+ cells localize to the stroma while SULF2+ cells overlap to a large degree with the cytokeratin+ tumor cells (Figure 3). Our results also suggest that SULF1+ cells in the stroma are more abundant in node positive tumors which is in line with recent papers associating low tumor/stroma ratio44 or the presence of CAF45 with poor HNSC survival outcomes.
In conclusion, our study confirms overexpression of SULF1 and SULF2 in various cancers which is commonly associated with poor survival outcomes; the 6-O-endosulfatases emerge as interesting targets for cancer monitoring and therapeutic intervention. It is expected that the enzymes determine survival of HNSCC patients by adjusting gradients of heparan sulfate binding proteins in the microenvironment of tumors. We have strong evidence that SULF1 is supplied by CAF while SULF2 is provided primarily by the cancer cells which has important consequences because the secreted SULF enzymes act locally due to strong non-covalent interactions with cell surfaces7. In addition, recent data suggest that their activity is regulated by cell-specific posttranslational modifications46. We know that SULFs regulate oncogenic pathways but they also adjust matrix structure, angiogenesis, or immune responses3,4,7,8,10,15–17,47,48 and their function at the tumor/stroma interface in different cancers needs further study.
Supplemental Table 1. A. SULF1 and SULF2 mRNA expression across 32 TCGA studies compared to non-disease tissues from GTEx. SULF1 and SULF2 mRNAs are quantified as log2 (RSEM_counts+1). Differential expressions, represented by log2FC and p-value, were calculated by non-paired t-test. B. Association between SULF mRNA expression and PFI outcomes of patients in 20 TCGA studies. Differential expression of SULFs is represented as log2FC between the paired tumor and normal tissue of patients in each cancer study. The HR and p-values were calculated by the Kaplan-Meier method and the log-rank test using optimal cutoff of SULF expressions in each cancer type (>100 patients, >40 events for PFI).
Supplemental Table 2: Characteristics of the HNSC patients enrolled at the Princess Margaret Cancer Center, University of Toronto.
Supplemental Table 3. Distribution of SULF1- or SULF2-expressing cells among cell-types analyzed in a scRNAseq study of HNSC tumor tissues. The table summarizes single-cell RNA-seq data of 5578 cells from tumor tissues of 18 HNSC patients27. SULF1+ and SULF2+ cells were defined as individual cells with non-zero value of log2(TPM+1). Number and percentage of SULF1+ and SULF2+ cells in each cell type was calculated based on the 5,578 cells.
Supplemental Table 4. Genes (n=100) with the highest SULF1 Pearson’s correlations (FDR<0.05) at the protein level in the CPTAC study28. ns, non-significant
Supplemental Table 5: Percentage of SULF1 and SULF2 positive cells in the tumor and stroma of the 20 OSCC patients (A) and corresponding clinical characteristics of the patients (B).
Acknowledgments
The authors want to thank Dr. Dimitris Anastassiou for his insightful analyses of the CAF.
Footnotes
The authors declare no potential conflicts of interest.