An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis

Jung, Namyoung; Dai, Bo; Gentles, Andrew J.; Majeti, Ravindra; Feinberg, Andrew P.

doi:10.1038/ncomms9489

Download PDF

Article
Open access
Published: 07 October 2015

An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis

Namyoung Jung^1,2^na1,
Bo Dai³^na1,
Andrew J. Gentles⁴,
Ravindra Majeti³ &
…
Andrew P. Feinberg^1,2

Nature Communications volume 6, Article number: 8489 (2015) Cite this article

9841 Accesses
90 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Acute myeloid leukaemia (AML) is characterized by subpopulations of leukaemia stem cells (LSCs) that are defined by their ability to engraft in immunodeficient mice. Here we show an LSC DNA methylation signature, derived from xenografts and integration with gene expression that is comprised of 71 genes and identifies a key role for the HOXA cluster. Most of the genes are epigenetically regulated independently of underlying mutations, although several are downstream targets of epigenetic modifier genes mutated in AML. The LSC epigenetic signature is associated with poor prognosis independent of known risk factors such as age and cytogenetics. Analysis of early haematopoietic progenitors from normal individuals reveals two distinct clusters of AML LSC resembling either lymphoid-primed multipotent progenitors or granulocyte/macrophage progenitors. These results provide evidence for DNA methylation variation between AML LSCs and their blast progeny, and identify epigenetically distinct subgroups of AML likely reflecting the cell of origin.

Acute lymphoblastic leukemia displays a distinct highly methylated genome

Article Open access 19 May 2022

Oncogenic gene expression and epigenetic remodeling of cis-regulatory elements in ASXL1-mutant chronic myelomonocytic leukemia

Article Open access 17 March 2022

Mutant H3 histones drive human pre-leukemic hematopoietic stem cell expansion and promote leukemic aggressiveness

Article Open access 28 June 2019

Introduction

Acute myeloid leukaemia (AML) is an aggressive malignancy of bone marrow precursors defective in their maturation and function¹. A large body of evidence indicates that like normal haematopoiesis, AML is organized as a cellular hierarchy initiated and maintained by a subpopulation of leukaemia stem cells (LSCs)². These LSCs are functionally defined by their ability to transplant disease into immunodeficient mice, and are enriched in the immuno-phenotypically defined CD34+CD38− fraction of leukaemic cells^3,4,5. AML LSCs in turn give rise to clonally related, downstream leukaemic blasts that lack engraftment potential. The clinical significance of this leukaemia stem cell model for AML is highlighted by the finding that LSC gene expression signatures are prognostic for poor outcome in multiple cohorts of AML patients^5,6. As LSCs and their non-engrafting blast progeny are clonally related, a major implication of this leukaemia stem cell model is that their functional properties likely involve epigenetic differences. However, the epigenomic differences that would cause the functional differences between LSCs and their non-stem blast progeny have not been demonstrated experimentally. This would be a key addition to the previous literature since DNA methylation is stably copied during cell division in contrast to more labile patterns of gene expression⁷.

A number of both mouse and human studies have investigated the cell of origin in AML. Mouse studies have typically utilized retroviral oncogene transduction or knock-in models to explore this question and have generally led to the conclusion that committed progenitors, in particular common myeloid progenitors (CMP) and/or granulocyte/macrophage progenitors (GMP), serve as the cell of origin for most AML models. In one study of MN1-induced AML, retroviral transduction of single CMP, but not GMP or haematopoietic stem cells (HSC), resulted in the development of AML, indicating tight restriction of transformation by this oncogene⁸. In a second study using a mouse model of MLL-AF9 AML, the cell of origin influenced biological properties such as gene expression, epigenetics and drug responses⁹. Both of these studies highlight the significance of this question for leukaemogenesis and potential therapies. In contrast to mouse models, inferring the cell of origin in human leukaemia is only possible based on features of the disease. Studies investigating the cell of origin of human AML using surface immunophenotype and gene expression originally suggested AML LSCs arise from HSC¹⁰, but more recent analysis suggests they arise from committed progenitors, including lymphoid-primed multipotent progenitors (L-MPP) and GMP³. Notably, we and others have recently reported that leukaemogenic mutations arise in pre-leukaemic HSC that undergo further clonal evolution to give rise to AML LSC^11,12,13, likely in downstream progenitors as has been demonstrated in chronic myeloid leukaemia (CML)¹⁴. Here we address this question of the cell of origin directly by determining the epigenetic signature of engrafting LSC in AML.

Dysregulation of the epigenome is a common feature in AML, as indicated by the recent discoveries that a number of epigenome-modifying genes are mutated in AML including some involved in the regulation of DNA methylation such as IDH1/2, DNMT3A and TET2, and modulation of chromatin modifications such as ASXL1, EZH2 and others^15,16. Beyond somatic mutations in these epigenome-modifying factors, initial characterization of DNA methylation in bulk AML cells revealed great heterogeneity among patient cases that could be clustered according to their methylation patterns¹⁷. In particular, AML with IDH1 or IDH2 mutations are associated with globally increased DNA methylation^15,17, MLL fusions or mutations in NPM1, DNMT3A or FLT3 were associated with decreased DNA methylation¹⁵.

Epigenetic signatures distinguishing normal haematopoietic stem and progenitor cells (HSPCs) are useful markers for survival analysis in AML¹⁸. However, these epigenetic changes have not previously been linked functionally to AML engraftment potential (that is, the LSC phenotype), and have only been investigated in mixed cell populations. Here we have attempted to define epigenetic differences between LSC and their non-stem blast progeny by testing engraftment capability of fractionated leukaemic cells from AML patients, to define the critical core elements of this key malignant stem cell phenotype. Through this approach, we define a methylation and gene expression-based epigenetic signature for LSC, and find it to be largely independent of genetic mutations and strongly implicating a role for the HOXA gene cluster. Finally, comparison of LSC with normal HSPC suggests that both L-MPP and GMP can give rise to LSC.

Results

AML LSC versus blasts define an LSC epigenetic signature

To formally investigate epigenetic differences between LSC and blast progeny, we sought to identify differentially methylated regions (DMRs) between functionally defined AML LSC-enriched populations and their downstream non-engrafting blasts from a cohort of 15 primary patient samples. We obtained samples from 15 AML patients (Supplementary Tables 1 and 2) and isolated subpopulations based on the expression of CD34 and CD38 including: Lin−CD34+CD38−; Lin−CD34+CD38+; and Lin−CD34− (Supplementary Fig. 1). We then performed comprehensive genome-scale DNA methylation analysis using the Illumina Infinium HumanMethylation450 bead chip array. While AML LSC were originally described to be exclusively contained in the CD34+CD38− subpopulation, recent reports have indicated that leukaemia-initiating cells can also be detected in multiple compartments including both the CD34+CD38+ and CD34− subpopulations, although usually at lower frequencies^3,4,5.

To identify LSC and blast populations, we conducted xenotransplantation assays on all three CD34/CD38 subpopulations from each of the 15 AML cases (Supplementary Table 3). Similar to other reports, leukaemic engraftment was observed from at least one subpopulation in 10 out of 15 AML patients. As expected, LSC activity markedly decreased following the immunophenotypic hierarchy with 64.3% of CD34+CD38−, 46.7% of CD34+CD38+ and 26.7% of CD34− subpopulations engrafting in vivo (Supplementary Table 3). To identify epigenetic markers of functional LSC, we performed DMR analysis between the 20 LSC-containing (engrafting) and 24 blast-containing (non-engrafting) fractions (hereafter termed ‘LSC’ and ‘Blast’, respectively). The analysis identified 3,030 DMRs, of which 91.4% were hypomethylated in LSC (Table 1 and Supplementary Data 1). These DMRs were further classified according to their global genomic location including: islands (regions with a GC content >50% and an observed/expected CpG ratio of more than 0.6), shores (regions within 2 kb of an island), shelves (regions 2–4 kb away from an island) and open sea (isolated CpG sites in the genome without a specific designation). These DMRs were nearly evenly distributed in CpG islands (27.8%) and open seas (29%) (Table 1). In addition, the DMRs correlated with gene expression at CpG islands and open seas (Pearson’s r=−0.12 (P=0.004) and −0.27 (P=0.003), respectively), whereas most hypomethylated DMRs in the engrafting populations were associated with transcriptional upregulation of associated genes (Supplementary Fig. 2).

Table 1 Summary of DMRs identified in the indicated pairwise comparisons.

Full size table

We next sought to integrate DNA methylation with gene expression analysis to identify an LSC epigenetic signature by extracting genes, which passed a DMR P value <0.01 cutoff (permutation test) and exhibited >0.5 log₂ ratio of differential expression between the LSC and Blast populations, with an inverse relationship between gene expression and DNA methylation within 2 kb of the transcriptional start site. We excluded gene body DMRs, as there was no statistically significant positive correlation in AML or normal haematopoiesis comparisons (Supplementary Fig. 3). We applied a minimum absolute value of log₂ ratio of 0.5 differential expression, similarly to our previous LSC gene expression signature using the same microarray platform⁶. With these parameters, we identified 84 regions of 71 unique genes exhibiting differential methylation and gene expression in LSC compared with Blasts (Supplementary Data 2).

We compared our LSC epigenetic signature with the LSC gene expression signatures from previous studies^5,6. Only six out of 71 genes were found in these earlier studies, suggesting most of the genes identified here comprise a novel signature for LSC defined first by DMR analysis and refined by gene expression differences. One gene in this signature, REC8, which encodes a kleisin family protein that is associated with the cohesin complex, was hypomethylated and transcriptionally upregulated in LSC (Fig. 1a and Supplementary Data 2). Notably, mutations of components of the cohesin complex have been identified in AML and other tumour types^19,20. Further experiments will be necessary to determine if increased expression of REC8 alters cohesin complex functions in LSC. We also identified HOXA5, HOXA6, HOXA7, HOXA9 and HOXA10 in the LSC epigenetic signature (Fig. 1b–d and Supplementary Data 2). These HOXA cluster genes were hypomethylated and highly expressed in LSC (Supplementary Data 2). Notably, HOXA9 showed hypomethylation and increased expression in LSC (Fig. 1b and Supplementary Data 2), and aberrant expression of HOXA9 is known to be involved in increased proliferation of HSPCs and leukaemogenesis, suggesting a critical role in LSC activity^21,22,23,24.

**Figure 1: AML LSCs and Blasts exhibit epigenetic differences that define an LSC epigenetic signature.**

Because the mixed-lineage leukaemia (MLL) subtype is itself associated with changes in expression of members of the HOXA gene cluster^25,26, we performed a second DMR and gene expression analysis excluding the five LSC populations from the two MLL patients in our cohort to obtain an LSC epigenetic signature without MLL cases. We observed substantial overlap between the sets of DMRs without MLL cases and with all samples. For the key LSC epigenetic signature 81 of 84 DMRs were present after removal of the MLL cases (Supplementary Table 4). Considering all DMRs with P<0.01 (not just the LSC signature), there was 77% overlap (Supplementary Table 4). The LSC epigenetic signature without MLL cases showed substantial overlap including HOXA genes (Supplementary Table 4). Note that the reduction of DMRs within the LSC epigenetic signature without MLL cases is primarily due to decreased sample numbers of LSC, which reduced statistical power as 5 samples out of 20 LSC were excluded. To clarify this point, we performed an enrichment test to see if the 1,632 DMRs, which were excluded after MLL cases were removed, were still enriched among the DMRs without MLL cases with less strict statistical cutoff (P <0.05 rather than P<0.01, permutation test). We observed statistically significant enrichment of these excluded DMRs in the DMRs without MLL cases (P<2.2 × 10⁻¹⁶, χ²-test). In addition, we performed an enrichment test to examine if the LSC epigenetic signature without MLL cases was enriched in the LSC epigenetic signature with all samples. We observed statistically significant enrichment between those two LSC epigenetic signatures (P<2.2 × 10⁻¹⁶, χ²-test). Thus, the presence of the MLL subtype was not a confounding variable in defining the LSC epigenetic signature with the inclusion of the HOXA cluster.

The LSC epigenetic signature is largely mutation independent

To identify the important pathways and upstream regulators of LSC activity, we utilized Ingenuity Pathway Analysis. The most significantly enriched pathway was fatty acid α oxidation (Supplementary Data 3), and inhibitors of this pathway have been previously shown to induce apoptosis of leukaemia cells²⁷. Ingenuity upstream regulator analysis identified NPM1, ASXL1 and KAT6A as the most significant upstream regulators of the LSC epigenetic signature genes, primarily through regulation of HOXA genes including HOXA5, HOXA6, HOXA7, HOXA9 and HOXA10 (Supplementary Data 4). Significantly, all three of these upstream regulators have been found to be mutated in AML and likely serve as driver genes¹⁵. In particular, mutations in ASXL1 and NPM1 have been shown to cooperate with HOX genes to initiate leukaemia by enhancing self-renewal and proliferation of haematopoietic progenitors^16,28. Consistent with this, we observed that NPM1 mutation was associated with decreased methylation and increased expression of HOXA5, HOXA6, HOXA7, HOXA9 and HOXA10 compared with NPM1 wild-type samples in the The Cancer Genome Atlas (TCGA) cohort (Supplementary Fig. 4).

We then sought to investigate the LSC epigenetic signature for its association with AML mutations in the TCGA cohort¹⁵ (Fig. 2). The TCGA cohort consists of 200 AML patient samples with associated DNA methylation, gene expression and full genotyping from genome/exome sequencing¹⁵. First, we identified the epigenetic signatures associated with individual AML mutations by performing DMR analysis between wild-type and mutant patient samples (Fig. 2). The mutations tested included epigenome-modifying enzymes such as DNMT3A, IDH1/2 and TET1/2, and upstream regulators of our LSC epigenetic signature, NPM1 and ASXL1 (Fig. 2). KAT6A was not included as there was no patient who had this mutation among the patients investigated on methylation arrays. Next, we examined the overlap between the mutation-associated DMRs and our LSC epigenetic signature (Fig. 2). Each LSC epigenetic signature gene was classified into three categories: (1) upstream regulator-associated if differentially methylated in association with any mutation in upstream regulators; (2) epigenome-modifying enzyme-associated if differentially methylated in association with any mutation in epigenetic enzymes; or (3) mutation independent if it was not differentially methylated in association with either upstream regulator or epigenome-modifying enzyme (Fig. 2 and Supplementary Data 5). Of the 84 LSC DMRs, 28 (33.3%) and 27 (32.1%) were associated with upstream regulator or epigenome-modifying enzyme mutations, respectively (Fig. 2 and Supplementary Data 5). However, 40 DMRs (47.6%) including HOXA7 and HOXA9 were mutation-independent targets (Fig. 2 and Supplementary Data 5). It should be noted that some of the LSC differentially methylated genes, including HOXA7 and HOXA9, have multiple DMRs regulated by different mechanisms (Fig. 3). For example, HOXA7 has four DMRs in the LSC epigenetic signature; one associated with mutation in NPM1, two associated with mutation in DNMT3A, TET1 and NPM1, and one mutation independent (Supplementary Data 5). Therefore, we annotated each DMR in those genes differently with DMR numbering such as HOXA9/DMR1 (Figs 2 and 3, and Supplementary Data 5). A small subset (11 signatures) of upstream regulator and epigenetic enzyme-associated LSC epigenetic signatures overlapped, including REC8, HOXA6 and HOXA7 (Fig. 2 and Supplementary Data 5). This analysis showed that all the HOXA genes are epigenetically regulated by at least one upstream regulator, and HOXA6, HOXA7/DMR2 and HOXA7/DMR3 are common targets of both upstream regulators and epigenetic enzymes (Figs 2 and 3, and Supplementary Data 5), and all of these changes involved DNA hypomethylation. In addition, hypomethylation of HOXA7/DMR1 occurred independently of mutations (Figs 2 and 3, and Supplementary Data 5). Together, these results suggest that overexpression of HOXA genes mediated by DNA hypomethylation is a core mechanism for LSC activity.

**Figure 2: The LSC epigenetic signature is partially dependent on underlying somatic mutations.**

**Figure 3: Examples of multiple DMRs for a gene.**

The LSC epigenetic signature is prognostic in human AML

We hypothesized that if the LSC epigenetic signature reflected key drivers of the functional differences between LSC and Blasts, then this signature should be associated with clinical outcomes in human AML. First, we tested the association between the LSC epigenetic signature and overall survival in the DNA methylation data from the TCGA AML cohort¹⁵. To assign each TCGA patient to an LSC-like or Blast-like category, we calculated scores of each TCGA sample based on the probability of being closer to either LSC or Blasts. A comparable number of samples were assigned to each category by this method (99 for Blast like and 93 for LSC like). In univariate survival analysis, the LSC-like group showed worse outcome compared with the Blast-like group (hazard ratio (HR)=2.3, 95% confidence interval (CI)=1.6–3.4; P=1.1 × 10⁻⁵, log-rank test; Fig. 4a). The LSC-like versus Blast-like stratification remained associated with overall survival in multivariate analysis together with other known prognostic factors such as age (considered as a continuous variable), cytogenetic risk (assessed as high versus low risk and intermediate versus low risk), NPM1 and FLT3 mutations (HR=1.9, 95% CI=1.2–2.9; P=0.003, log-rank test; Table 2).

**Figure 4: The LSC epigenetic signature is associated with overall survival in human AML.**

Table 2 Multivariate analysis of overall survival of TCGA patients using either DNA methylation or gene expression.

Full size table

Next, we tested the association between expression of LSC epigenetic signature genes and clinical outcome using four different cohorts including TCGA¹⁵, a cohort of normal karyotype patients^29,30, and two cohorts of mixed karyotype patients^31,32,33. These cohorts consist of a total of 776 AML patients treated on different clinical protocols that also exhibited distinct biological characteristics⁶. We observed a strong correlation between the relative expression of LSC epigenetic signature genes and overall survival in the TCGA cohort (correlation=0.49; P=4 × 10⁻¹³; Supplementary Fig. 5). The more highly expressed a gene was in LSC compared with Blasts, the more robust its association with worse overall survival. In all four cohorts, the overall expression level of the signature genes was significantly associated with overall survival, with higher expression associated with worse clinical outcomes in the TCGA cohort (HR=2.4, 95% CI, 1.6–3.6; P=1x10⁻⁵, log-rank test; Fig. 4b, Supplementary Table 5). This association remained significant in multivariate Cox regression including age (continuous), cytogenetic risk, NPM1 and FLT3 mutations (HR=1.7, 95% CI, 1.0–2.7); P=0.03, log-rank test; Table 2). Similar results were observed for the three other cohorts in univariate and multivariate analyses (Fig. 4c–e, Supplementary Table 5 and Table 3).

Table 3 Multivariate analysis of overall survival of patients in various cohorts using gene expression.

Full size table

Finally, we tested if mutations in epigenetic enzymes such as DNMT3A, IDH1/2, TET2 and ASXL1 affected the prognostic impact of the LSC epigenetic signature in the TCGA cohort. As described previously, mutation in DNMT3A, but none of the other genes, was associated with patient overall survival (Supplementary Table 6). Multivariate survival analysis including DNMT3A mutation showed that our LSC epigenetic signature remained independently associated with clinical outcome in both the DNA methylation and gene expression data from TCGA, even when incorporating cytogenetic risk group (Supplementary Table 7), as well as within the intermediate cytogenetic risk group alone (Supplementary Table 8). Overall, these results demonstrate that the LSC epigenetic signature defined by DNA methylation and gene expression is associated with overall survival in human AML.

Epigenetically distinct LSC reflecting the cell of origin

To address the question of the cell of origin of AML LSC, we first analysed normal haematopoiesis that proceeds through a series of multipotent and oligopotent stem and progenitor cells that progressively lose self-renewal ability and become more restricted in their differentiation potential (Supplementary Fig. 6a). We reasoned that comparison of DNA methylation profiles of AML populations to those of normal HSPC would imply the cell of origin of AML LSC. We obtained bone marrow from five normal donors and isolated HSPC by fluorescence-activated cell sorting (FACS) including: HSC, multipotent progenitors (MPP), L-MPP, CMP, megakaryocyte/erythroid progenitors (MEP) and GMP (Supplementary Fig. 6b and Supplementary Tables 9 and 10).

To further understand epigenetic variation during early human haematopoiesis, we generated genome-scale methylation profiles for these normal HSPC and subjected them to DMR analysis. Multidimensional scaling analysis utilizing the top 1,000 most variable CpG positions revealed tight clustering of human HSPC populations by lineage with no outliers (Fig. 5a). This analysis indicates that DNA methylation reflects the identity of the HSPC populations.

**Figure 5: Comprehensive DNA methylation analysis shows tight clustering of individual HSPC populations.**

The DMRs identified across HSPC potentially reveal novel regulators of haematopoietic lineage development by identifying previously unknown sites of epigenetic variation during haematopoiesis (Fig. 5b and Supplementary Data 1). For example, HMHB1, encoding one of the minor histocompatibility antigens, was found to be hypomethylated in L-MPP and GMP, suggesting a possible role in GMP differentiation (Fig. 5b). Progressive hypomethylation was also identified in MIR539 going from HSC to MEP, suggesting that this microRNA may contribute to erythropoiesis (Fig. 5b). Interestingly, the MIR539 gene is located in the DLK1-DIO3 imprinting region that contains a microRNA cluster involved in leukaemia pathogenesis³⁴. Further validation of these novel candidate regulators will require functional experiments. Genes with an inverse correlation between DNA methylation and gene expression were located outside of the islands themselves, with the strongest correlation at shores and open seas for HSC/GMP and HSC/MEP comparisons (Supplementary Fig. 7). In addition to these comparisons, more than 50% of the DMRs among HSPCs were in open seas (Table 1). Thus, functional epigenetic differences during early human haematopoietic differentiation occur in CpG sparse regions, consistent with other recent studies of differentiation^35,36 and cancer^36,37.

To relate normal haematopoiesis to LSC, we first identified the DMRs from all possible pairwise comparisons among the six HSPCs after applying a more rigorous cutoff of family-wise error rate <0.1 (Supplementary Data 6). The resulting 216 DMRs were applied in clustering analysis including all six normal HSPC populations with LSC and Blasts (Fig. 6a). Strikingly, this analysis revealed that AML samples formed two distinct clusters, L-MPP like and GMP like (Fig. 6a). Importantly, the GMP-like cluster included several CD34+CD38− subpopulations, indicating that these clusters could not have been identified by immunophenotype alone. Moreover, clustering analysis using an equal number of length-matched random regions showed that the clustering of AML populations with either L-MPP or GMP was unique to the selected DMRs (Supplementary Fig. 8).

**Figure 6: Epigenetic signatures define subgroups of AML LSC reflecting the cell of origin.**

Strikingly, using the same 216 DMRs, the TCGA samples also formed the same two major clusters, L-MPP like and GMP like (Fig. 6b). In addition to the two major clusters, we also identified a minor CMP-like cluster that was not observed in our smaller cohort. We calculated scores indicating the similarity of each TCGA sample to each of the six progenitors, and designated a counterpart HSPC population for each TCGA sample based on highest similarity. This approach showed that 76.6% of TCGA samples resembled GMP and 14.6% had a methylation profile most similar to L-MPP (Fig. 6c).

We hypothesized that if the assignment of AML samples to L-MPP-like and GMP-like clusters was related to the cell of origin, then the degree of maturity and morphology might differ between the two groups. Consistent with this, we compared the distribution of the French–American–British (FAB) classification of the TCGA samples and found that the L-MPP-like cases mainly consisted of more immature M0, M1 and M2 types, while the more differentiated M4 and M5 types were enriched in GMP-like AML (P<1 × 10⁻⁴, χ²-test, Fig. 6d). It should be noted that the LSC epigenetic signature is not merely a recapitulation of FAB types, as our signature is prognostic in multivariate analysis while FAB types are not (Supplementary Table 11). In addition, it is not possible to know the cell of origin simply by examining FAB types (Supplementary Table 12). For example, in 38 cases of M1 AML, 29/38 are GMP like and 9/29 are L-MPP like (Supplementary Table 12).

Finally, we sought to investigate if the L-MPP-like and GMP-like clusters, and therefore the potential cell of origin, were associated with cytogenetic abnormalities or recurrent mutations of specific genes including DNMT3A, IDH1, IDH2, TET1, TET2, FLT3 and NPM1. The GMP-like cluster was enriched for patients in the low- and intermediate-cytogenetic-risk groups, while the L-MPP-like cluster was enriched for patients in the high-cytogenetic-risk group (P=1x10⁻⁴, Fisher’s exact test; Supplementary Table 13). We found that IDH1 and IDH2 mutations were enriched in the L-MPP-like group (P<0.01 for both, Fisher’s exact test), and FLT3 and NPM1 mutations were enriched in the GMP-like group (P<0.01 for both, Fisher’s exact test). DNMT3A and TET1 mutations were more enriched in the L-MPP group, but this was not statistically significant (Supplementary Table 14). Together, these results demonstrate that DNA methylation signatures permit a novel clustering of AML into L-MPP-like and GMP-like groups that may reflect the cell of origin for each case and demonstrate an association with key disease features. Ultimately, these results are suggestive of the cell of origin in AML, but definitive proof is not possible in any human primary cancer.

Discussion

The cancer stem cell model was originally proposed based on observations from human AML in which only subpopulations of leukaemia-initiating or LSC possessed engraftment potential^38,39. According to this model, the LSCs give rise to downstream Blasts that lack critical stem cell properties. As LSCs and their non-engrafting Blast progeny are clonally related, a major implication of this leukaemia stem cell model is that their functional properties must be due to epigenetic differences. Here we provide such evidence by characterizing global DNA methylation features of LSC defined by xenotransplantation of AML subpopulations, compared with non-engrafting Blast cells, demonstrating that AML LSCs exhibit global hypomethylation compared with non-LSC Blast cells. Integrating DNA methylation and gene expression analysis, we identified 84 regions of 71 genes as the LSC epigenetic signature. A total of 65 of these 71 genes were not reported in previous gene expression studies for LSC (the exceptions being CD34, SH3BP5, RBPMS, LTB, MS4A3 and VNN1 (refs 5, 6)). Most of the LSC epigenetic signature was mutation independent, not associated with mutations in upstream regulators or epigenome-modifying enzymes suggesting that leukaemogenesis may converge on these primary epigenetic signatures. We also identified some mutation-associated epigenetically dysregulated genes, including REC8 and HOXA7. Together, these epigenetic signatures represent potential therapeutic targets regardless of the different types of the underlying mutations present in individual AML cases. Furthermore, the LSC epigenetic signature was prognostic of patient overall survival independently of known survival predictors such as age and cytogenetic abnormalities, emphasizing its functional importance. Finally, by mapping the epigenome of early normal haematopoiesis, we also determined that epigenetic signatures define clusters of GMP-like and L-MPP-like AML LSCs that likely reflect the cell of origin and are associated with key clinical features including cytogenetic abnormalities and molecular mutations.

Apart from its prognostic effect, the LSC epigenetic signature represents a molecular target that may improve patient survival and prevent relapse. Recently, epigenetic therapy with hypomethylating agents such as azacytidine and decitabine has been approved for the treatment of AML. Randomized trials demonstrated improved overall survival compared with chemotherapy, but also indicated limited effect on relapse rate in high-risk AML⁴⁰. Our results indicate that LSCs are relatively hypomethylated compared with Blasts, suggesting that they may be less effectively targeted by hypomethylating agents, possibly accounting for their limited efficacy in relapse-free survival. It would be of great interest to see how the LSC epigenetic signature is affected by these drugs.

More specifically, this LSC epigenetic signature was markedly enriched for the members of the HOXA cluster, suggesting this cluster is a key driver of LSC function. The HOXA cluster has been implicated as a key regulator of haematopoiesis and myeloid malignancy⁴¹. In particular, HOXA9 is known to be involved in increased proliferation of HSPC and leukaemogenesis²¹, even occurring as a fusion oncogene in rare cases⁴². Moreover, increased expression of HOXA9 has been found to be an adverse prognostic factor in AML⁴³. Other HOXA family members including posterior (HOXA7 and HOXA10) and anterior (HOXA6) members have been implicated in leukaemogenesis, as overexpression of these genes in normal mouse HSPC leads to increased self-renewal, transformation and development of myeloid malignancies⁴⁴. The functional LSC epigenetic signature provided here demonstrates that the HOXA family is a key driver of AML LSC that may function in imparting aberrant self-renewal.

As DNA methylation is a potential marker of cell identity, here we compared DNA methylomes of normal HSPC with LSC as a marker for the cell of origin in AML. Using this approach, we observed two subtypes in our cohort: L-MPP-like and GMP-like. These two subtypes were also identified in the TCGA cohort, suggesting that leukaemic transformation predominantly occurs at either the L-MPP or GMP stage of haematopoietic development. We found that other features of AML were associated with these two subtypes including FAB type, several mutations and cytogenetic abnormalities, suggesting that the cell of origin may drive key clinical features in AML.

We newly identified a small subset of AML cells clustering with CMP and few samples clustering with HSC, MPP and MEP that could not be identified in smaller data sets including our own data and a previous study³. The result from TCGA indicates that the cell of origin of AML could be variable among HSPC. The result for the cell of origin should be considered carefully, as it is hard to provide definite proof of the question due to experimental limitations. Despite this caveat that affects all of human primary cancer biology, we do provide the first epigenetic evidence for cell of origin in human leukaemia and believe that our approach using epigenomic profiles suggests an efficient way to study cell of origin in cancer biology using large data sets.

Methods

Human samples

Fresh human bone marrow mononuclear cells (BMMCs) from healthy donors (2 × 10⁸ cells per donor, Catalogue#: ABM006) were purchased from ALLCELLS (Emeryville, CA). A CD34+ cell-enrichment step was performed with the human progenitor cell-enrichment kit with CD61 depletion (Stem Cell Technologies, Canada, Catalogue# 19356) on a RobSep machine from the same company. Human AML samples were collected from patient peripheral blood or bone marrow at Stanford Hospital, according to a Stanford-IRB-approved protocol (22264), and informed consent was obtained from all subjects. Peripheral blood mononuclear cells (PBMCs) or BMMCs were separated with Ficoll-Paque Plus (Amersham Biosciences, Piscataway, NJ, Catalogue number: 17-1440-03), and cryopreserved in 1 × freezing medium (90% fetal bovine serum (FBS)+10% dimethylsulfoxide). All the AML experiments were conducted with cryopreserved PBMC or BMMC samples that were thawed and washed in IMDM medium containing 10% FBS.

TCGA patient sample data

DNA methylation, gene expression, somatic mutation and clinical information were derived from the TCGA Research Network: http://cancergenome.nih.gov/.

Flow cytometry analysis and cell sorting

A battery of antibodies (Supplementary Table 10) was used for staining, analysis and sorting of progenitor cells from either healthy BMMCs or AML patient PBMCs/BMMCs, as well as lineage analysis human chimerism/engraftment. Cells were either analysed or sorted using a FACS Aria II cytometer (BD Biosciences, Franklin Lakes, NJ). Analysis of flow cytometry raw data was done with FlowJo Software (Treestar, Ashland, OR).

Xenotransplantation assay

NOD.Cg-Prkdc^scid Il2rg^tm1Wjl/SzJ mice (NSG) were obtained from The Jackson Laboratory (Bar Harbor, ME) and bred in a specific pathogen-free environment per Stanford Administrative Panel on Laboratory Animal Care Guidelines (Protocol 22264). Six- to eight-week-old female adult mice were exposed to 200 rad of gamma irradiation at least 2 h (up to 24 h) before transplantation. Up to 500 thousand fresh-sorted AML cell subpopulation were resuspended in 30 μl of Hank’s balanced salt solution (Gibco Life Technologies, Grand Island, NY) containing 2% FBS, and injected intravenously via the tail vein using a 29-gauge needle. For each cell subpopulation, at least three technical replicates were performed by transplantation of three aliquot of cells into three mice. It was counted as ‘Yes’ if at least one of the three mice was engrafted (Supplementary Table 3). Around 150 mice in total were used. Neither randomization nor blinding was used for this study.

After 8 weeks, mice were euthanized with CO₂ according to Stanford-IRB-approved protocol (22264). Bone marrows were isolated using scissors and needle flashing, then underwent hypotonic red cell lysis using ACK (ammonium–chloride–potassium) lysing buffer (Life Technologies, Grand Island, NY, Catalogue# A10492). BMMCs were stained with antibody combinations (Supplementary Table 10) on ice for 30 min, and dead cells were excluded by propidium iodide staining. Human myeloid engraftment (hCD45+CD33+) and lymphoid engraftment (hCD45+CD19+) were analysed on flow as described before.

Illumina infinium human methylation 450 bead array assay

Genomic DNA from each sample was purified using the MasterPure DNA purification kit (Epicentre) according to the manufacturer’s protocol. The genomic DNA (250–500 ng) was treated with sodium bisulfate using the Zymo EZ DNA Methylation Kit (ZYMO Research) as recommended by the manufacturer, with the alternative incubation conditions for the Illumina Infinium Methylation Assay. Converted DNA was eluted in 11 μl of elution buffer. DNA methylation level was measured using Illumina Infinium HD Methylation Assay (Illumina) according to the manufacturer’s specifications. Methylation array data are deposited at the Gene Expression Omnibus (GEO) with accession number GSE63409.

Illumina infinium human methylation 450 bead array analysis

Raw intensity files were obtained using minfi package⁴⁵ to calculate methylation ratios (Beta values). The data was normalized using Illumina preprocessing method implemented in minfi. Several quality control measures were applied to remove arrays with low quality. Control probes were examined on the 450k array to assess several measures including bisulfite conversion, extension, hybridization, specificity and others. One of the MPP samples (BM2712) and two samples of TCGA (patient IDs: 2934 and 2827) showed low quality for the measures, so they were removed for further analysis. Next, median methylated and unmethylated signals were calculated for each arrays; no array was identified for signal values lower than 10.5. For multidimensional scaling analysis, probes containing an annotated single-nucleotide polymorphism (SNP) (dbSNP137) at the single-base extension or CpG sites were removed (17,398 probes removed). Minfi 1.8.9 was used.

Bump hunting method previously described was applied to identify DMRs in 450k array^45,46. Beta value of 0.1 (10% of methylation difference) was used as cutoff when finding DMRs. Statistical significance was assigned by permutation testing and the P value cutoff used for downstream analysis was <0.01 that corresponded to Benjamini–Hochberg adjusted P value <0.1 (data not shown) unless different cutoff was designated in result part. Bumphunter 1.2.0 was used. Same method was applied to identify DMRs for the second DMR analysis of LSC versus Blast that we removed five LSC cases from two MLL patients (SU042 and SU046).

Bisulfite pyrosequencing

A unit of 100 ng of genomic DNA from each sample was treated with sodium bisulfate using an EZ DNA Methylation Gold Kit (ZYMO Research) following the manufacturer’s protocol. The bisulfate-treated DNA was PCR amplified using unbiased nested primers. Quantitative pyrosequencing was performed using a PSQ HS96 (Biotage) to validate DMR regions. The DNA methylation percentage at each CpG site was measured using the Q-CpG methylation software (Biotage). SssI-treated human genomic DNA was used as 100% methylated controls and human genomic DNA amplified by Repli-G mini kit (Qiagen) was used as the non-methylated (0%) DNA control. Supplementary Table 15 provides the primer sequence used for the pyrosequencing reactions with the chromosomal coordinates in the University of California at Santa Cruz February 2009 human genome assembly (hg19) for each CpG site investigated.

Affymetrix microarray expression analysis

Total RNA was extracted from each FACS-sorted cell population using RNeasy Plus Mini (QIAGEN, Valencia, CA, Catalogue#: 74134) according to the manufacture’s protocol. All RNA samples were quantified with 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA), subjected to reverse transcription, two consecutive rounds of linear amplification, and production and fragmentation of biotinylated cRNA. A unit of 15 μg of cRNA from each sample was hybridized to HG U133 Plus 2.0 microarrays. Hybridization and scanning were performed according to the manufacture’s instruction (Affymetrix). This step was performed at the PAN Center of Stanford University. Data were normalized by GC robust multi-array average method and analysed on R/Bioconductor. SU042 CD34+38+, BM2770 GMP, BM2759 L-MPP, BM2761 CMP and BM2770 CMP were removed from further analysis due to low quality. SU001 was excluded, as the samples from this patient were not included on expression array (GEO accession code: GSE63270).

Sanger sequencing to detect AML mutations

Genomic DNA was extracted from patient BMMC or PBMC using QIAmp DNA Mini Kit (QIAGEN, Valencia, CA, Catalogue#: 51304) according to the manufacture’s instruction. PCR primers were designed to cover exon 3–11 of TET2, exon 4 of IDH1/2 and exon 7–23 of DNMT3A (Supplementary Table 16). The PCR reaction premix consists of 1 × of OneTaq 2 × Master Mix (NEB, Ipswich, MA, Catalogue#: M0482L), 0.2 μM forward and reverse primers, respectively, and 10 ng (up to 100 ng) genomic DNA as template. The reaction was under the condition of 95 °C initial denaturation for 30 s, 45 cycles of extension containing 94 °C for 30 s, 56 °C for 1 min (or as necessary) and 72 °C for 1 min, and a final extension at 72 °C 5 min. The PCR products were concentrated with PCR purification kit (QIAGEN, Valencia, CA, Catalogue#: 28106), then submitted to Sequentech (Mountain View, CA) for sequencing of both forward and reverse directions using 3730xl DNA Analyzer (Applied Biosystems, Foster City, CA) according to the manufacturer’s instruction. The sequencing data were analysed using Sequencher 5.1 (Gene Codes Corporation, Ann Arbor, MI), and SNP was excluded by checking NCBI website before getting the final mutation results.

Survival analysis

Survival analysis was performed to assess the association of LSC DNA methylation and gene expression signatures with clinical outcome (overall survival) in four different cohorts. For DNA methylation data set (TCGA), patients were separated into two groups; LSC like and Blast like based on the methylation profile of each individual. Survival was compared between the two groups using the coxph function in R (survival package 2.37), with significance assessed by log-rank test. For gene expression, the genes in the LSC epigenetic signature were identified in expression data sets for which survival outcomes were available. The first principal component of their expression levels was computed, and patients were stratified as ‘high’ or ‘low’ relative to its median value. Survival differences between the groups were assessed by log-rank test. In multivariate analyses, age was incorporated as a continuous variable, mutations were coded as present/absent (1/0) and assessment of cytogenetic risk was treated as individual groups and done for intermediate versus low risk and high versus low risk (Supplementary Software 1). Analysis was also performed within intermediate-risk groups.

Statistical analysis

To assign cell identity of LSC/Blast to TCGA samples, mean methylation value of each LSC epigenetic signature (84 DMRs) for LSC/Blast (methylation profile) was retrieved and s.d. of the mean value for each signature was calculated. Then scores (probability density values as log value) for each TCGA sample regarding LSC and Blast profile was calculated using dnorm function with the mean and s.d. calculated in previous step. Maximum value of scores between the ones regarding LSC and Blast methylation profile was chosen, and then cell identity assigned. Same strategy was applied using the 216 DMRs for normal haematopoiesis to assign normal progenitor identity to TCGA samples in Fig. 6c. R code is available on request.

For clustering analysis, hclust function with ward method in R was used to generate all the cluster dendrogram analysis.

The test to examine the enrichment of the excluded DMRs after removing MLL cases in DMRs without MLL cases using a less strict statistical cutoff (P<0.05) was a χ²-test based on the number of overlapping DMRs between the excluded DMRs after removing MLL cases and DMRs without MLL cases (P<0.05), then DMRs only in the excluded DMRs and DMRs only in DMRs without MLL cases (P<0.05), and finally random DMRs from array background that do not overlap with either the excluded DMRs or DMRs without MLL cases (P<0.05).

Bioinformatics analysis

QIAGEN’s Ingenuity Pathway Analysis (Ingenuity Systems, www.ingenuity.com) was performed for pathway analysis.

Additional information

Accession codes: Methylation array data and microarray data have been deposited in the Gene Expression Omnibus (GEO) under accession codes GSE63409 and GSE63270, respectively.

How to cite this article: Jung, N. et al. An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis. Nat. Commun. 6:8489 doi: 10.1038/ncomms9489 (2015).

Accession codes

Accessions

Gene Expression Omnibus

References

Lowenberg, B., Downing, J. R. & Burnett, A. Acute myeloid leukemia. N. Engl. J. Med. 341, 1051–1062 (1999).
Article CAS Google Scholar
Majeti, R. Monoclonal antibody therapy directed against human acute myeloid leukemia stem cells. Oncogene 30, 1009–1019 (2011).
Article CAS Google Scholar
Goardon, N. et al. Coexistence of LMPP-like and GMP-like leukemia stem cells in acute myeloid leukemia. Cancer Cell 19, 138–152 (2011).
Article CAS Google Scholar
Sarry, J. E. et al. Human acute myelogenous leukemia stem cells are rare and heterogeneous when assayed in NOD/SCID/IL2Rgammac-deficient mice. J. Clin. Invest. 121, 384–395 (2011).
Article CAS Google Scholar
Eppert, K. et al. Stem cell gene expression programs influence clinical outcome in human leukemia. Nat. Med. 17, 1086–1093 (2011).
Article CAS Google Scholar
Gentles, A. J., Plevritis, S. K., Majeti, R. & Alizadeh, A. A. Association of a leukemic stem cell gene expression signature with clinical outcomes in acute myeloid leukemia. JAMA 304, 2706–2715 (2010).
Article CAS Google Scholar
Issa, J. P. DNA methylation as a clinical marker in oncology. J. Clin. Oncol. 30, 2566–2568 (2012).
Article CAS Google Scholar
Heuser, M. et al. Cell of origin in AML: susceptibility to MN1-induced transformation is regulated by the MEIS1/AbdB-like HOX protein complex. Cancer Cell 20, 39–52 (2011).
Article CAS Google Scholar
Krivtsov, A. V. et al. Cell of origin determines clinically relevant subtypes of MLL-rearranged AML. Leukemia 27, 852–860 (2013).
Article CAS Google Scholar
Kreso, A. & Dick, J. E. Evolution of the cancer stem cell model. Cell Stem Cell 14, 275–291 (2014).
Article CAS Google Scholar
Corces-Zimmerman, M. R., Hong, W. J., Weissman, I. L., Medeiros, B. C. & Majeti, R. Preleukemic mutations in human acute myeloid leukemia affect epigenetic regulators and persist in remission. Proc. Natl Acad. Sci. USA 111, 2548–2553 (2014).
Article CAS ADS Google Scholar
Jan, M. et al. Clonal evolution of preleukemic hematopoietic stem cells precedes human acute myeloid leukemia. Sci. Transl. Med. 4, 149ra118 (2012).
Article Google Scholar
Shlush, L. I. et al. Identification of pre-leukaemic haematopoietic stem cells in acute leukaemia. Nature 506, 328–333 (2014).
Article CAS ADS Google Scholar
Jamieson, C. H. et al. Granulocyte-macrophage progenitors as candidate leukemic stem cells in blast-crisis CML. N. Engl. J. Med. 351, 657–667 (2004).
Article CAS Google Scholar
Cancer Genome Atlas Research Network. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N. Engl. J. Med. 368, 2059–2074 (2013).
Abdel-Wahab, O. et al. ASXL1 mutations promote myeloid transformation through loss of PRC2-mediated gene repression. Cancer Cell 22, 180–193 (2012).
Article CAS Google Scholar
Figueroa, M. E. et al. Leukemic IDH1 and IDH2 mutations result in a hypermethylation phenotype, disrupt TET2 function, and impair hematopoietic differentiation. Cancer Cell 18, 553–567 (2010).
Article CAS Google Scholar
Bartholdy, B. et al. HSC commitment-associated epigenetic signature is prognostic in acute myeloid leukemia. J. Clin. Invest. 124, 1158–1167 (2014).
Article CAS Google Scholar
Losada, A. Cohesin in cancer: chromosome segregation and beyond. Nat. Rev. Cancer 14, 389–393 (2014).
Article CAS Google Scholar
Thol, F. et al. Mutations in the cohesin complex in acute myeloid leukemia: clinical and prognostic implications. Blood 123, 914–920 (2014).
Article CAS Google Scholar
Thorsteinsdottir, U. et al. Overexpression of the myeloid leukemia-associated Hoxa9 gene in bone marrow cells induces stem cell expansion. Blood 99, 121–129 (2002).
Article CAS Google Scholar
Chung, K. Y. et al. Enforced expression of NUP98-HOXA9 in human CD34(+) cells enhances stem cell proliferation. Cancer Res. 66, 11781–11791 (2006).
Article CAS Google Scholar
Takeda, A., Goolsby, C. & Yaseen, N. R. NUP98-HOXA9 induces long-term proliferation and blocks differentiation of primary human CD34+ hematopoietic cells. Cancer Res. 66, 6628–6637 (2006).
Article CAS Google Scholar
Lehnertz, B. et al. The methyltransferase G9a regulates HoxA9-dependent transcription in AML. Genes Dev. 28, 317–327 (2014).
Article CAS Google Scholar
Milne, T. A. et al. MLL targets SET domain methyltransferase activity to Hox gene promoters. Mol. Cell 10, 1107–1117 (2002).
Article CAS Google Scholar
Drabkin, H. A. et al. Quantitative HOX expression in chromosomally defined subsets of acute myelogenous leukemia. Leukemia 16, 186–195 (2002).
Article CAS Google Scholar
Samudio, I. et al. Pharmacologic inhibition of fatty acid oxidation sensitizes human leukemia cells to apoptosis induction. J. Clin. Invest. 120, 142–156 (2010).
Article CAS Google Scholar
Vassiliou, G. S. et al. Mutant nucleophosmin and cooperating pathways drive leukemia initiation and progression in mice. Nat. Genet. 43, 470–475 (2011).
Article CAS Google Scholar
Dufour, A. et al. Acute myeloid leukemia with biallelic CEBPA gene mutations and normal karyotype represents a distinct genetic entity associated with a favorable clinical outcome. J. Clin. Oncol. 28, 570–577 (2010).
Article CAS Google Scholar
Metzeler, K. H. et al. An 86-probe-set gene-expression signature predicts survival in cytogenetically normal acute myeloid leukemia. Blood 112, 4193–4201 (2008).
Article CAS Google Scholar
Valk, P. J. et al. Prognostically useful gene-expression profiles in acute myeloid leukemia. N. Engl. J. Med. 350, 1617–1628 (2004).
Article CAS Google Scholar
Wouters, B. J. et al. Double CEBPA mutations, but not single CEBPA mutations, define a subgroup of acute myeloid leukemia with a distinctive gene expression profile that is uniquely associated with a favorable outcome. Blood 113, 3088–3091 (2009).
Article CAS Google Scholar
Wilson, C. S. et al. Gene expression profiling of adult acute myeloid leukemia identifies novel biologic clusters for risk classification and outcome prediction. Blood 108, 685–696 (2006).
Article CAS Google Scholar
Benetatos, L. et al. The microRNAs within the DLK1-DIO3 genomic region: involvement in disease pathogenesis. Cell. Mol. Life Sci. 70, 795–814 (2013).
Article CAS Google Scholar
Ji, H. et al. Comprehensive methylome map of lineage commitment from haematopoietic progenitors. Nature 467, 338–342 (2010).
Article CAS ADS Google Scholar
Irizarry, R. A. et al. The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat. Genet. 41, 178–186 (2009).
Article CAS Google Scholar
Hansen, K. D. et al. Increased methylation variation in epigenetic domains across cancer types. Nat. Genet. 43, 768–775 (2011).
Article CAS Google Scholar
Bonnet, D. & Dick, J. E. Human acute myeloid leukemia is organized as a hierarchy that originates from a primitive hematopoietic cell. Nat. Med. 3, 730–737 (1997).
Article CAS Google Scholar
Lapidot, T. et al. A cell initiating human acute myeloid leukaemia after transplantation into SCID mice. Nature 367, 645–648 (1994).
Article CAS ADS Google Scholar
Estey, E. H. Epigenetics in clinical practice: the examples of azacitidine and decitabine in myelodysplasia and acute myeloid leukemia. Leukemia 27, 1803–1812 (2013).
Article CAS Google Scholar
Alharbi, R. A., Pettengell, R., Pandha, H. S. & Morgan, R. The role of HOX genes in normal hematopoiesis and acute leukemia. Leukemia 27, 1000–1008 (2013).
Article CAS Google Scholar
Nakamura, T. et al. Fusion of the nucleoporin gene NUP98 to HOXA9 by the chromosome translocation t(7;11)(p15;p15) in human myeloid leukaemia. Nat. Genet. 12, 154–158 (1996).
Article CAS Google Scholar
Golub, T. R. et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999).
Article CAS Google Scholar
Bach, C. et al. Leukemogenic transformation by HOXA cluster genes. Blood 115, 2910–2918 (2010).
Article CAS Google Scholar
Aryee, M. J. et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30, 1363–1369 (2014).
Article CAS Google Scholar
Jaffe, A. E. et al. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int. J. Epidemiol. 41, 200–209 (2012).
Article Google Scholar

Download references

Acknowledgements

We thank P. Lovelace and M. Jan for FACS training, F. Zhao for mouse training and lab management, E. Briem and A. Runarsson for array hybridization, the Stanford Hematology Division Tissue Bank and patients for the AML samples, R. Irizarry for help with statistical analysis of 450k array data and development of statistical method for cell identity assignment, P. Murakami for help with the correlation analysis between DNA methylation and gene expression, H. Ji for help with genomic DNA extraction of normal bone marrow samples, and S. Springer for help with bisulfite pyrosequencing. This work was supported by National Institutes of Health grants U01HL099997 (R.M. and A.P.F.), RO1CA054358 (A.P.F.), R01CA188055 (R.M.), U01HL099999 (R.M.) and U54CA149145 (A.J.G.). R.M. is a New York Stem Cell Foundation, Robertson Investigator.

Author information

Namyoung Jung and Bo Dai: These authors contributed equally to this work.

Authors and Affiliations

Center for Epigenetics, Johns Hopkins University School of Medicine, Baltimore, 21205, Maryland, USA
Namyoung Jung & Andrew P. Feinberg
Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, 21205, Maryland, USA
Namyoung Jung & Andrew P. Feinberg
Department of Medicine, Division of Hematology, Cancer Institute, and Institute for Stem Cell Biology and Regenerative Medicine, School of Medicine, Stanford University, Stanford, 94305, California, USA
Bo Dai & Ravindra Majeti
Department of Radiology, and Center for Cancer Systems Biology, School of Medicine, Stanford University, Stanford, 94305, California, USA
Andrew J. Gentles

Authors

Namyoung Jung
View author publications
You can also search for this author in PubMed Google Scholar
Bo Dai
View author publications
You can also search for this author in PubMed Google Scholar
Andrew J. Gentles
View author publications
You can also search for this author in PubMed Google Scholar
Ravindra Majeti
View author publications
You can also search for this author in PubMed Google Scholar
Andrew P. Feinberg
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.D. carried out experiments including flow cytometry analysis and cell sorting, xenotransplantation assay, gene expression array and Sanger sequencing; N.J. performed and analysed DNA methylation arrays and bisulfite pyrosequencing, and performed gene expression analysis; N.J. and A.J.G. performed survival analysis and statistical analysis; N.J., B.D., A.J.G., A.P.F. and R.M. wrote the paper; R.M. and A.P.F. conceived the strategy, oversaw the experiments and provided overall guidance and interpretation of the results; R.M. and A.P.F. contributed equally.

Corresponding authors

Correspondence to Ravindra Majeti or Andrew P. Feinberg.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-8, Supplementary Tables 1-16 and Supplementary References (PDF 2874 kb)

Supplementary Data 1

DMR lists for the comparisons in Table1 (XLSX 1029 kb)

Supplementary Data 2

LSC epigenetic signature (XLSX 17 kb)

Supplementary Data 3

Ingenuity pathway analysis (XLSX 13 kb)

Supplementary Data 4

Ingenuity upstream regulator analysis (XLSX 40 kb)

Supplementary Data 5

Association of LSC epigenetic signature with DMRs of genetic mutations (XLSX 65 kb)

Supplementary Data 6

DMRs for normal hematopoiesis (XLSX 52 kb)

Supplementary Software 1

R script for multivariate survival analysis (DOCX 98 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Jung, N., Dai, B., Gentles, A. et al. An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis. Nat Commun 6, 8489 (2015). https://doi.org/10.1038/ncomms9489

Download citation

Received: 02 June 2015
Accepted: 27 August 2015
Published: 07 October 2015
DOI: https://doi.org/10.1038/ncomms9489

This article is cited by

MethScore as a new comprehensive DNA methylation-based value refining the prognosis in acute myeloid leukemia
- Šárka Šestáková
- Cyril Šálek
- Hana Remešová
Clinical Epigenetics (2024)
Quantification of hematopoietic stem and progenitor cells by targeted DNA methylation analysis
- Ledio Bocova
- Wouter Hubens
- Wolfgang Wagner
Clinical Epigenetics (2023)
Investigation of measurable residual disease in acute myeloid leukemia by DNA methylation patterns
- Tanja Božić
- Chao-Chung Kuo
- Wolfgang Wagner
Leukemia (2022)
A primary hierarchically organized patient-derived model enables in depth interrogation of stemness driven by the coding and non-coding genome
- Héléna Boutzen
- Seyed Ali Madani Tonekaboni
- John E. Dick
Leukemia (2022)
Mitochondrial metabolism as a potential therapeutic target in myeloid leukaemia
- Lucie de Beauchamp
- Ekaterini Himonas
- G. Vignir Helgason
Leukemia (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

AML LSC versus blasts define an LSC epigenetic signature

The LSC epigenetic signature is largely mutation independent

The LSC epigenetic signature is prognostic in human AML

Epigenetically distinct LSC reflecting the cell of origin

Discussion

Methods

Human samples

TCGA patient sample data

Flow cytometry analysis and cell sorting

Xenotransplantation assay

Illumina infinium human methylation 450 bead array assay

Illumina infinium human methylation 450 bead array analysis

Bisulfite pyrosequencing

Affymetrix microarray expression analysis

Sanger sequencing to detect AML mutations

Survival analysis

Statistical analysis

Bioinformatics analysis

Additional information

Accession codes

Accessions

Gene Expression Omnibus

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links