Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Genomic and transcriptomic analyses reveal a tandem amplification unit of 11 genes and mutations of mismatch repair genes in methotrexate-resistant HT-29 cells

View ORCID ProfileAhreum Kim, Jong-Yeon Shin, Jeong-Sun Seo
doi: https://doi.org/10.1101/2020.02.26.965814
Ahreum Kim
1Precision Medicine Center, Seoul National University Bundang Hospital, Seongnamsi 13605, Korea
2Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul 03080, Republic of Korea
3CHA University School of Medicine, Seongnam, South Korea
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ahreum Kim
Jong-Yeon Shin
1Precision Medicine Center, Seoul National University Bundang Hospital, Seongnamsi 13605, Korea
4Macrogen Inc., Seoul 08511, Korea
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jeong-Sun Seo
1Precision Medicine Center, Seoul National University Bundang Hospital, Seongnamsi 13605, Korea
2Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul 03080, Republic of Korea
4Macrogen Inc., Seoul 08511, Korea
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jeongsunseo@gmail.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

DHFR gene amplification is present in methotrexate (MTX)-resistant colon cancer cells and acute lymphoblastic leukemia. However, little is known about DHFR gene amplification due to difficulties in quantifying amplification size and recognizing the repetitive rearrangements involved in the process. In this study, we have proposed an integrative framework to characterize the amplified region by using a combination of single-molecule real time sequencing, next-generation optical mapping, and chromosome conformation capture (Hi-C). Amplification of the DHFR gene was optimized to generate homogenously amplified patterns. The amplification units of 11 genes, from the DHFR gene to the ATP6AP1L gene position on chromosome 5 (~2.2Mbp), and a twenty-fold tandemly amplified region were verified using long-range genome and RNA sequencing data. In doing so, a novel inversion at the start and end positions of the amplified region as well as frameshift insertions in most of the MSH and MLH genes were detected. These might stimulate chromosomal breakage and cause the dysregulation of mismatch repair pathways. Using Hi-C technology, high adjusted interaction frequencies were detected on the amplified unit and unsuspected position on 5q, which could have a complex network of spatial contacts to harbor gene amplification. Characterizing the tandem gene-amplified unit and genomic variants as well as chromosomal interactions on intra-chromosome 5 can be critical in identifying the mechanisms behind genomic rearrangements. These findings may give new insight into the mechanisms underlying the amplification process and evolution of drug resistance.

Introduction

Gene amplification, the triggering of an abnormal copy number increase in a specific region of the genome of an organism growing in a selective condition, is associated with overexpression of oncogenes such as MYC, MYCN, and ERBB, which engender abnormal cell proliferation and replication1, 2, 3. These genes undergo amplification much more frequently than would be caused by a genomic mutation event in mammalian cells, since the rate of gene amplification is greater than the mutation rate 4; thus, such amplication has great tumorigenic potential5.

However, no molecularly targeted agents have been specifically developed to prevent gene amplification because of the associated chromosomal complexity and technical limitations. It is therefore critical to find predictive and prognostic biomarkers for gene amplification to develop specific medicines that will improve patient outcomes and optimize therapeutic decisions6, 7.

In addition, gene amplification is an indicator of a drug-resistant sample in cancer and healthy cells8, so it will be important to identify the genetic features or pathways that promote amplification in tumors. These mechanisms might serve as therapeutic targets that can prevent drug resistance and arrest or eradicate a tumor9. However, the molecular mechanisms that contribute to high gene copy numbers are completely unknown, since sequence alignment and assembly programs that rely on short reads are not equipped to deal with genomic rearrangements and repetitive sequences10, 11.

DHFR gene amplification at chromosome 5 has been a hallmark of methotrexate (MTX) responsiveness and resistance in colon cancer cells and acute lymphoblastic leukemia. MTX is an antifolate drug that inhibits dihydrofolate reductase (DHFR) by preventing DNA synthesis and cell division12, 13, 14.

The amplified DHFR gene generates two major DNA segments consisting of extrachromosomal double-minutes (DMs) and intrachromosomal homogenously staining chromosome regions (HSRs); however, the molecular mechanisms behind these amplified region products and suitable methods for their detection and characterization are still unkown. Currently, these segments are detected only in advanced-stage tumors and accompany more repetitive and complicated sequences to assemble15, 16,17.

In this study, single-molecule real time (PacBio SMRT) sequencing, optical genome mapping (BioNano Genomics; read size: ~10Kb), and high throughput chromosome conformation capture (Hi-C) for inter- and intra-chromosomal interactions are used to identify relevant repetitive rearrangements with amplified segments and interpret gene amplification mechanisms in a MTX-resistant colon cancer cell line (HT-29) (Fig. 1), which has a heterogeneously amplified genome.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1. Visualization of DHFR-amplification patterns using FISH and karyoptyping between MTX-resistant clones (C1-2-4) and controls.

A subculture (C1-2-4) of the C1-2 clone was visualized via fluorescent in situ hybridization (FISH), at 1000x magnification, and the FISH signal type and percentage at metaphase were compared between controls and MTX-resistant clones (C-1-2-4). All chromosomes were karyotyped and abnormal chromosomal shapes were detected at the 5q and 17q arms, compared to controls, as indicated by the red arrow.

These techniques allowed for accurate quantification of amplification size and identification of drastic differences in chromosomal abnormalities and structural variants compared to MTX-sensitive sample, which are difficult to extensively analyze using short-read sequencing and fluorescence in situ hybridization (FISH). Accordingly, this study contributes to an understanding of basic genomic principles, impacts of genetic rearrangements on cancer cells, and by extension, drug resistance mechanisms.

Results

Determination of MTX-resistant HT-29 cells

While generating MTX-resistant HT-29 cells (selected clone: C1-2) from singe-cell selection and MTX sensitization as previously described15, dramatic morphological changes in the cells themselves were detected in the form of rounded and circular shapes at the 1st cycle of sensitization as well as rod and irregular shapes at the 2nd cycle (Supplementary Fig. 1). The original shapes were again observed at the 3rd cycle of sensitization, which might indicate that the HT-29 cells were becoming resistant to MTX and rapidly grew up under high MTX concentration.

As expected, DHFR expression, which was normalized by the B2M housekeeping gene, was steadily increased from the 1st to 3rd cycle, as the HT-29 clones became resistant to MTX (Supplementary Fig. 2a). After confirming these morphological changes and increased DHFR expression, the DHFR copy number was measured in several clones.

The clones’ cycle quantification values (Ct) were used to compute the rate of copy number. These values dropped from 26.04 to 19.99, as the number of cycles increased (Supplementary Table 1). Interestingly, there was a dramatic increase in the DHFR copy number between the 1st (0.97 copies) and 2nd (54.83 copies) cycles (Supplementary Fig. 2b).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

The comparison of detected structural variants between control and MTX resistant HT-29.

Based on these results, we ascertained that a specific time period and set of conditions were required for clones to survive in the presence of MTX, and the amplified DHFR gene was an indicator of MTX resistance, as previously described12. Also, it was confirmed that the increase in DHFR gene expression was not proportional to the increase in DHFR gene copy number in each cycle18.

Validation of gene amplification in MTX-resistant HT-29

After quantifying the DHFR gene copy number, the amplified DHFR gene at the 5q arm was visualized via fluorescent in situ hybridization (FISH) for a MTX-resistant clone (C1-2) and control sample. The DHFR gene region at the 5q arm was abnormally stretched compared to that of the control, at the 2nd and 3rd cycle, as expected (Supplementary Fig. 3).

The amplified DHFR gene patterns of C1-2 cells in metaphase revealed that the amplified region had a painting signal, and FISH signal patterns were highly heterogeneous. A total of 9 signal-amplified patterns and two major patterns accounted for 44% and 28% of abnormally DHFR gene amplified patterns, respectively (Supplementary Fig. 4). Even if the cells suffered from the same condition, the MTX effects and genetic status were different in each cell. This would significantly hinder technical analysis of the alignment and analysis of amplified patterns19, 20. Therefore, we had to optimize homogenously amplified patterns before sequencing by performing a serial clone selection and sub-culturing.

The optimized C1-2 clone (C1-2-4), which was robust in the presence of MTX, was visualized using FISH. Four different types of gene amplification pattern were detected; two patterns had amplified DHFR genes at two q arms (75 % and 12.5 %), one pattern had amplified DHFR genes at three q arms (8.3 %), and one completely lacked amplified DHFR genes at q arms (4.2 %) (Fig. 1). Overall, ~96% of the cells had a DHFR-amplified region.

A majority of these strains was resistant to MTX and had high amplification of DHFR genes at a 5q arm. However, each MTX-resistant C1-2-4 cell had a different DHFR gene amplification pattern and copy number, as well as heterogeneous genetic status in the presence of MTX, even if the clone was generated from a single MTX-resistant cell21.

Additionally, MTX-resistant HT-29 and control samples were karyotyped to accurately detect the amplified region. The C1-2-4 clone had a homogeneously staining region (HSR) at the chromosome 5 q arm, as previously described22, in addition to an abnormal stretch of the 17 q arm. This result confirmed that the aneuploidy and chromosomal instability of cancer samples could cause the many characteristic changes in chromosome copy number and karyotype diversity 23. It was previously found that chromosome 17 q arm amplification could be detected because of the genetic instability in colorectal cancer24.

Analysis of the structural variants and amplification unit

After confirming the HSR on the amplified DHFR region, the five genomic structural variants (deletion, duplication, inverted duplication, translocation, and inversion) and amplified units of MTX-resistant cells were analyzed to identify which genes and structural variants were involved in gene amplification. The log2 ratio of segmented coverage over whole chromosomes between the control and MTX-resistant sample was compared. A high segmented coverage in the MTX-resistant sample, compared to that of the control sample, was observed in chromosome 5 (Fig. 2a). The genes for which the segmented coverage was bigger than 20 were identified and annotated by position to identify the exact amplified region that included the DHFR gene.

Figure 2.
  • Download figure
  • Open in new tab
Figure 2. Detection and characterization of SVs in MTX-resistant HT-29 cells.

(a) The log2 ratio of segmented coverage across all chromosomes was compared between control and MTX-resistant HT-29 cells. The amplified region on 5q in MTX-resistant HT-29 cells is indicated by the blue arrow. (b) Five genomic variants (deletion, duplication, inverted duplication, translocation, and inversion) in MTX-resistant sample and control were analyzed and visualized with a bar graph, wherein the number of variants was counted and compared between the two samples. (c) In the scatter plot, the size of variants (log(size+1)) and depth of split-reads were plotted and compared between control and MTX-resistant HT 29 cells. The group and variant type are delineated by a different shape and color.

The total number of genomic variants in the MTX-resistant sample was bigger than in the control, and the size of variants such as duplicates and inversions was bigger in the MTX-resistant sample, which had a large number of split reads (Fig. 2b and 2c). Additionally, more variants for each structural variant were detected in MTX-resistant HT-29 cells, compared to those of the control.

The novel structural variants on chromosome 5 were selectively compared between control and MTX-resistant HT-29 cells. One duplication (median size of split reads: 371,675), three inversions (median size of split reads: 328,697), and three inverted duplications (median size of split reads: 1,529) were detected, while there were no such variants detected in the control sample (Table 1). Moreover, the number of split reads for HT-29 cells was bigger than the average coverage (10X), and most copy number variation (CNV) categories matched, which indicated that the detected structural variants had been accurately detected (Supplementary Table. 2). Finally, although the number of translocations on chromosome 5 was decreased in MTX-resistant sample, there were more detected inter-chromosomal genomic rearrangements in MTX-resistant HT-29 sample (Supplementary Fig. 5).

The position of the amplified region on the chromosome 5 q arm was between 80M to 83M, which included the DHFR gene as the start-point and the ATP6AP1L gene. The segmented coverage was approximately 197X, between 80.6M and 82.8M (~2.2M) (Fig. 3a). The MTX-resistant sample had an approximately twentyfold longer amplified region than that of the control. This amplified size was inferred from the designed coverage of long read sequencing, which was 10X that of the control sample, but the MTX-resistant sample had abnormally high coverage (~197X) for only this region.

Figure 3.
  • Download figure
  • Open in new tab
Figure 3. Detection of amplified units and structural variants over the amplified region in MTX-resistant HT-29 cells

(a) The segmented coverage and genomic variants were compared and visualized between control and MTX-resistant HT-29 cells across the amplified region on chromosome 5 (80,610,000-82,850,000). (b) The BioNano contig and PacBio long-reads matched up with the reference (hg38); several genomic rearrangements are indicated by blue arrows. (c) The heatmap depicts the gene expression (FPKM) in amplified units between MTX-resistant HT-29 cells and control. High expression in a specific region was indicated by a red arrow. The statistical significance (p-value) was computed using the Mann Whitney test.

Interestingly, there were both forward and reverse strands in the MTX-resistant sample that were not found in the control sample. These had only forward strands and small deletions in our defined amplified unit (Supplementary Fig. 6). Previous studies have shown that forward DNA synthesis is preferred during replication25; however, this result indicated that the replication was performed in the both forward and reverse direction in the MTX-resistant samples. A study by Kimura et al. (2016) found that DNA can be replicated in the reverse direction by a backward enzyme such as Thg1-like proteins (TLPs) to efficiently synthesize both chains26.

Overall, the amplification regions included the 11 genes, from DHFR to ATP6AP1L, which were tandem gene amplifications of this region, which had inversions or inverted duplications at the end of the amplification unit. The tandem repeats of several genes on chr5 (2.2Mbp) were initiated and terminated by inversion on the specific sequence. Such an inversion (chr5:80,618,750-80,631,409) at the start point included the LINC01137, DHFR, CTC325J23.2, and MTRNR2L2 genes (Supplementary Fig. 7).

Optical genome mapping over the amplified region

The region of tandem amplification (80.6Mbp – 82.8Mbp) on chromosome 5 was further analyzed using BioNano genome optical mapping, due to a lack of coverage and mapping over the whole range of the amplified region using other methods. The contigs of genome mapping mapped in complex ways to whole chromosomes, except for the amplified region, with high coverage of approximately 200x (Supplementary Fig. 8). Additionally, the gene-amplified unit had inversions at both the start and endpoints of the amplified region, as expected, but there was a newly identified insertion at the endpoint of the amplified region (Fig. 3b).

These inversions were certainly associated with the amplification mechanism and seemed to assist with and even initiate tandem repeat amplification, as previously reported27. The identified inverted repeat could stimulate the formation of a large DNA palindrome after breakage of an adjacent DNA double-strand28. This suggests that short inversion at the start and endpoints in MTX-resistant HT-29 cells could play an important role in the initiation of gene amplification.

Expression levels of MTX-resistant sample

There was extremely high gene expression within the identified amplification region, which matched with the high coverage of the amplified unit in our long-read sequencing data (Supplementary Fig. 9). The read coverage from long-read sequencing of the amplified region in the MTX-resistant sample was approximately ten times higher than that of the control. Similarly, the log2(FPKM+1) expression level from the DHFR to ATP6AP1L gene in MTX-resistant HT-29 cells was 5x that of control for DHFR and 122x that of control for RASGRF2 (P = 0.0104 via the Mann-Whitney test; Fig. 3c).

Identifying novel mutations and their impacts on gene amplification

In order to find relevant mutations on the amplification mechanism, single-nucleotide variants (SNVs) in both samples were identified using transcriptome sequencing data. There were more total exonic mutations in MTX-resistant HT-29 cells (13,982) compared to those of control sample (13,310) across all chromosomes, and 18 more exonic mutations were detected in MTX-resistant HT-29 cells on chromosome 5 only (Supplementary Tables 3 and 4).

After filtering the synonymous SNVs, there were a few additional non-synonymous mutations (non-synonymous SNVs, frameshift deletions, frameshift insertions, stop-gains, and stop-losses) in MTX-resistant HT-29 cells than those for control, across all chromosomes (Supplementary Fig. 10). Some non-synonymous mutations came from chromosome 5, in which the number and percentage of frameshift insertions noticeably increased in MTX-resistant sample (18.1%) than in control (4.3 %) (Supplementary Fig. 4a). The most frequently inserted nucleotides in frameshift insertions were thymine (73 %) and adenine (23 %).

This might explain why the detected frameshift thymine and adenine insertions in mRNA on chromosome 5 concurrently occurred with DHFR gene amplification, which conferred an ability to survive MTX exposure. Moreover, novel frameshift insertions -- either adenine (A) or thymine (T) -- were located on MSH3 and MSH6 genes as well as PMS1 and PMS2 genes in MTX-resistant HT-29 sample only (Supplementary Table. 5). Expression of these genes, except for MSH3, decreased in the mutated and MTX-resistant HT-29 cells, compared to control sample (Fig. 4b).

Figure 4.
  • Download figure
  • Open in new tab
Figure 4. Comparison of non-synonymous mutations and MMR expression levels.

(a) Non-synonymous mutations in chromosome 5 (non-synonymous SNV, frameshift deletion, frameshift insertion, stop-gain, and stop-loss) were compared between MTX-resistant HT-29 cells and control. Each portion of the four nucleobases in the frameshift insertions was indicated in the MTX-resistant HT-29 cells. (b) Gene expression (FPKM) of mutS homologs and mutL homologs was computed and compared. The change and difference between MTX-resistant HT-29 cells (M) and control (C) are indicated below the bar graph. (c) The top 10 enriched KEGG gene sets for up-regulated and down-regulated differentially expressed genes were visualized via an enrichment score (-log(qvalue)).

The MSH3 and MSH6 genes belong to mismatch repair (MMR) 29 gene families and are known to play an important role in repairing DNA for cell division as well as cooperatively suppressing intestinal tumors30. MLH genes (PMS1) as well as the MSH genes (MSH1 - 6) are significantly correlated with colon cancer; mutations in these genes could cause predisposition and susceptibility to Lynch syndrome, in conjunction with colon cancer31, 32, 33.

Therefore, novel frameshift insertions in these genes could prevent mismatch repair functionality and tumor suppression in the presence of MTX, and stimulate the rapid progression of gene amplification and MTX-resistance. The molecular explanation for this tandem-gene amplification mechanism could be a malfunction of MMR pathways34. It was previously known that MSH3 is concurrently amplified with the DHFR gene due to their proximity to each other in MTX-resistant cells. An imbalance in expression of mutS homologs could result in the malfunctioning of base-base mismatch repair and cause genetic instability as well as confer resistance to the cytotoxic effects of MTX35. Also, chromothripsis events by massive clustered genomic rearrangements might contribute to tandem gene amplification and inactivation of MMR genes under MTX condition36.

Identifying differentially expressed genes (DEGs)

To determine which genes and gene sets are involved in MTX-resistance, we analyzed differentially expressed genes and enriched gene sets (Fig. 4c). A total of 383 up-regulated and 287 down-regulated DEGs were identified in MTX-resistant HT-29 cells and compared to those of control sample (Supplementary Table. 6).

Through enrichment analysis of KEGG gene sets, the up-regulated DEGs included IL1B, MAPK11, JUN, MAP3K8, IL8,and CASP1 in MTX-resistant HT-29 cells. The affected signaling pathways included MAPK, toll-like receptor, and NOD-like receptor signaling. Interestingly, the down-regulated DEGs such as MAD2L1, CCNA2, MCM2, MCM4, FEN1, and CDK6 were enriched in DNA replication, tyrosine metabolism, and cell cycle pathways, which are commonly up-regulated in colon cancer37.

As previously reported, the down-regulated DEGs of MTX-resistant osteosarcoma cell lines were enriched in the mitotic cell cycle, cell cycle, and DNA replication pathways, which were also down-regulated in our MTX-resistant colon cancer cells. This result might explain the role of MTX in inhibiting dihyrofolate reductase (DHFR) and preventing tumor cells from proliferating in both cases38, 39. In addition, down-regulated expression of MMR genes, which affects G2/M cell cycle arrest and apoptosis, might prevent proper checkpoint and cell-death signaling; therefore, this phenomenon could contribute to the malfunctioning of DNA replication40.

Still, this does not explain which mechanism is associated with DHFR gene amplification in MTX-resistant colon cancer cells and cannot be used to identify MTX targets or other intracellular pathways and folate metabolism markers41.

Chromosomal interactions and their topologically associated domains

Genome-wide Intra-chromosomal interactions were identified and compared between MTX-resistant HT-29 cells and control, at 5kb resolution (Supplementary Fig. 11). A high interaction frequency was apparent due to a clear long red line on the amplified region only (5q14.1 to 5q14.2) in the MTX-resistant HT-29 cells. This interaction pattern was similar to those observed when amplification occurred in tumor samples in a previous study42(Supplementary Fig. 12).

The topologically associating domains (TADs) and several chromosomal rearrangements at chromosome 5 were identified to visualize conformations and interactions on intra or inter-chromosomes within the amplified region, as well as to detect unforeseen chromosomal rearrangements at 500kb resolution (Fig. 5a). Frequent intra-chromosomal interactions were observed in the amplified region (chr 5:80.6M-82.8M), and there were several newly identified TADs in the middle and endpoint of this region compared to those of control with high adjusted interaction frequencies (adjusted M) and an adjusted p-value < 0.05 (Supplementary Table 7).

Figure 5.
  • Download figure
  • Open in new tab
Figure 5. Topologically associating domains (TADs) on chromosome 5 and adjusted interaction frequencies.

(a) Intra-chromosomal interactions at chromosome 5 (MTX resistant HT-29 – control) are visualized with the coverage and eigenvectors (left), and the newly identified topologically associating domains (TADs) on the duplicated region are indicated by a yellow box, at 500kb resolution (right). (b) The difference in adjusted interaction frequencies (adjusted M) between MTX-resistant and control samples is plotted on a –log2 scale (p-value). Statistical significance (p-value=0.05) is indicated by the red line.

Interestingly, interactions were also frequent within this region between 109Mbp to 138Mbp, and more TADs and high eigenvector values were observed. These delineated compartmentalization in Hi-C43 and indicated that both the amplified region and an adjacent region had both frequent intra-chromosomal interactions and frequent contacts, and it seemed that the entire 5q region upregulated the amplification mechanism.

In order to compare the computed intra-chromosomal interactions between MTX-resistant HT-29 cells and control samples, the difference in adjusted interaction frequencies (adjusted M), with a p-value at 500Kb resolution, was analyzed, and the differentially interacting genomic regions at chromosome 5 were identified in the amplified region (p-value < 0.05; Fig. 5b). The adjusted M values were lower in the region from 109Mbp to 138Mbp than those in the amplified region, but this region still had a significant p-value and higher interactions compared to those in other positions. Therefore, the chromosomal structure, from the start position of the amplified unit (80Mb) to its endpoint on 5q, could have a complex network of spatial contacts to harbor gene amplification.

The relative copy number in each chromosome was estimated from Hi-C data, using chromosome 2 as a reference. Specifically, the relative copy number in chromosome 5 was significantly higher in MTX-resistant HT-29 cells compared to control, over whole chromosomes except for chromosome 17, which also had a stretched structure like chromosome 5 (Supplementary Fig. 13).

In addition, chromosomal rearrangements such as those that would result in DMs only were extrachromosomal DNA stretches that harbored the amplification of oncogenes that were involved in drug resistance. These rearrangements were detected in the amplified region via Hi-C data, and were not observed in the FISH data. Hi-C data was utilized because it could reveal unforeseen chromosomal rearrangements and copy numbers in highly amplified regions. However, this result should be confirmed using other techniques, since it was not clear how to distinguish between DMs and HSRs due to their similar structure.

Overall, tandem gene amplifications of several genes on chr5 (2.2Mbp) in long read sequencing data were confirmed via gene expression and gene mapping as well as HI-C data. More complicated interactions on intra-chromosome 5 can be critical factors in identifying the hotspots of spatial contacts across an amplified region.

The mechanisms behind tandem gene amplification

We have developed experimental and computational workflows to detect and analyze the mechanisms behind gene amplification in MTX-resistant HT-29 cells through breakage-fusion-bridge (BFB) cycles, as previously reported44 (Fig. 6 and Supplementary Fig. 14). Before gene amplification occurred, frameshift insertions in MSH and MLH genes across several chromosomes were caused by MTX toxicity, which resulted in a genetic predisposition and dysregulation of mismatch repair pathways in the presence of MTX16, 45.

Figure 6.
  • Download figure
  • Open in new tab
Figure 6. The mechanism of tandem gene amplification in the presence of MTX.

Frameshift insertions at the family of mismatch repair genes (mutS and mutL homologs) can initiate the dysregulation of mismatch repair pathways. A chromosomal breakage event occurs after the emergence of an inversion at DHFR. Those that occurred between DHFR and ATP6AP1L (2.2 Mb) were involved in producing the amplified unit. Finally, a variable size of tandem amplifications can be achieved via breakage fusion-bridge (BFB) cycles.

After malfunctioning in mismatch repair mechanisms in the presence of MTX, chromosomal breakages occurred at the DHFR gene on chromosome 5, because of the inverted repeat at the start position of the gene. Additionally, inverted repeats from the DHFR gene to ATP6AP1L (2.2 Mb) were involved in producing the amplified unit. The endpoint of the amplified unit had the same inverted repeat, which could indicate the stop position for gene amplification. However, it is still unknown how the specific genes were selected and involved in gene amplification mechanisms.

Finally, variable sizes of gene amplification with HSRs could be produced via BFB cycles, and the unstable HSRs could be occasionally transformed into either a different size of HSR fragment or DMs, accompanying inversions at the endpoints. Overall, the co-amplification of MSH3 and DHFR as well as frameshift mutations in MSH and MLH continuously affected genetic instability and enhanced the resistance to methotrexate.

Discussion

Cancer cells grown in the presence of anti-cancer drugs occasionally have many copies of specific genes, but the molecular mechanisms behind such adaptations are as yet unknown, since analyzing repetitive DNA sequences is limited by many technical difficulties and in evaluating alignment of NGS reads and data processing46,47. Recent developments in NGS technologies have enabled large reads (~10Kb) and are able to generate enormous amounts of genomic information at base-level resolution. Such detailed data can facilitate analysis of unknown regions, which is not possible using other technologies48, 49.

Here, complicated cancer genomics and abnormal structures were detected and analyzed using a combination of advanced technologies including the PacBio SMRT, optical mapping, and Hi-C analysis, as well as short-read sequencing and FISH. An integrative framework was also used to detect and decipher structural variations in MTX-resistant HT-29 cells. This made use of a large repeat and complex DNA segments. Finally, the amplified region was characterized and evaluated by size and relevant genetic defects, and potential gene amplification mechanisms were suggested.

In order to analyze complicated repetitive sequences involved in gene amplification in the presence of MTX, several MTX-resistant clones were selected and generated from a HT-29 cell line by increasing MTX concentration in a MTX sensitization study. When an amplified DHFR gene from several clones was detected via FISH, the cells in each clone had a very large variation of the 5q12 region. DHFR was amplified in a small portion of cells, given that tumor cells responded heterogeneously to MTX. Regardless, the effects of the drug would be dissimilar, even if the cells were grown in the same conditions50, 51. This could lead to bioinformatics difficulties in finding accurate amplified patterns and repetitive sequences in previous as well as future studies.

In order to overcome the limited analysis provided by heterogeneously amplified patterns, DHFR amplified patterns in specific clones were optimized via serial dilution and repeated single-cell selection to generate homogenous amplified patterns. Finally, an optimized clone was obtained that had 96% DHFR-amplified cells. Approximately 75% of the cells in the clone converged to a homogeneous DHFR amplification pattern.

After optimizing a resistant clone, the amplified unit and tandem gene amplification of 11 genes, from the DHFR gene to the ATP6AP1L gene on chr5 (2.2Mbp), were identified through the long-range genomic information from long-read sequencing and optical genome mapping. The amplified unit had high coverage (~197X) compared to control (~10X). This implied that the amplified region was approximately twenty-fold tandemly amplified compared to the original sequence, which was confirmed from the high gene expression pattern and alternatively splicing patterns. Gene expression on the amplified unit was highly overexpressed, going from a 5-fold change to a 122-fold change compared to that of the control. These variable expression patterns could be affected by the complexity of splicing patterns, even if these genes were amplified simultaneously.

Furthermore, an inversion at both the start and endpoints of the amplified unit was detected in long-read sequencing data, and this structure variant was cross-validated via optical genome mapping. Previous studies have also shown that an inversion in gene amplification could lead to the initiation of gene amplification by stimulating chromosomal breakage. In addition, such a phenomenon could represent a transformation from circular DNA segment DMs to intra-chromosomal HSRs52, 53.

Using Hi-C analysis, we identified high intra-chromosomal interactions on the amplified region and significant interactions at the novel TADs. Long-range chromosomal interactions, from 109Mbp to 138Mb more than those of the amplified region, were also unexpectedly detected. The amplified unit in the Hi-C analysis exactly matched that of the long-range genomic information and RNA sequencing results. This could explain why the amplified position was compactly packed and had frequent contact with each other, which could represent a chromatin architecture for gene amplification54.

We also found that the copy number in chromosome 5 was significantly higher in MTX-resistant HT-29 than that of control. Moreover, DMs, in the form of extrachromosomal DNA and another type of chromosomal abnormality, were detected in the amplified region and not in FISH analyses. This could indicate that this structure exists on highly compacted chromosome positions and harbors the amplification of several genes by generating a new chromosomal structure DM and dynamically converting it to another type of chromosomal HSR, which was previously described in neuroblastoma study55. This could explain the involvement of chromosomal abnormalities in MTX resistance. However, the mechanism involved in the conversion of HSRs between DMs could not be explained because of the unknown time point of the transition between two structures, its low coverage (10X) in sequencing data, and a large variation in amplification size.

Taken together, these findings suggest that DHFR could not be the only target for a MTX-associated mechanism, since 11 unsuspected genes were tandemly amplified between DHFR and ATP6AP1L, and the expanded region on the 5q arm over the amplified region interacted with other amplified genes each other. The highest gene expression in the amplified unit was RASGRF2, which was a 122-fold increase over control. Conversely, DHFR underwent a five-fold increase over control. Therefore, more investigations are needed to target and assess the role of RASGRF2 in MTX related gene amplification mechanism, which might utilize the promoter for DHFR to obtain its high efficiency in the presence of MTX.

In addition, novel frameshift insertions in MSH and MLH were identified in the MTX-resistant sample, which could play an important role in the rapid progression of gene amplification and MTX resistance. In this vein, the microsatellite instability (MSI), i.e., the effect of deficient DNA mismatch repair in colon cancer, was tested to evaluate the status of MMR in MTX-resistant HT-29 cells. There was no significant difference between MTX-resistant cells and control56.

This implied that MSI could not affect genetic instability and the entire MMR system over whole chromosomes; whereas, MTX toxicity could induce mutations on MMR genes as well as genetic predisposition on chromosome 5 in MTX-resistant HT-29 cells by inserting adenine or thymine nucleotides on MSH and MLH genes, thereby enabling the harboring of gene amplification. This points to a possible tandem gene amplification mechanism that progresses through breakage-fusion bridge cycles.

Further studies are needed to identify whether a frameshift insertion in MSH3 might be caused by co-amplification of DHFR and other frameshift insertions that result in the malfunctioning of mutS homologs and hyper-mutability in the presence of significant MTX57, 58. However, frameshift insertions in MSH and MLH could not be generated by the gene amplification process, since all mutated genes except for MSH3 are located outside of the amplified unit. Therefore, the frameshift mutations in each gene should have had a different cause in the presence MTX toxicity as well as different impacts on MTX-resistant HT-29 cells.

In addition, mismatch repair location is recognized by two mutS homologs such as mutS-alpha (MSH2 and MSH6), which is known for repair of single-nucleotide mismatches, and muS-beta (MSH2 and MSH3), which is known as a repair mechanism for large indels59, 60. Such mutS homologs should need mutL homologs (MLH1 and PMS2) for binding to a recognition site.

Therefore, mutations in mutS and mutL homologs could prevent repair from a single-nucleotide mismatch and large indels in MTX-resistant HT-29 cells. The prevention of mutations and inversions in mutS and mutL homologs could increase cells’ sensitivity to MTX and inhibit gene amplification. In the future, methods of preventing these mutations and structural variants could be developed to overcome drug-resistance and guide optimal clinical use of such drugs.

Most of all, additional validation for the identified SVs and repetitive sequences is required to interpret the entire sequence of the gene amplification mechanism and its impact on MTX resistance. This information will provide a clue to how cancers adapt to drugs. In this vein, inversions at the start and end positions on the amplified unit affect chromosomal breakage and formation of DMs from the repetitive sequence. Additionally, the effects of MTX-associated mechanisms on frameshift insertions in MMR, MLH, and inversions along the amplified region should be described and validated through a comprehensive analysis of the involved genes and pathways, Although several limitations exist in this study, our findings may give new insight into the mechanisms underlying the amplification process and evolution of MTX-resistance in colon cancer and leukemia. Moreover, the use of Hi-C data to detect unforeseen chromosomal rearrangements such as DMs and HSRs has been shown to be promising for future analyses.

Conclusions

Complex NGS technologies can be used to identify complicated genomic sequences and novel structural variants that are difficult to detect with other technologies. Whenever possible, a comprehensive suite of tools and techniques is required to interpret repetitive sequences and structural variations in gene amplification. This will help to identify the most important therapeutic mechanisms and new targets of anti-cancer drugs, which affect various intracellular pathways across many time and length scales.

Finally, our findings will bolster clinical cancer studies and inform diagnoses as well as the management and treatment of various cancers, and provide in-depth guidance toward pharmacologic targets for anti-cancer drugs as well as personalized medicine.

Methods

Preparation and sensitization studies for MTX-resistant cancer cells

After curating a list of cancer cell lines based on previous studies, we targeted and maintained human colon adenocarcinoma HT-29 cell lines from the Korean Cell Line Bank (KLCL) to develop MTX-resistant cancer cells, as described previously61. HT-29 was chosen because it can be engineered to grow in high concentrations of MTX via the amplification of DHFR.

To generate MTX-resistant HT-29 cells, we optimized the MTX concentration and graded concentrations (from 10−8 to 10−6 M) in five T25 flasks per cell line, using RPMI media supplemented with 10% fetal bovine serum. The MTX solution involved mixing 100mg MTX powder with 1.967ml DMSO. This mixture was then aliquoted at 100 µl/ml and kept at −20℃. A limited number of resistant cells (~3×105) from each T25 flask was plated in Petri dishes in the same culture medium and MTX concentration.

We then used 3.2mm-diameter cloning discs (Sigma) for 3 isolate clones (C1-2,C4-3,C8-22) from each Petri dish, which were grown up in another T25 flask under the same conditions. Subsequently, MTX-resistant clones were passaged 40 times, without MTX. For the second and third treatment cycles, 3×105 clones and parental HT-29 cells were added and cultured in 25 cm2 flasks. A stepwise treatment of MTX concentration from 10−8 mol/L to 10−6 mol/L was applied; MTX-resistant cells were grown in the 10−6 mol/L MTX concentration.

Detection of gene amplification

The copy number of MTX-resistant clones was measured via TaqMan Copy Number Reference Assays and Viia7 technology to select clones with highly amplified DHFR genes. The copy number of the DHFR gene was estimated via relative quantitation (RQ), using the comparative Ct method, and computed using the Ct difference (delta Ct) between MTX-resistant samples and reference samples (Hapmap NA19982). Then, the ∆Ct values of MTX-resistant samples were compared to a Hapmap reference sample known to have two copies of DHFR, such that they were two times the relative quantity of the reference. The following three equations were used to compute the copy number from the Ct value. Embedded Image Embedded Image Embedded Image

The samples with a high DHFR copy number were selected for fluorescence in situ hybridization (FISH) and karyotyping to visualize and map DHFR, and detect chromosomal abnormalities. The DHFR red signal and 5p12-green signal were counted at anaphase and metaphase, with 1000x magnification and a triple (RGB) filter.

RNA-seq analysis and transcriptome profiling

RNA-seq for MTX-resistant HT-29 cells and control samples was performed to investigate expression profiling. Total RNA, with ribosomal RNA removed, was prepared and sequenced using Illumina HiSeq (depth 100X), as previously described62. The obtained reads were mapped to the human reference genome (GRCh38) using the Spliced Transcripts Alignment to a Reference (STAR) tool to produce analysis-ready BAM files. We followed the key principles of processing and analysis steps from the GATK website. The mapped reads (BAM file) were visualized via SeqMonk.

To estimate the expression of each gene, raw reads were counted using the HTSeq-count tool and normalized to variance stabilizing data (VSD) expression via the R package DEseq2. Fragments per kilobase million (FPKM) values were calculated using the R package edgeR, and converted to log2 values. The median-centered gene expression was computed from FPKM expression, using Cluster 3.0 software, which subtracts the row-wise median from the expression values in each row. The median-centered VSD and FPKM expression were visualized in a heatmap.

Variant discovery analysis

Variant calling was performed on transcriptome datasets. The duplicated sites from analysis-ready BAM files were filtered with Picard, and variants were called and filtered by removing spurious and known RNA-editing sites, in VCF format. The genomic variants were compared between control and MTX-resistant HT-29 sample. We performed the variant discovery analyses by observing the step-by-step recommendations provided by the Genome Analysis Toolkit (GATK) to obtain high-quality variants63.

In order to determine the exact SNPs from the call set, the variants were filtered out according to several conditions. First, the cutoff for quality-by-depth (QD) was 3.0, which is the variant confidence score divided by the unfiltered depth of coverage. Variants were filtered out if they had a score less than 3.0. Second, the variants were filtered out when the Fisher Strand (FS) score was >30.0, which indicated the Phred-scaled p–value using Fisher’s Exact Test for detecting strand bias64. The identified and filtered variants were annotated using RefSeq genes and the ANNOVAR tool.

Differentially expressed gene analysis

Differentially expressed genes between MTX-resistant HT-29 cells and control samples were analyzed using the R packages DESeq2 and edgeR (P-value□<□0.05, |Log2 (fold change)| ≥ 1, and baseMean ≥ 100). The differentially expressed genes were determined for enrichment analysis with KEGG gene sets via Gene Set Enrichment Analysis (GSEA).

Alternative splicing event analysis

The exon inclusion levels, defined with junction reads from RNA sequencing results, were subsequently processed via rMATS.3.2.565. Five different types of alternative splicing event (SE:Skipped exon, MXE: Mutually exclusive exon, A5SS: Alternative 5’ splice site, A3SS: Alternative 3’ splice site, RI: Retained intron) were identified in both MTX-resistant HT-29 cells and control samples. The number of significant events was detected using both junction counts and on-target reads.

PacBio long-read sequencing analysis

Genomic DNA was extracted from MTX-resistant HT-29 cells and control samples using the Gentra Puregene Cell kit (Qiagen) and the library for PacBio. The PacBio long-reads were aligned to the human genome (version GRCh38) with BWA-mem aligner, and the pre-processing pipeline on the BWA-mem website was followed to ensure technical and biological quality results. The read depth was estimated via the Depth-of-Coverage option in GenomeAnalysisTK. Coverage differences between MTX-resistant HT-29 cells and control bigger than 10 were selected for determining the amplified region.

Detection of genomic variants and amplification units

The structural variants (deletion, duplication, inverted duplication, translocation, and inversion) in MTX-resistant HT-29 cells and control samples were analyzed using Sniffles from sorted PacBio output derived from BWA-MEM66. The BAM files were converted to binned copy numbers across a genome using Copycat. The genomic rearrangements were visualized from VCF files (Sniffles) and read coverage files (Copycat) via SplitThreader (http://splitthreader.com/)67.

Long-read sequencing and optical genome mapping

Optical mapping of the PacBio assembly data was performed using the BioNano Assembler software (Irys System, BioNano Genomics) to get accurate sequences. High molecular weight DNA was isolated using the IrysPrep Plug Lysis Long DNA Isolation Protocol (Bionano Genomics). In brief, cells were trypsinized, washed in FBS/PBS, counted, rewashed in PBS, and embedded in agarose plugs using components from the Bio-Rad Plug Lysis Kit. The plugs were subjected to Proteinase K digestion (2 × 2 h at RT). After a series of washes in buffer from the Bio-Rad kit, followed by washes in TE (Tris-EDTA), the plugs were melted and treated with GELase enzyme (Epicenter).

The high molecular weight DNA was released and subjected to drop dialysis. The DNA was left to equilibrate for 4 d, then quantified using the Qubit Broad Range dsDNA Assay Kit (Thermo Fisher Scientific). Using the IrysPrep NLRS assay (Bionano Genomics), 200–300 ng/µL of high molecular weight DNA underwent single-strand nicking with 10 units of Nt.BspQI nickase (New England BioLabs). Nicked sites were repaired with fluorophore-labeled nucleotide to restore strand integrity.

The backbone of fluorescently labeled double-stranded DNA was stained with the intercalation dye YOYO-1. Labeled molecules were directly loaded onto IrysChip, without further fragmentation or amplification, and imaged using the Irys instrument. Multiple cycles were performed to reach an average raw genome depth-of-coverage of 50x. Additionally, the tandem repeats in the amplified region in both the assembly and raw data were identified using IrysView 2.0 software.

Hi-C data analysis

Approximately 50 million MTX-resistant HT-29 cells and control cells were used for high-throughput chromatin conformation capture (Hi-C) data sets. We generated 2 Hi-C libraries using the HindIII restriction enzyme, following a previously established protocol68. In brief, the Hi-C protocol involves crosslinking cells with formaldehyde, permeabilizing them with nuclei intact, digesting DNA with a suitable restriction enzyme, filling the 5’-overhangs while incorporating a biotinylated nucleotide, ligating the resulting blunt-end fragments, shearing the DNA, capturing the biotinylated ligation junctions with streptavidin beads, and analyzing the resulting fragments with paired-end sequencing via Hi-Seq2000.

The HiC data (fastq files) was processed to normalized contact matrices using HiC-Pro version 2.10.069. The pipeline was based on the bowtie 2 aligner and the selected restriction enzyme (HindIII) was used to generate normalized contact maps, as described in the HiCPro pipline (https://github.com/nservant/HiC-Pro). Each aligned read was assessed to determine the valid interactions and control quality by excluding invalid ligation products and duplicated valid pairs. The aligned Hi-C sam file were converted into the HiCnv format, which calls CNVs from Hi-C data70. Also, inter-chromosomal translocations and their boundaries were detected using HiCtrans from a Hi-C matrix file. The list of valid interaction output files called by HiC-pro was converted to a Juicebox input file and visualized using Juicebox (https://github.com/theaidenlab/juicebox/wiki).

The R package HiCcompare was used to detect differential spatial chromatic interactions, on a genome-wide scale, between control and MTX-resistant HT-29 cells71. Using this package, the adjusted interaction frequencies after adjustment of joint-normalization functions with the adjusted p-values and filtered low-average expression were represented after multiple testing correction.

Statistical test

All statistical analyses were performed using R-3.3.0. The gene expression between MTX-resistant HT-29 sample and control (HT-29) was compared, and the p-value was indicated using the unpaired Student’s t-test or Mann-Whitney U-test, based on the Shapiro-Wilk normality test. P-values less than 0.05 were considered to be statistically significant.

Data availability

PacBio long read sequencing and whole transcriptome sequencing data are available under the NCBI Sequence Read Archive (SRA) accessions no. SRPXXXXX

Contributions

J.-S.S. and J.-Y.S. conceived and designed the experiments. A.K. conducted the experiment by preparing the MTX resistant HT-29 sample and extracting DNA and RNA. J.-Y.S. performed PacBio long read Seq and whole transcriptome Seq experiments. A.K. performed sequencing data processing, bioinformatics and statistical analyses. J.-S.S., A.K. and J.-Y.S wrote the manuscript

Ethics declarations

Competing Interests

No potential conflicts of interest relevant to this article were disclosed.

Acknowledgements

This work has been supported by Macrogen Inc. (grant no. MGR17-04)

Footnotes

  • ahreumkim88{at}snu.ac.kr +82-31-600-3001, jongyeon.anna{at}gmail.com +82-31-600-3001, jeongsun{at}snu.ac.kr +82-31-600-3001

Abbreviations

A3SS
Alternative 3’ splice site
A5SS
Alternative 5’ splice site
BAM
Binary alignment map
BFB
Breakage-fusion-bridge
CNV
Copy number variation
DEG
Differentially expressed gene
DEL
Deletion
DHFR
Dihyrofolate reductase
DUP
Duplication
FDR
False discovery rate
FISH
Fluorescent In Situ Hybridization
FPKM
Fragment per kilo base per million
GATK
Genome analysis toolkit
GSEA
Gene set enrichment analysis
Hi-C
High throughput chromosome conformation capture
IGV
Integrative genomics viewer
INV
Inversion
INVDUP
Inverted duplication
KEGG
Kyoto encyclopedia of genes and genomes
Lsign
Sum of the sign of the entries in the lower triangle
Lvar
the variance of the lower triangle
MTX
Methotrexate
MXE
Mutually exclusive exon
ncRNA
non-coding RNA
NGS
Next generation sequencing
PacBio
Pacific Bioscience
RI
Retained intron
RNA-seq
RNA sequencing
SE
Skipped exon
SMRT
Single molecule real-time
SNV
Single nucleotide variation
Score
Conner score
STAR
Spliced transcripts alignment to a reference
SV
Structural variation
TAD
Topologically associating domains
Usign
Sum of the sign of the entries in the upper triangle
Uvar
The variance of the upper triangle
VSD
Variance stabilizing data
WGS
Whole-genome sequencing

References

  1. 1.↵
    Schwab M. Oncogene amplification in solid tumors. Semin Cancer Biol 9, 319–325 (1999).
    OpenUrlCrossRefPubMedWeb of Science
  2. 2.↵
    Marotta M, et al. A common copy-number breakpoint of ERBB2 amplification in breast cancer colocalizes with a complex block of segmental duplications. Breast Cancer Res 14, R150 (2012).
    OpenUrlCrossRefPubMed
  3. 3.↵
    Mathew P, et al. Detection of MYCN gene amplification in neuroblastoma by fluorescence in situ hybridization: a pediatric oncology group study. Neoplasia 3, 105–109 (2001).
    OpenUrlCrossRefPubMedWeb of Science
  4. 4.↵
    Kaufman RJ, Schimke RT. Amplification and loss of dihydrofolate reductase genes in a Chinese hamster ovary cell line. Mol Cell Biol 1, 1069–1076 (1981).
    OpenUrlAbstract/FREE Full Text
  5. 5.↵
    Ohta JI, et al. Fluorescence in situ hybridization evaluation of c-erbB-2 gene amplification and chromosomal anomalies in bladder cancer. Clin Cancer Res 7, 2463–2467 (2001).
    OpenUrlAbstract/FREE Full Text
  6. 6.↵
    Dacic S, et al. Significance of EGFR protein expression and gene amplification in non-small cell lung carcinoma. Am J Clin Pathol 125, 860–865 (2006).
    OpenUrlCrossRefPubMed
  7. 7.↵
    Kamel HFM, Al-Amodi H. Exploitation of Gene Expression and Cancer Biomarkers in Paving the Path to Era of Personalized Medicine. Genomics Proteomics Bioinformatics 15, 220–235 (2017).
    OpenUrlCrossRef
  8. 8.↵
    Schimke RT. Gene amplification in cultured animal cells. Cell 37, 705–713 (1984).
    OpenUrlCrossRefPubMedWeb of Science
  9. 9.↵
    Kang JU. Characterization of amplification patterns and target genes on the short arm of chromosome 7 in early-stage lung adenocarcinoma. Mol Med Rep 8, 1373–1378 (2013).
    OpenUrl
  10. 10.↵
    Treangen TJ, Salzberg SL. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet 13, 36–46 (2011).
    OpenUrlCrossRefPubMed
  11. 11.↵
    Schneeberger K, et al. Simultaneous alignment of short reads against multiple genomes. Genome Biol 10, R98 (2009).
    OpenUrlCrossRefPubMed
  12. 12.↵
    Brown PC, Tlsty TD, Schimke RT. Enhancement of methotrexate resistance and dihydrofolate reductase gene amplification by treatment of mouse 3T6 cells with hydroxyurea. Mol Cell Biol 3, 1097–1107 (1983).
    OpenUrlAbstract/FREE Full Text
  13. 13.↵
    Goker E, et al. Amplification of the dihydrofolate reductase gene is a mechanism of acquired resistance to methotrexate in patients with acute lymphoblastic leukemia and is correlated with p53 gene mutations. Blood 86, 677–684 (1995).
    OpenUrlAbstract/FREE Full Text
  14. 14.↵
    Morales C, Ribas M, Aiza G, Peinado MA. Genetic determinants of methotrexate responsiveness and resistance in colon cancer cells. Oncogene 24, 6842–6847 (2005).
    OpenUrlCrossRefPubMed
  15. 15.↵
    Meng X, et al. Novel role for non-homologous end joining in the formation of double minutes in methotrexate-resistant colon cancer cells. J Med Genet 52, 135–144 (2015).
    OpenUrlAbstract/FREE Full Text
  16. 16.↵
    Singer MJ, Mesner LD, Friedman CL, Trask BJ, Hamlin JL. Amplification of the human dihydrofolate reductase gene via double minutes is initiated by chromosome breaks. Proc Natl Acad Sci U S A 97, 7921–7926 (2000).
    OpenUrlAbstract/FREE Full Text
  17. 17.↵
    Hayes M, Li J. An integrative framework for the identification of double minute chromosomes using next generation sequencing data. BMC Genet 16 Suppl 2, S1 (2015).
    OpenUrl
  18. 18.↵
    Feder JN, Assaraf YG, Seamer LC, Schimke RT. The pattern of dihydrofolate reductase expression through the cell cycle in rodent and human cultured cells. J Biol Chem 264, 20583–20590 (1989).
    OpenUrlAbstract/FREE Full Text
  19. 19.↵
    Baumer C, Fisch E, Wedler H, Reinecke F, Korfhage C. Exploring DNA quality of single cells for genome analysis with simultaneous whole-genome amplification. Sci Rep 8, 7476 (2018).
    OpenUrl
  20. 20.↵
    Ulahannan D, Kovac MB, Mulholland PJ, Cazier JB, Tomlinson I. Technical and implementation issues in using next-generation sequencing of cancers in clinical practice. Br J Cancer 109, 827–835 (2013).
    OpenUrlCrossRefPubMedWeb of Science
  21. 21.↵
    Schildhaus HU, et al. Definition of a fluorescence in-situ hybridization score identifies high- and low-level FGFR1 amplification types in squamous cell lung cancer. Mod Pathol 25, 1473–1480 (2012).
    OpenUrlCrossRefPubMedWeb of Science
  22. 22.↵
    Schimke RT. Gene amplification and drug resistance. Sci Am 243, 60–69 (1980).
    OpenUrlPubMed
  23. 23.↵
    Potapova TA, Zhu J, Li R. Aneuploidy and chromosomal instability: a vicious cycle driving cellular evolution and cancer genome chaos. Cancer Metastasis Rev 32, 377–389 (2013).
    OpenUrlCrossRefPubMed
  24. 24.↵
    Ashktorab H, Schaffer AA, Daremipouran M, Smoot DT, Lee E, Brim H. Distinct genetic alterations in colorectal cancer. PLoS One 5, e8879 (2010).
    OpenUrlCrossRefPubMed
  25. 25.↵
    Perera L, Freudenthal BD, Beard WA, Shock DD, Pedersen LG, Wilson SH. Requirement for transient metal ions revealed through computational analysis for DNA polymerase going in reverse. Proc Natl Acad Sci U S A 112, E5228–5236 (2015).
    OpenUrlAbstract/FREE Full Text
  26. 26.↵
    Kimura S, et al. Template-dependent nucleotide addition in the reverse (3’-5’) direction by Thg1-like protein. Sci Adv 2, e1501397 (2016).
    OpenUrlFREE Full Text
  27. 27.↵
    Reams AB, Roth JR. Mechanisms of gene duplication and amplification. Cold Spring Harb Perspect Biol 7, a016592 (2015).
    OpenUrlAbstract/FREE Full Text
  28. 28.↵
    Tanaka H, Tapscott SJ, Trask BJ, Yao MC. Short inverted repeats initiate gene amplification through the formation of a large DNA palindrome in mammalian cells. Proc Natl Acad Sci U S A 99, 8772–8777 (2002).
    OpenUrlAbstract/FREE Full Text
  29. 29.↵
    Miller A, et al. High somatic mutation and neoantigen burden are correlated with decreased progression-free survival in multiple myeloma. Blood Cancer J 7, e612 (2017).
    OpenUrlCrossRef
  30. 30.↵
    Edelmann W, et al. The DNA mismatch repair genes Msh3 and Msh6 cooperate in intestinal tumor suppression. Cancer Res 60, 803–807 (2000).
    OpenUrlAbstract/FREE Full Text
  31. 31.↵
    Karahan B, Argon A, Yildirim M, Vardar E. Relationship between MLH-1, MSH-2, PMS-2, MSH-6 expression and clinicopathological features in colorectal cancer. Int J Clin Exp Pathol 8, 4044–4053 (2015).
    OpenUrlPubMed
  32. 32.↵
    Edelmann W, et al. Mutation in the mismatch repair gene Msh6 causes cancer susceptibility. Cell 91, 467–477 (1997).
    OpenUrlCrossRefPubMedWeb of Science
  33. 33.↵
    Silva FC, Valentin MD, Ferreira Fde O, Carraro DM, Rossi BM. Mismatch repair genes in Lynch syndrome: a review. Sao Paulo Med J 127, 46–51 (2009).
    OpenUrlCrossRefPubMedWeb of Science
  34. 34.↵
    Kariola R, Raevaara TE, Lonnqvist KE, Nystrom-Lahti M. Functional analysis of MSH6 mutations linked to kindreds with putative hereditary non-polyposis colorectal cancer syndrome. Hum Mol Genet 11, 1303–1310 (2002).
    OpenUrlCrossRefPubMed
  35. 35.↵
    Drummond JT, Genschel J, Wolf E, Modrich P. DHFR/MSH3 amplification in methotrexate-resistant cells alters the hMutSalpha/hMutSbeta ratio and reduces the efficiency of base-base mismatch repair. Proc Natl Acad Sci U S A 94, 10144–10149 (1997).
    OpenUrlAbstract/FREE Full Text
  36. 36.↵
    Cortes-Ciriano I, et al. Comprehensive analysis of chromothripsis in 2,658 human cancers using whole-genome sequencing. Nat Genet, (2020).
  37. 37.↵
    Guo Y, Bao Y, Ma M, Yang W. Identification of Key Candidate Genes and Pathways in Colorectal Cancer by Integrated Bioinformatical Analysis. Int J Mol Sci 18, (2017).
  38. 38.↵
    Yang XR, Xiong Y, Duan H, Gong RR. Identification of genes associated with methotrexate resistance in methotrexate-resistant osteosarcoma cell lines. J Orthop Surg Res 10, 136 (2015).
    OpenUrl
  39. 39.↵
    Uchiyama H, et al. Cyclin-dependent kinase inhibitor SU9516 enhances sensitivity to methotrexate in human T-cell leukemia Jurkat cells. Cancer Sci 101, 728–734 (2010).
    OpenUrlPubMed
  40. 40.↵
    O’Brien V, Brown R. Signalling cell cycle arrest and cell death through the MMR System. Carcinogenesis 27, 682–692 (2006).
    OpenUrlCrossRefPubMedWeb of Science
  41. 41.↵
    Sramek M, Neradil J, Veselska R. Much more than you expected: The non-DHFR-mediated effects of methotrexate. Biochim Biophys Acta Gen Subj 1861, 499–503 (2017).
    OpenUrl
  42. 42.↵
    Harewood L, et al. Hi-C as a tool for precise detection and characterisation of chromosomal rearrangements and copy number variation in human tumours. Genome Biol 18, 125 (2017).
    OpenUrlCrossRef
  43. 43.↵
    Rao SS, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
    OpenUrlCrossRefPubMedWeb of Science
  44. 44.↵
    Matsui A, Ihara T, Suda H, Mikami H, Semba K. Gene amplification: mechanisms and involvement in cancer. Biomol Concepts 4, 567–582 (2013).
    OpenUrlCrossRefPubMed
  45. 45.↵
    Verhaak RGW, Bafna V, Mischel PS. Extrachromosomal oncogene amplification in tumour pathogenesis and evolution. Nat Rev Cancer 19, 283–288 (2019).
    OpenUrlCrossRefPubMed
  46. 46.↵
    Gupta S, et al. Prioritization of anticancer drugs against a cancer using genomic features of cancer cells: A step towards personalized medicine. Sci Rep 6, 23857 (2016).
    OpenUrl
  47. 47.↵
    Hillcoat BL, Swett V, Bertino JR. Increase of dihydrofolate reductase activity in cultured mammalian cells after exposure to methotrexate. Proc Natl Acad Sci U S A 58, 1632–1637 (1967).
    OpenUrlFREE Full Text
  48. 48.↵
    Kulkarni P, Frommolt P. Challenges in the Setup of Large-scale Next-Generation Sequencing Analysis Workflows. Comput Struct Biotechnol J 15, 471–477 (2017).
    OpenUrl
  49. 49.↵
    Sedlazeck FJ, Lee H, Darby CA, Schatz MC. Piercing the dark matter: bioinformatics of long-range sequencing and mapping. Nat Rev Genet 19, 329–346 (2018).
    OpenUrl
  50. 50.↵
    Lengauer C, Kinzler KW, Vogelstein B. Genetic instabilities in human cancers. Nature 396, 643–649 (1998).
    OpenUrlCrossRefPubMedWeb of Science
  51. 51.↵
    Stapf M, Pompner N, Teichgraber U, Hilger I. Heterogeneous response of different tumor cell lines to methotrexate-coupled nanoparticles in presence of hyperthermia. Int J Nanomedicine 11, 485–500 (2016).
    OpenUrl
  52. 52.↵
    L’Abbate A, et al. Genomic organization and evolution of double minutes/homogeneously staining regions with MYC amplification in human cancer. Nucleic Acids Res 42, 9131–9145 (2014).
    OpenUrlCrossRefPubMedWeb of Science
  53. 53.↵
    Cheng C, et al. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma. Am J Hum Genet 98, 256–274 (2016).
    OpenUrlCrossRefPubMed
  54. 54.↵
    Carty M, et al. An integrated model for detecting significant chromatin interactions from high-resolution Hi-C data. Nat Commun 8, 15454 (2017).
    OpenUrl
  55. 55.↵
    Koche RP, et al. Extrachromosomal circular DNA drives oncogenic genome remodeling in neuroblastoma. Nat Genet 52, 29–34 (2020).
    OpenUrl
  56. 56.↵
    Kawakami H, Zaanan A, Sinicrope FA. Microsatellite instability testing and its role in the management of colorectal cancer. Curr Treat Options Oncol 16, 30 (2015).
    OpenUrlCrossRefPubMed
  57. 57.↵
    Matheson EC, Hogarth LA, Case MC, Irving JA, Hall AG. DHFR and MSH3 co-amplification in childhood acute lymphoblastic leukaemia, in vitro and in vivo. Carcinogenesis 28, 1341–1346 (2007).
    OpenUrlCrossRefPubMed
  58. 58.↵
    Marra G, Iaccarino I, Lettieri T, Roscilli G, Delmastro P, Jiricny J. Mismatch repair deficiency associated with overexpression of the MSH3 gene. Proc Natl Acad Sci U S A 95, 8568–8573 (1998).
    OpenUrlAbstract/FREE Full Text
  59. 59.↵
    Willems RJ, Top J, Smith DJ, Roper DI, North SE, Woodford N. Mutations in the DNA mismatch repair proteins MutS and MutL of oxazolidinone-resistant or -susceptible Enterococcus faecium. Antimicrob Agents Chemother 47, 3061–3066 (2003).
    OpenUrlAbstract/FREE Full Text
  60. 60.↵
    Junop MS, Yang W, Funchain P, Clendenin W, Miller JH. In vitro and in vivo studies of MutS, MutL and MutH mutants: correlation of mismatch repair and DNA recombination. DNA Repair (Amst) 2, 387–405 (2003).
    OpenUrlCrossRefPubMed
  61. 61.↵
    Morales C, et al. Dihydrofolate reductase amplification and sensitization to methotrexate of methotrexate-resistant colon cancer cells. Mol Cancer Ther 8, 424–432 (2009).
    OpenUrlAbstract/FREE Full Text
  62. 62.↵
    Seo JS, et al. Whole exome and transcriptome analyses integrated with microenvironmental immune signatures of lung squamous cell carcinoma. Cancer Immunol Res, (2018).
  63. 63.↵
    McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–1303 (2010).
    OpenUrlAbstract/FREE Full Text
  64. 64.↵
    Seo JS, et al. The transcriptional landscape and mutational profile of lung adenocarcinoma. Genome Res 22, 2109–2119 (2012).
    OpenUrlAbstract/FREE Full Text
  65. 65.↵
    Shen S, et al. rMATS: robust and flexible detection of differential alternative splicing from replicate RNA-Seq data. Proc Natl Acad Sci U S A 111, E5593–5601 (2014).
    OpenUrlAbstract/FREE Full Text
  66. 66.↵
    Sedlazeck FJ, et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat Methods 15, 461–468 (2018).
    OpenUrlCrossRef
  67. 67.↵
    Nattestad M, et al. Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line. Genome Res 28, 1126–1135 (2018).
    OpenUrlAbstract/FREE Full Text
  68. 68.↵
    Lieberman-Aiden E, et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
    OpenUrlAbstract/FREE Full Text
  69. 69.↵
    Servant N, et al. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol 16, 259 (2015).
    OpenUrlCrossRefPubMed
  70. 70.↵
    Chakraborty A, Ay F. Identification of copy number variations and translocations in cancer cells from Hi-C data. Bioinformatics, (2017).
  71. 71.↵
    Stansfield JC, Cresswell KG, Vladimirov VI, Dozmorov MG. HiCcompare: an R-package for joint normalization and comparison of HI-C datasets. BMC Bioinformatics 19, 279 (2018).
    OpenUrl
Back to top
PreviousNext
Posted February 27, 2020.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Genomic and transcriptomic analyses reveal a tandem amplification unit of 11 genes and mutations of mismatch repair genes in methotrexate-resistant HT-29 cells
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Genomic and transcriptomic analyses reveal a tandem amplification unit of 11 genes and mutations of mismatch repair genes in methotrexate-resistant HT-29 cells
Ahreum Kim, Jong-Yeon Shin, Jeong-Sun Seo
bioRxiv 2020.02.26.965814; doi: https://doi.org/10.1101/2020.02.26.965814
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Genomic and transcriptomic analyses reveal a tandem amplification unit of 11 genes and mutations of mismatch repair genes in methotrexate-resistant HT-29 cells
Ahreum Kim, Jong-Yeon Shin, Jeong-Sun Seo
bioRxiv 2020.02.26.965814; doi: https://doi.org/10.1101/2020.02.26.965814

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4237)
  • Biochemistry (9154)
  • Bioengineering (6797)
  • Bioinformatics (24052)
  • Biophysics (12149)
  • Cancer Biology (9560)
  • Cell Biology (13811)
  • Clinical Trials (138)
  • Developmental Biology (7651)
  • Ecology (11729)
  • Epidemiology (2066)
  • Evolutionary Biology (15534)
  • Genetics (10663)
  • Genomics (14344)
  • Immunology (9501)
  • Microbiology (22874)
  • Molecular Biology (9113)
  • Neuroscience (49078)
  • Paleontology (357)
  • Pathology (1487)
  • Pharmacology and Toxicology (2575)
  • Physiology (3851)
  • Plant Biology (8347)
  • Scientific Communication and Education (1473)
  • Synthetic Biology (2299)
  • Systems Biology (6202)
  • Zoology (1302)