Integrative Statistical Inferences for Drug Sensitivity Biomarkers in Cancer

Ehsan Ullah; Saila Shama; Noora Al Muftah; Ian Richard Thompson; Reda Rawi; Raghvendra Mall; Halima Bensmail

doi:10.1101/194670

Abstract

Personal medicine has been associated with different patient responses to different anti-cancer therapies. Recently, scientists are looking not only for new biomarkers associated with a disease such as cancer but also identifying biomarkers that predict patients who are most likely to respond to a particular cancer treatment. Orderly endeavors to relate cancer mutational information with biological conditions may encourage the interpretation of somatic mutation indexes into significant biomarkers for patient stratification.

We have screened and incorporated a board of cancer cell lines from Genomics of Drug Sensitivity in Cancer (GDSC) database to recognize genomic highlights related with drug sensitivity. We used mutation, DNA copy number variation, and gene expression information from Catalogue of Somatic Mutations in Cancer (COSMIC) and The Cancer Genome ATLAS (TCGA) for cell lines with their reactions to associate focused and cytotoxic treatments with approved drugs and drugs under clinical and preclinical examination.

We discovered mutated cancer genes were related with cell reaction to, mostly accessible, cancer medications and some mutated genes were related with sensitivity to an expansive scope of therapeutic agents. By connecting drug activity to the useful many-sided quality of cancer genomes, efficient pharmacogenomic profiling in tumor cell lines gives an intense biomarker revelation stage to guide balanced malignancy remedial systems.

Our study highlights that gene ANK2 amplification, and gene CELSER1 amplification and deletion are highly associated with anti-leukemic drug candidate LFM-A13. It also highlights that gene NUP214 and ROS1 copy number and gene NSD1 amplification are as a group highly associated with the parkinson drug Nilotinib. Finally, our study confirms that gene BRAF mutation is interacting with the BRAF-selective inhibitors drugs PLX4720 and SB590885. On the other hand, our study provides two open source analysis packages: bastah for the multitask-association analysis, and UNGeneAnno for automatic annotation of the variants.

1 Background

Clinical reactions to anti-cancer treatments are regularly confined to a subset of patients. Now and again, mutated cancer genes are strong biomarkers of reaction to targeted agents. There is an earnest need to recognize biomarkers that can anticipate patients ability to react to a treatment [32]. Moreover, the success of precision medicine is based on our ability to effectively translate genomic data into effective, customized prognosis and treatment strategies for individual patients. This requires identifying a genomic disease signature from a patient, then associating it with the most effective therapeutic intervention [6]. Recently, researchers have utilized cell lines to identify molecular consequences associated with various exposures, such as drugs, small molecules, or pathogens to move towards this goal [13], but many challenges still remain including what kind of data is needed to develop these genomic signatures and what kind of methods are needed to extract the appropriate information from high-throughput genomic data sets. Such analysis requires the use of genomic features such as gene expression, copy number variation, deletion, mutation, and sequence data, in the predictive modeling of anti-cancer drug response and holds the potential to speed up the emergence of personalized cancer therapies.

The first venture to address these challenges is to create far reaching drug affectability profiling estimations crosswise over different drugs, malady sub-sorts, and genomic profiling advances. Several of these data sets have been generated with a focus on cancer biology, in particular Glioma and Breast cancer. The bottleneck then becomes robust computational approaches that connect genomic profiles with drug and disease response in these datasets. For example, we view each tumor’s Copy Number Aberration (CNA) and mutation profiles as a system perturbation that simultaneously affect multiple genes. CNAs and mutations tend to appear in a patient-specific and multifactorial manner that is ideal for targeted medicine and targeted drug sensitivity [12].

In this paper, we present SLICED (StatisticaL Inferences for CEll lines screening in Drug discovery), a computational method that links genomic features, drugs and the associated half maximal inhibitory concentration (IC50) i.e., concentration at which half the target cells are killed. SLICED uses a newly developed two-layer algorithm which performs a multiplex association analysis on genomic features drugs cell lines with a focus on lasso (least absolute shrinkage and selection operator) [25].

We identified clinical trial datasets that had assessed tumor gene expression before drug treatment (using expression microarrays, copy numbers and mutations) and had subsequently measured a clear drug response phenotype using The Cancer Genome Atlas (TCGA), Catalogue of Somatic Mutations in Cancer (COSMIC) and Genomics of Drug Sensitivity in Cancer (GDSC) consortium to which we apply SLICED to predict the association of drugs-genomic features.

2 Results

Our goal was to use baseline gene expression and in vitro drug sensitivity derived from cell lines, coupled with in vivo baseline tumor gene expression, mutation and copy number to predict patients response to anti-cancer drugs. An overview of our approach is shown in Figure 1 (complete details are given in Materials and Methods section).

Figure 1: Flowchart of SLICED.

To capture the high degree of genomic diversity in cancer and to identify rare mutant subsets with altered drug sensitivity, we assembled 639 human tumour cell lines, representing the spectrum of common and rare types of adult and childhood cancers of epithelial, mesenchymal and haematopoietic origin. Cell lines were subjected to sequencing of the full coding exons of 64 commonly mutated cancer genes, genome-wide analysis of copy number gain and loss using Affymetrix SNP6.0 microarrays, and expression profiling using Affymetrix HT-U133A microarrays.

The available drugs covered a wide range of targets and processes implicated in cancer biology. They encompassed both targeted agents and cytotoxic chemotherapeutics, including approved drugs used in clinical practice, drugs in development undergoing studies in clinical trials, and experimental tool compounds (list of drugs is available as supplementary material S1 and annotations as S2). To dig into drug-to-drug variation, we included multiple drugs (related to the cancer) designed against well-credentialed targets. The effect of 72 hours of drug treatment on cell viability was used to derive a multi-parameter description of drug sensitivity (IC50), and the slope of the dose response curve. In preliminarily tests, we assessed several of the plethora of available machine learning algorithms, including principal component regression [16], elastic net regression [36], sslasso [14] and lasso.proj [27]. Among them, elastic net, sslasso and lasso.proj were consistently the best performers, with the added advantage of having an elasticNet being highly computationally efficient which is crucial for cross-validation analysis, but does not produce the significance of the effect of the association (p-value). Sslasso and lasso.proj which provide significance of the effect (p-value) are unfortunately computationally inefficient.

In the combined population, 1025 gene expressions, 1091 mutation and 476 CNV were evaluated. Using bastah, we find 41 genomic features whose expression significantly correlated to IC50 and 28 drugs (see Table 1). We identified mutations that were significantly associated with IC50 through regulation, gene expressions and deletion. In the context of compound sensitivity prediction, previous studies have demonstrated that such feature selection may provide the basis for identifying functional biomarkers underlying compound sensitivity or resistance [21, 9, 3, 11].

View this table:

Table 1:

Final list of significant association between drugs and genomics features using bastah

We compiled a list of known biomarkers of sensitivity (gold standards) for compounds represented in both the Sanger and CCLE panels, cell lines were categorized into 13 tissues, and evaluated the rank of each biomarker (based on the absolute value of the regression coefficients) in the model generated by bastah for the corresponding compound. In the following, we summarize the most important significant associations identified by our algorithm with a p-value threshold of 10^-06. In particular we focus on two categories of drugs: approved and under clinical trials.

2.1 Association of biomarker sensitivity with approved drugs

Nilotinib (associated with NSD1 amplification, and NUP214 and ROS1 copy number change): Nilotinib is a drug used for chronic myelogenous leukaemia (CML) [30].
Afatinib (associated with GPRASP1 amplification): Afatinib is a tyrosine kinase inhibitor of human EGF receptor (EGFR) and human epidermal growth factor receptor-2 (HER-2)/neu. EGFR and HER-2/neu used for treatment of solid tumors [18].
Lapatinib (associated with CPS1 and MAGI2 amplification): Lapatinib is a HER2-positive breast cancer medicine. It is a dual tyrosine kinase inhibitor that interrupts the epidermal growth factor receptor (EGFR) and HER2/neu pathways [35].
Imatinib (associated with GIGYF2 deletion and PPFIA4 mutation): Imatinib is a tyrosine-kinase inhibitor. It is used in the treatment of multiple cancers, most notably Philadelphia chromosome-positive (Ph+) chronic myelogenous leukemia (CML) [4].
Dasatinib (associated with PPFIA4 mutation): Dasatinib is an inhibitor of BCR-ABL tyrosine kinase and SRC family tyrosine kinase used to treat patients with chronic myelogenous leukemia (CML) [20].

2.2 Association of biomarker sensitivity with under clinical trial drugs

Nutlin-3 (associated with TP53 mutation): Nutlin-3 is a potent inhibitor of MDM2 (mouse double minute 2) binding to p53 that induces the expression of p53-regulated genes [5].
PLX-4720 (associated with BRAF mutation): PLX-4720 significantly inhibits the ERK phosphorylation in cell lines bearing B-Raf^V⁶⁰⁰E, but not the cells with wild-type B-Raf. PLX-4720 significantly inhibits the growth of tumor cell lines bearing the B-Raf^V⁶⁰⁰E oncogene [22].
VX-702 (associated with GNAS amplification): VX-702 is an active p38 MAP kinase inhibitor, for the potential treatment of inflammation, rheumatoid arthritis and cardiovascular diseases [7].
SB590885 (associated with BRAF mutation): SB590885 is used for treating melanoma. SB590885 is a potent and selective ATP competitive inhibitor of B-Raf kinase [31].
LFM-A13 (associated with ANK2 amplification and CELSR1 amplification and deletion): LFM- A13 is a potent and selective inhibitor of Bruton’s tyrosine kinase (BTK) used as an anti-breast cancer drug [2].
GW-441756 (associated with SYNE2 amplification): GW-441756 is a potent, selective inhibitor of the NGF receptor tyrosine kinase A (TrkA) [33].
GNF-2 (associated with PPFIA4 mutation): GNF-2 inhibits the cellular tyrosine phosphorylation of BCR-ABL in a dose-dependent manner induces a significant decrease in the levels of phospho-Stat5 in Ba/F3.p210 cells. [1].
TW-37 (associated with ITGB4 gene copy number): TW-37 is used for pancreatic cancer treatment. It is a small-molecule inhibitor of Bcl-2, inhibits cell growth and induces apoptosis in pancreatic cancer [28].
WH.4.023 (associated with PPFIA4 mutation): We do not have enough online data for this drug.
Motesanib (associated with PDGFRA copy number): Motesanib is used for treatment of non- small cell lung cancer. an inhibitor of VEGR, cKit and PDGF receptor tyrosine kinases [24].

Detailed results for all the annotated drugs are available in additional file S2. To get deep into our analysis, we looked closely at the interaction between different variants, genomic expression, copy numbers, amplifications, and deletions with drug response. Figure 2 and 3 summarize the significant association and interaction between different genomic features and a particular drug.

Figure 2: Significant association between different genomic features and a particular drug.

Figure 3: Summarizes the significant interaction between different genomic feature and a particular drug. For example one can see that genes ANK2 copy number variation, and CELSER1 copy number variation and CELSR1 amplification are interacting with each other and are highly associated as a group with LFM-A13.

From figure 3 we can see that gene ANK2 copy number variation, gene CELSER1 copy number variation and amplification are highly associated with LFM-A13. We also find that gene NSD1 amplification, gene NUP214 copy number and gene ROS1 copy number are as a group highly associated with Nilotinib. Finally, we find that gene BRAF mutation is interacting with the BRAF-selective inhibitors PLX4720 and SB590885.

3 Materials and Methods

We have screened and integrated a panel of cancer cell lines including mutation, DNA copy number, and gene expression data along with their responses to targeted and cytotoxic drug therapies from different databases. The dataset contained a broad range of drugs (5 categories) which are: targeted agents, cytotoxic chemotherapeutics, approved drugs used in clinical practice, drugs in development undergoing studies in clinical trials and experimental compounds.

3.1 Multi-dimensional genomic data

The data was collated together primarily from the publicly available GDSC data (version 5) [34], and IC50 measurements for a large number of drugs, against a wide range of cell lines [9].

3.2 Construction of genetic alteration profiles of candidate genes

Complete gene expression, mutation and copy number variation (cnv) data was downloaded from the publicly available Catalogue of Somatic Mutations in Cancer (COSMIC) dataset [8]. Mutation data was represented as a binary variable for each gene as mutated/non-mutated. Binary amplification and deletion variables were derived from copy number variation as cnv ≥ 5, and cnv < 2 respectively. A continuous value column, Δcnv, was also included representing amplifications as positive number and deletions as negative numbers to account for the magnitude of copy number variation.

The Broad Institute’s MutSig program [17] was used to identify commonly mutated genes in the publicly available data downloaded from The Cancer Genome Atlas (TCGA). The most commonly mutated 100 genes in the TCGA list were collected. To this list were added the 1000 most commonly mutated genes in the COSMIC dataset, giving a list of 1092 unique genes for analysis. The final data for the analysis contained the mutation status, copy number variation and expression data for these genes, giving a final dataset of 689 cell lines for which we have IC50 values for 141 drugs, mutation statuses for 1091 genes, copy number variation data for 476 genes, and expression data for 1025 genes. The final dataset, containing mutation, cnv and expression data, therefore has 3544 features. All data was normalized as explained next.

3.3 Processing each data set

Data set was first curated to include the samples of interest and all the information for each sample that the algorithm will use, such as drug subtype and cell lines types. Expression values in each data set are normalized and scaled by subtracting the mean and dividing by two standard deviations as in [10]. Categorical variables are binarized. Mutation and copy number variables have been centered by subtracting their mean. Missing expression values (for genes whose expression is present for some samples and not others in the data set) are imputed using nearest neighbor imputation (“knnImputation” in R package) [26].

3.4 Statistical analysis using bastah

An essential component of gathering all the information from different databases as explained is the systematic integration of large-scale genomic and drug sensitivity datasets. To identify genomic markers of drug response, we currently use a comprehensive analytical approach bastah whose goal is to correlate drug sensitivity (IC50 values and slope of the dose/response curve) with genomic alterations in cancer including point mutations, amplifications and deletions of common cancer genes for each drug. The approaches identify individual genomic features associated with drug sensitivity, and for each drug-gene association reports an unbiased size effect and statistical significance of the association with an adjusted p-value as will be briefly explained in this section.

In a linear regression model, we are given n pairs (y₁; x₁), (y₂; x₂),…., (y_n; x_n), with vectors x_i ∈ ℝ^p (p=3544) and response variables y_i given by where ε is the error (noise) term with independent and identically distributed (i.i.d.) components having E[ε] = 0 and V ar(ε) = σ². An intercept term may be implicitly present. In matrix form, letting y = (y₁, ….,y_n)^T and denoting by X the design matrix with rows ,we have

The goal is to estimate the unknown vector of parameters β ∈ ℝp.

In the high-dimensional setting where p>> n, the matrix (X^T X) is rank deficient and one has to resort to biased estimators. A particularly successful approach is the lasso [25] which promotes sparse reconstructions through an ℓ₁ penalization: where λ is the tuning parameter that determines the amount of penalization. A larger value of λ tends to encourage a greater number of the β_j to be set exactly to zero. The optimal value of λ can be determined by cross validation on regression error, or via an information-theoretic test based on Bayesian Information Criteria.

Using [29] and [14], we can furthermore establish a distribution of (an unbiased β estimator) that is

From which we derive the hypothesis test about β to test the null hypothesis H₀ : β_j = 0 versus the alternative H₁ : β_j≠0 (two sided test statistics)

3.5 Model fitting procedures

The genomic features matrix X contains information about gene mutations, gene copy number variation, and gene expression characteristics for each cell line. The rows of the matrix correspond to the cell lines, there are 689 cell lines, and each column represent a specific genomic feature. There are 3544 genomic features columns in total. These columns are divided into 1091 gene mutation, 1428 gene copy number and 1025 gene expression.

The gene mutations columns in the genomic features matrix are binary, representing two categories (0 →no mutation in gene, 1→gene contains mutation). The gene copy number columns are divided into three types for each gene: gene.cnv, gene amplification (gene.cnv.amp), and gene deletions (gene.cnv.del). The gene.cnv columns are continuous (values < 0 are deletions, values≥3 are amplifications), while the gene amplification and deletion columns are binary (0→no amplification/deletion in gene, 1→gene is amplified/deleted). The last group of columns of the genomic features matrix is the gene expression columns. The expression values are continuous. Each column contains the expression values of a specific gene across all cancer cell lines. The last group of columns of the genomic features matrix is the gene expression columns. The expression values are continuous. Each column contains the expression values of a specific gene across all cancer cell lines

This TCGA and COSMIC input dataset was divided into five non-overlapping sample groups, used as cross-validation folds for training and testing data. For each cross-validation fold, each model was trained on 4/5ths of the samples, and used to make predictions of sensitivity for the held out 1/5th of samples. Within each training step, a separate 5-fold cross-validation procedure was employed for parameter tuning of each model.

3.6 Feature selection

We use our approach of sparsity inducing regularizers embedded within the estimation of linear regression,bastah described before to achieve feature selection. In implementing l₁-regularized regression, the choiceof regularization term is crucial. This procedure was repeated 100 times for each drug in order to assessthe stability of the features (as done by [9]) when applying the 10-fold cross validation procedure. For each of the 100 runs, a feature list was built for the drug comprised of different genomic features with weights assigned to each. The final signature of markers for a drug consisted of all features that appear in any of the 100 runs along with significant p-value. The weights (w) and their respective p-values were used to assess effect sizes of features in a drug as a marker signature. The effect size of a feature was calculated by multiplying the feature’s weight by the p-values across the cell line panel. The effect size is therefore a normalization of the feature’s weight to account for the different scales used to measure the different genomic features.

Features with higher stability of correlation in cross-validation were considered of the highest confidence of truly being associated with drug response. The most significant features associated with drug response are those with large frequency and small p-value. In total, 40 features were highly reproduced and significantly associated with IC50 for 139 drugs. It is noticeable that copy number variation features have the wider ranges of average coefficients compared to gene mutation and expression features (Figure 4).

Figure 4: Top 30 most common significant genomic features among the panel of drugs

4 Discussion

We have shown that models constructed using variants, genomic expression, copy numbers, amplifications, deletions and drug IC50 values, from a large panel of cell lines, can predict the chemotherapeuticresponse in patients. While acquired resistance to chemotherapeutics or targeted agents is well recognized and is the subject of recently intensive studies [19, 15], the variation in sensitivities to these drugs displayed by the cell lines in culture is not likely to be due to resistance acquired from previous exposure to these drugs; the patients from whose tumors the cell lines were derived would not have been treated with many of these drugs.

Variation of gene expression levels in cancer is known to be brought about by copy number variation, deletion and amplification and in this way the three variables ought to be considered simultaneously when scanning for driver genes. We likewise considered cancer driver gene selection by means of investigation of copy number, mutation, amplification, deletion and expression levels [23] via newly developed generalized lasso techniques. When using gene network analysis on gene expressions only (results we did not report here), no structure was found associating drug sensitivity to gene expression. When variation in gene expression levels was used with penalized lasso, we obtained more information.

It might appear to be astounding that quality and variation in gene expression alone has such exceptional influence in enhancing drug responses, as the approach disregards what are thought to be important factors in drug sensitivity, for example, malignancy type or other specific markers. We demonstrated, through this work, that the good performance may (partially) come from not only the way the unbiased test statistics were used for genomics features prediction but also that the variability in gene expression as a surrogate for unmeasured phenotypes are straightforwardly important to chemotherapeutic sensitivity, therefore, can catch parts of both germline and tumor-specific genome variation.

In summary, we have proposed a novel integrative computational strategy based on a multidimensional multitask penalized regression for identifying driver genes for cancer. To effectively perform high dimensional genomic data analysis, we used recursive strategies in line with the penalized lasso method. Furthermore, we have proposed a parametric multidimensional statistical test for gene selection based on robust regression modeling. These findings have profound implications for personalized medicine and drug development. Future work will focus on improving predictions using more rigorous transcriptome quantification.

References

[1].↵
Francisco J Adrián, Qiang Ding, Taebo Sim, Anastasia Velentza, Christine Sloan, Yi Liu, Guobao Zhang, Wooyoung Hur, Sheng Ding, Paul Manley, et al. Allosteric inhibitors of bcr-abl–dependent cell proliferation. Nature chemical biology, 2(2):95–102, 2006.
OpenUrl
[2].↵
Akintunde Akinleye, Yamei Chen, Nikhil Mukhi, Yongping Song, and Delong Liu. Ibrutinib and novel BTK inhibitors in clinical development. Journal of hematology & oncology, 6(1):59, 2013.
OpenUrl
[3].↵
Jordi Barretina, Giordano Caponigro, Nicolas Stransky, Kavitha Venkatesan, Adam A Margolin, Sungjoon Kim, Christopher J Wilson, Joseph Lehár, Gregory V Kryukov, Dmitriy Sonkin, et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483(7391):603–607, 2012.
OpenUrl CrossRef PubMed Web of Science
[4].↵
Harry G Brittain. Profiles of drug substances, excipients and related methodology, volume 39. Academic Press, USA, 2013.
[5].↵
Yun-Jeong Choe, Sun-Young Lee, Kyung Won Ko, Seok Joon Shin, and Ho-Shik Kim. Nutlin-3 induces HO-1 expression by activating JNK in a transcription-independent manner of p53. International journal of oncology, 44(3):761–768, 2014.
OpenUrl PubMed
[6].↵
James C Costello, Laura M Heiser, Elisabeth Georgii, Mehmet Gönen, Michael P Menden, Nicholas J Wang, Mukesh Bansal, Petteri Hintsanen, Suleiman A Khan, John-Patrick Mpindi, et al. A com-munity effort to assess and improve drug sensitivity prediction algorithms. Nature biotechnology, 32(12):1202–1212, 2014.
OpenUrl CrossRef PubMed
[7].↵
Changhai Ding. Drug evaluation: Vx-702, a map kinase inhibitor for rheumatoid arthritis and acute coronary syndrome. Current opinion in investigational drugs, 7(11):1020–1025, 2006.
OpenUrl
[8].↵
Simon A Forbes, David Beare, Prasad Gunasekaran, Kenric Leung, Nidhi Bindal, Harry Boutselakis, Minjie Ding, Sally Bamford, Charlotte Cole, Sari Ward, et al. COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic acids research, 43(D1):D805–D811, 2015.
OpenUrl CrossRef PubMed
[9].↵
Mathew J Garnett, Elena J Edelman, Sonja J Heidorn, Chris D Greenman, Anahita Dastur, King Wai Lau, Patricia Greninger, I Richard Thompson, Xi Luo, Jorge Soares, et al. System-atic identification of genomic markers of drug sensitivity in cancer cells. Nature, 483(7391):570–575, 2012.
OpenUrl CrossRef PubMed Web of Science
[10].↵
Andrew Gelman. Scaling regression inputs by dividing by two standard deviations. Statistics in medicine, 27(15):2865–2873, 2008.
OpenUrl CrossRef PubMed Web of Science
[11].↵
Laura M Heiser, Anguraj Sadanandam, Wen-Lin Kuo, Stephen C Benz, Theodore C Goldstein, Sam Ng, William J Gibb, Nicholas J Wang, Safiyyah Ziyad, Frances Tong, et al. Subtype and pathway specific responses to anticancer compounds in breast cancer. Proceedings of the National Academy of Sciences, 109(8):2724–2729, 2012.
OpenUrl Abstract/FREE Full Text
[12].↵
Swen Hoelder, Paul A Clarke, and Paul Workman. Discovery of small molecule cancer drugs: successes, challenges and opportunities. Molecular oncology, 6(2):155–176, 2012.
OpenUrl
[13].↵
Francesco Iorio, Theo A Knijnenburg, Daniel J Vis, Graham R Bignell, Michael P Menden, Michael Schubert, Nanne Aben, Emanuel Gonçalves, Syd Barthorpe, Howard Lightfoot, et al. A landscape of pharmacogenomic interactions in cancer. Cell, 166(3):740–754, 2016.
OpenUrl
[14].↵
Adel Javanmard and Andrea Montanari. Confidence intervals and hypothesis testing for high- dimensional regression. Journal of Machine Learning Research, 15(1):2869–2909, 2014.
OpenUrl
[15].↵
Yuqiu Jiang and Mu Wang. Personalized medicine in oncology: tailoring the right drug to the right patient. Biomarkers in medicine, 4(4):523–533, 2010.
OpenUrl
[16].↵
Ian T Jolliffe. A note on the use of principal components in regression. Applied Statistics, pages 300–303, 1982.
[17].↵
Michael S Lawrence, Petar Stojanov, Paz Polak, Gregory V Kryukov, Kristian Cibulskis, Andrey Sivachenko, Scott L Carter, Chip Stewart, Craig H Mermel, Steven A Roberts, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature, 499(7457):214–218, 2013.
OpenUrl CrossRef PubMed Web of Science
[18].↵
Natalie Minkovsky and Alan Berezov. BIBW-2992, a dual receptor tyrosine kinase inhibitor for the treatment of solid tumors. Current opinion in investigational drugs (London, England: 2000), 9(12):1336–1346, 2008.
OpenUrl PubMed
[19].↵
Alok Mishra and Mukesh Verma. Cancer biomarkers: are we ready for the prime time? Cancers, 2(1):190–208, 2010.
OpenUrl CrossRef PubMed
[20].↵
A Olivieri and L Manzione. Dasatinib: a new step in molecular target therapy. Annals of Oncology, 18(suppl 6):vi42–vi46, 2007.
OpenUrl CrossRef PubMed
[21].↵
Sreenath V Sharma, Daniel A Haber, and Jeff Settleman. Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents. Nature reviews cancer, 10(4):241–253, 2010.
OpenUrl CrossRef PubMed Web of Science
[22].↵
Lindsay C Spender, G John Ferguson, Sijia Liu, Chao Cui, Maria Romina Girotti, Gary Sibbet, Ellen B Higgs, Morven K Shuttleworth, Tom Hamilton, Paul Lorigan, et al. Mutational activation of BRAF confers sensitivity to transforming growth factor beta inhibitors in human cancer cells. Oncotarget, 7(50):81995–82012, 2016.
OpenUrl
[23].↵
Michael R Stratton, Peter J Campbell, and P Andrew Futreal. The cancer genome. Nature, 458(7239):719–724, 2009.
OpenUrl CrossRef PubMed Web of Science
[24].↵
Niall Tebbutt, Dusan Kotasek, Howard A Burris, Lee S Schwartzberg, Herbert Hurwitz, Joe Stephenson, Douglas J Warner, Lisa Chen, Cheng-Pang Hsu, and David Goldstein. Motesanib with or without panitumumab plus FOLFIRI or FOLFOX for the treatment of metastatic colorectal cancer. Cancer chemotherapy and pharmacology, 75(5):993–1004, 2015.
OpenUrl
[25].↵
Robert Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), pages 267–288, 1996.
[26].↵
Luis Torgo and Lúis Torgo. Data mining with R: learning with case studies. Chapman & Hall/CRC Boca Raton, FL:, USA, 2011.
[27].↵
Sara Van de Geer, Peter Bühlmann, Yaacov Ritov, Ruben Dezeure, et al. On asymptotically optimal confidence regions and tests for high-dimensional models. The Annals of Statistics, 42(3):1166–1202, 2014.
OpenUrl CrossRef
[28].↵
Haixia Wang, Zhifeng Zhang, Xiuping Wei, and Ruizhen Dai. Small-molecule inhibitor of Bcl-2 (TW- 37) suppresses growth and enhances cisplatin-induced apoptosis in ovarian cancer cells. Journal of ovarian research, 8(1):3, 2015.
OpenUrl
[29].↵
L Wasserman. All of statistics: a concise course in statistical inference. 2004.
[30].↵
Eea Weisberg, P Manley, J Mestan, S Cowan-Jacob, A Ray, and JD Griffin. Amn107 (nilotinib): a novel and selective inhibitor of bcr-abl. British Journal of Cancer, 94(12):1765–1769, 2006.
OpenUrl CrossRef PubMed Web of Science
[31].↵
Steven Whittaker, Delphine Ménard, Ruth Kirk, Lesley Ogilvie, Douglas Hedley, Alfonso Zambon, Filipa Lopes, Natasha Preece, Helen Manne, Sareena Rana, et al. A novel, selective, and efficacious nanomolar pyridopyrazinone inhibitor of V600EBRAF. Cancer research, 70(20):8036–8044, 2010.
OpenUrl Abstract/FREE Full Text
[32].↵
PD Williams, jk Lee, and D Theodorescu. Genomancy: predicting tumour response to cancer therapy based on the oracle of genetics. Curr Oncol, 16(1):56–58, 2009.
OpenUrl PubMed
[33].↵
Edgar R Wood, Lee Kuyper, Kimberly G Petrov, Robert N Hunter, Philip A Harris, and Karen Lackey. Discovery and in vitro evaluation of potent TrkA kinase inhibitors: oxindole and aza- oxindoles. Bioorganic & medicinal chemistry letters, 14(4):953–957, 2004.
OpenUrl
[34].↵
Wanjuan Yang, Jorge Soares, Patricia Greninger, Elena J Edelman, Howard Lightfoot, Simon Forbes, Nidhi Bindal, Dave Beare, James A Smith, I Richard Thompson, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic acids research, 41(D1):D955–D961, 2013.
OpenUrl CrossRef PubMed Web of Science
[35].↵
Wen-Lei Zhuo, Liang Zhang, Qi-Chao Xie, Bo Zhu, and Zheng-Tang Chen. Identifying differentially expressed genes and screening small molecule drugs for lapatinib-resistance of breast cancer by a bioinformatics strategy. Asian Pac J Cancer Prev, 15:10847–53, 2014.
OpenUrl
[36].↵
Hui Zou and Trevor Hastie. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301–320, 2005.
OpenUrl CrossRef Web of Science

View the discussion thread.

Posted September 27, 2017.

Download PDF

Citation Tools

Subject Area

Genomics

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11715)
Bioengineering (8723)
Bioinformatics (29129)
Biophysics (14936)
Cancer Biology (12049)
Cell Biology (17359)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14144)
Epidemiology (2067)
Evolutionary Biology (18268)
Genetics (12221)
Genomics (16767)
Immunology (11843)
Microbiology (28014)
Molecular Biology (11560)
Neuroscience (60814)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10384)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] [1].↵
Francisco J Adrián, Qiang Ding, Taebo Sim, Anastasia Velentza, Christine Sloan, Yi Liu, Guobao Zhang, Wooyoung Hur, Sheng Ding, Paul Manley, et al. Allosteric inhibitors of bcr-abl–dependent cell proliferation. Nature chemical biology, 2(2):95–102, 2006.
OpenUrl

[2] [2].↵
Akintunde Akinleye, Yamei Chen, Nikhil Mukhi, Yongping Song, and Delong Liu. Ibrutinib and novel BTK inhibitors in clinical development. Journal of hematology & oncology, 6(1):59, 2013.
OpenUrl

[3] [3].↵
Jordi Barretina, Giordano Caponigro, Nicolas Stransky, Kavitha Venkatesan, Adam A Margolin, Sungjoon Kim, Christopher J Wilson, Joseph Lehár, Gregory V Kryukov, Dmitriy Sonkin, et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483(7391):603–607, 2012.
OpenUrl CrossRef PubMed Web of Science

[4] [4].↵
Harry G Brittain. Profiles of drug substances, excipients and related methodology, volume 39. Academic Press, USA, 2013.

[5] [5].↵
Yun-Jeong Choe, Sun-Young Lee, Kyung Won Ko, Seok Joon Shin, and Ho-Shik Kim. Nutlin-3 induces HO-1 expression by activating JNK in a transcription-independent manner of p53. International journal of oncology, 44(3):761–768, 2014.
OpenUrl PubMed

[6] [6].↵
James C Costello, Laura M Heiser, Elisabeth Georgii, Mehmet Gönen, Michael P Menden, Nicholas J Wang, Mukesh Bansal, Petteri Hintsanen, Suleiman A Khan, John-Patrick Mpindi, et al. A com-munity effort to assess and improve drug sensitivity prediction algorithms. Nature biotechnology, 32(12):1202–1212, 2014.
OpenUrl CrossRef PubMed

[7] [7].↵
Changhai Ding. Drug evaluation: Vx-702, a map kinase inhibitor for rheumatoid arthritis and acute coronary syndrome. Current opinion in investigational drugs, 7(11):1020–1025, 2006.
OpenUrl

[8] [8].↵
Simon A Forbes, David Beare, Prasad Gunasekaran, Kenric Leung, Nidhi Bindal, Harry Boutselakis, Minjie Ding, Sally Bamford, Charlotte Cole, Sari Ward, et al. COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic acids research, 43(D1):D805–D811, 2015.
OpenUrl CrossRef PubMed

[9] [9].↵
Mathew J Garnett, Elena J Edelman, Sonja J Heidorn, Chris D Greenman, Anahita Dastur, King Wai Lau, Patricia Greninger, I Richard Thompson, Xi Luo, Jorge Soares, et al. System-atic identification of genomic markers of drug sensitivity in cancer cells. Nature, 483(7391):570–575, 2012.
OpenUrl CrossRef PubMed Web of Science

[10] [10].↵
Andrew Gelman. Scaling regression inputs by dividing by two standard deviations. Statistics in medicine, 27(15):2865–2873, 2008.
OpenUrl CrossRef PubMed Web of Science

[11] [11].↵
Laura M Heiser, Anguraj Sadanandam, Wen-Lin Kuo, Stephen C Benz, Theodore C Goldstein, Sam Ng, William J Gibb, Nicholas J Wang, Safiyyah Ziyad, Frances Tong, et al. Subtype and pathway specific responses to anticancer compounds in breast cancer. Proceedings of the National Academy of Sciences, 109(8):2724–2729, 2012.
OpenUrl Abstract/FREE Full Text

[12] [12].↵
Swen Hoelder, Paul A Clarke, and Paul Workman. Discovery of small molecule cancer drugs: successes, challenges and opportunities. Molecular oncology, 6(2):155–176, 2012.
OpenUrl

[13] [13].↵
Francesco Iorio, Theo A Knijnenburg, Daniel J Vis, Graham R Bignell, Michael P Menden, Michael Schubert, Nanne Aben, Emanuel Gonçalves, Syd Barthorpe, Howard Lightfoot, et al. A landscape of pharmacogenomic interactions in cancer. Cell, 166(3):740–754, 2016.
OpenUrl

[14] [14].↵
Adel Javanmard and Andrea Montanari. Confidence intervals and hypothesis testing for high- dimensional regression. Journal of Machine Learning Research, 15(1):2869–2909, 2014.
OpenUrl

[15] [15].↵
Yuqiu Jiang and Mu Wang. Personalized medicine in oncology: tailoring the right drug to the right patient. Biomarkers in medicine, 4(4):523–533, 2010.
OpenUrl

[16] [16].↵
Ian T Jolliffe. A note on the use of principal components in regression. Applied Statistics, pages 300–303, 1982.

[17] [17].↵
Michael S Lawrence, Petar Stojanov, Paz Polak, Gregory V Kryukov, Kristian Cibulskis, Andrey Sivachenko, Scott L Carter, Chip Stewart, Craig H Mermel, Steven A Roberts, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature, 499(7457):214–218, 2013.
OpenUrl CrossRef PubMed Web of Science

[18] [18].↵
Natalie Minkovsky and Alan Berezov. BIBW-2992, a dual receptor tyrosine kinase inhibitor for the treatment of solid tumors. Current opinion in investigational drugs (London, England: 2000), 9(12):1336–1346, 2008.
OpenUrl PubMed

[19] [19].↵
Alok Mishra and Mukesh Verma. Cancer biomarkers: are we ready for the prime time? Cancers, 2(1):190–208, 2010.
OpenUrl CrossRef PubMed

[20] [20].↵
A Olivieri and L Manzione. Dasatinib: a new step in molecular target therapy. Annals of Oncology, 18(suppl 6):vi42–vi46, 2007.
OpenUrl CrossRef PubMed

[21] [21].↵
Sreenath V Sharma, Daniel A Haber, and Jeff Settleman. Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents. Nature reviews cancer, 10(4):241–253, 2010.
OpenUrl CrossRef PubMed Web of Science

[22] [22].↵
Lindsay C Spender, G John Ferguson, Sijia Liu, Chao Cui, Maria Romina Girotti, Gary Sibbet, Ellen B Higgs, Morven K Shuttleworth, Tom Hamilton, Paul Lorigan, et al. Mutational activation of BRAF confers sensitivity to transforming growth factor beta inhibitors in human cancer cells. Oncotarget, 7(50):81995–82012, 2016.
OpenUrl

[23] [23].↵
Michael R Stratton, Peter J Campbell, and P Andrew Futreal. The cancer genome. Nature, 458(7239):719–724, 2009.
OpenUrl CrossRef PubMed Web of Science

[24] [24].↵
Niall Tebbutt, Dusan Kotasek, Howard A Burris, Lee S Schwartzberg, Herbert Hurwitz, Joe Stephenson, Douglas J Warner, Lisa Chen, Cheng-Pang Hsu, and David Goldstein. Motesanib with or without panitumumab plus FOLFIRI or FOLFOX for the treatment of metastatic colorectal cancer. Cancer chemotherapy and pharmacology, 75(5):993–1004, 2015.
OpenUrl

[25] [25].↵
Robert Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), pages 267–288, 1996.

[26] [26].↵
Luis Torgo and Lúis Torgo. Data mining with R: learning with case studies. Chapman & Hall/CRC Boca Raton, FL:, USA, 2011.

[27] [27].↵
Sara Van de Geer, Peter Bühlmann, Yaacov Ritov, Ruben Dezeure, et al. On asymptotically optimal confidence regions and tests for high-dimensional models. The Annals of Statistics, 42(3):1166–1202, 2014.
OpenUrl CrossRef

[28] [28].↵
Haixia Wang, Zhifeng Zhang, Xiuping Wei, and Ruizhen Dai. Small-molecule inhibitor of Bcl-2 (TW- 37) suppresses growth and enhances cisplatin-induced apoptosis in ovarian cancer cells. Journal of ovarian research, 8(1):3, 2015.
OpenUrl

[29] [29].↵
L Wasserman. All of statistics: a concise course in statistical inference. 2004.

[30] [30].↵
Eea Weisberg, P Manley, J Mestan, S Cowan-Jacob, A Ray, and JD Griffin. Amn107 (nilotinib): a novel and selective inhibitor of bcr-abl. British Journal of Cancer, 94(12):1765–1769, 2006.
OpenUrl CrossRef PubMed Web of Science

[31] [31].↵
Steven Whittaker, Delphine Ménard, Ruth Kirk, Lesley Ogilvie, Douglas Hedley, Alfonso Zambon, Filipa Lopes, Natasha Preece, Helen Manne, Sareena Rana, et al. A novel, selective, and efficacious nanomolar pyridopyrazinone inhibitor of V600EBRAF. Cancer research, 70(20):8036–8044, 2010.
OpenUrl Abstract/FREE Full Text

[32] [32].↵
PD Williams, jk Lee, and D Theodorescu. Genomancy: predicting tumour response to cancer therapy based on the oracle of genetics. Curr Oncol, 16(1):56–58, 2009.
OpenUrl PubMed

[33] [33].↵
Edgar R Wood, Lee Kuyper, Kimberly G Petrov, Robert N Hunter, Philip A Harris, and Karen Lackey. Discovery and in vitro evaluation of potent TrkA kinase inhibitors: oxindole and aza- oxindoles. Bioorganic & medicinal chemistry letters, 14(4):953–957, 2004.
OpenUrl

[34] [34].↵
Wanjuan Yang, Jorge Soares, Patricia Greninger, Elena J Edelman, Howard Lightfoot, Simon Forbes, Nidhi Bindal, Dave Beare, James A Smith, I Richard Thompson, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic acids research, 41(D1):D955–D961, 2013.
OpenUrl CrossRef PubMed Web of Science

[35] [35].↵
Wen-Lei Zhuo, Liang Zhang, Qi-Chao Xie, Bo Zhu, and Zheng-Tang Chen. Identifying differentially expressed genes and screening small molecule drugs for lapatinib-resistance of breast cancer by a bioinformatics strategy. Asian Pac J Cancer Prev, 15:10847–53, 2014.
OpenUrl

[36] [36].↵
Hui Zou and Trevor Hastie. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301–320, 2005.
OpenUrl CrossRef Web of Science