Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Integrative Statistical Inferences for Drug Sensitivity Biomarkers in Cancer

Ehsan Ullah, Saila Shama, Noora Al Muftah, Ian Richard Thompson, Reda Rawi, Raghvendra Mall, Halima Bensmail
doi: https://doi.org/10.1101/194670
Ehsan Ullah
1Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Saila Shama
1Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
2Faculty of Applied Science and Engineering, Engineering Science, University of Toronto, 40 Saint George Street, Toronto, Canada M4C 3C2
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Noora Al Muftah
1Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
3Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ian Richard Thompson
4Qatar Biomedical Research Institute, Hamad Bin Khalifa University, Doha, Qatar
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Reda Rawi
1Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
5Vaccine Research Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health,Bethesda, MD 20814, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Raghvendra Mall
1Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Halima Bensmail
1Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Personal medicine has been associated with different patient responses to different anti-cancer therapies. Recently, scientists are looking not only for new biomarkers associated with a disease such as cancer but also identifying biomarkers that predict patients who are most likely to respond to a particular cancer treatment. Orderly endeavors to relate cancer mutational information with biological conditions may encourage the interpretation of somatic mutation indexes into significant biomarkers for patient stratification.

We have screened and incorporated a board of cancer cell lines from Genomics of Drug Sensitivity in Cancer (GDSC) database to recognize genomic highlights related with drug sensitivity. We used mutation, DNA copy number variation, and gene expression information from Catalogue of Somatic Mutations in Cancer (COSMIC) and The Cancer Genome ATLAS (TCGA) for cell lines with their reactions to associate focused and cytotoxic treatments with approved drugs and drugs under clinical and preclinical examination.

We discovered mutated cancer genes were related with cell reaction to, mostly accessible, cancer medications and some mutated genes were related with sensitivity to an expansive scope of therapeutic agents. By connecting drug activity to the useful many-sided quality of cancer genomes, efficient pharmacogenomic profiling in tumor cell lines gives an intense biomarker revelation stage to guide balanced malignancy remedial systems.

Our study highlights that gene ANK2 amplification, and gene CELSER1 amplification and deletion are highly associated with anti-leukemic drug candidate LFM-A13. It also highlights that gene NUP214 and ROS1 copy number and gene NSD1 amplification are as a group highly associated with the parkinson drug Nilotinib. Finally, our study confirms that gene BRAF mutation is interacting with the BRAF-selective inhibitors drugs PLX4720 and SB590885. On the other hand, our study provides two open source analysis packages: bastah for the multitask-association analysis, and UNGeneAnno for automatic annotation of the variants.

1 Background

Clinical reactions to anti-cancer treatments are regularly confined to a subset of patients. Now and again, mutated cancer genes are strong biomarkers of reaction to targeted agents. There is an earnest need to recognize biomarkers that can anticipate patients ability to react to a treatment [32]. Moreover, the success of precision medicine is based on our ability to effectively translate genomic data into effective, customized prognosis and treatment strategies for individual patients. This requires identifying a genomic disease signature from a patient, then associating it with the most effective therapeutic intervention [6]. Recently, researchers have utilized cell lines to identify molecular consequences associated with various exposures, such as drugs, small molecules, or pathogens to move towards this goal [13], but many challenges still remain including what kind of data is needed to develop these genomic signatures and what kind of methods are needed to extract the appropriate information from high-throughput genomic data sets. Such analysis requires the use of genomic features such as gene expression, copy number variation, deletion, mutation, and sequence data, in the predictive modeling of anti-cancer drug response and holds the potential to speed up the emergence of personalized cancer therapies.

The first venture to address these challenges is to create far reaching drug affectability profiling estimations crosswise over different drugs, malady sub-sorts, and genomic profiling advances. Several of these data sets have been generated with a focus on cancer biology, in particular Glioma and Breast cancer. The bottleneck then becomes robust computational approaches that connect genomic profiles with drug and disease response in these datasets. For example, we view each tumor’s Copy Number Aberration (CNA) and mutation profiles as a system perturbation that simultaneously affect multiple genes. CNAs and mutations tend to appear in a patient-specific and multifactorial manner that is ideal for targeted medicine and targeted drug sensitivity [12].

In this paper, we present SLICED (StatisticaL Inferences for CEll lines screening in Drug discovery), a computational method that links genomic features, drugs and the associated half maximal inhibitory concentration (IC50) i.e., concentration at which half the target cells are killed. SLICED uses a newly developed two-layer algorithm which performs a multiplex association analysis on genomic features drugs cell lines with a focus on lasso (least absolute shrinkage and selection operator) [25].

We identified clinical trial datasets that had assessed tumor gene expression before drug treatment (using expression microarrays, copy numbers and mutations) and had subsequently measured a clear drug response phenotype using The Cancer Genome Atlas (TCGA), Catalogue of Somatic Mutations in Cancer (COSMIC) and Genomics of Drug Sensitivity in Cancer (GDSC) consortium to which we apply SLICED to predict the association of drugs-genomic features.

2 Results

Our goal was to use baseline gene expression and in vitro drug sensitivity derived from cell lines, coupled with in vivo baseline tumor gene expression, mutation and copy number to predict patients response to anti-cancer drugs. An overview of our approach is shown in Figure 1 (complete details are given in Materials and Methods section).

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: Flowchart of SLICED.

To capture the high degree of genomic diversity in cancer and to identify rare mutant subsets with altered drug sensitivity, we assembled 639 human tumour cell lines, representing the spectrum of common and rare types of adult and childhood cancers of epithelial, mesenchymal and haematopoietic origin. Cell lines were subjected to sequencing of the full coding exons of 64 commonly mutated cancer genes, genome-wide analysis of copy number gain and loss using Affymetrix SNP6.0 microarrays, and expression profiling using Affymetrix HT-U133A microarrays.

The available drugs covered a wide range of targets and processes implicated in cancer biology. They encompassed both targeted agents and cytotoxic chemotherapeutics, including approved drugs used in clinical practice, drugs in development undergoing studies in clinical trials, and experimental tool compounds (list of drugs is available as supplementary material S1 and annotations as S2). To dig into drug-to-drug variation, we included multiple drugs (related to the cancer) designed against well-credentialed targets. The effect of 72 hours of drug treatment on cell viability was used to derive a multi-parameter description of drug sensitivity (IC50), and the slope of the dose response curve. In preliminarily tests, we assessed several of the plethora of available machine learning algorithms, including principal component regression [16], elastic net regression [36], sslasso [14] and lasso.proj [27]. Among them, elastic net, sslasso and lasso.proj were consistently the best performers, with the added advantage of having an elasticNet being highly computationally efficient which is crucial for cross-validation analysis, but does not produce the significance of the effect of the association (p-value). Sslasso and lasso.proj which provide significance of the effect (p-value) are unfortunately computationally inefficient.

In the combined population, 1025 gene expressions, 1091 mutation and 476 CNV were evaluated. Using bastah, we find 41 genomic features whose expression significantly correlated to IC50 and 28 drugs (see Table 1). We identified mutations that were significantly associated with IC50 through regulation, gene expressions and deletion. In the context of compound sensitivity prediction, previous studies have demonstrated that such feature selection may provide the basis for identifying functional biomarkers underlying compound sensitivity or resistance [21, 9, 3, 11].

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1:

Final list of significant association between drugs and genomics features using bastah

We compiled a list of known biomarkers of sensitivity (gold standards) for compounds represented in both the Sanger and CCLE panels, cell lines were categorized into 13 tissues, and evaluated the rank of each biomarker (based on the absolute value of the regression coefficients) in the model generated by bastah for the corresponding compound. In the following, we summarize the most important significant associations identified by our algorithm with a p-value threshold of 10-06. In particular we focus on two categories of drugs: approved and under clinical trials.

2.1 Association of biomarker sensitivity with approved drugs

  • Nilotinib (associated with NSD1 amplification, and NUP214 and ROS1 copy number change): Nilotinib is a drug used for chronic myelogenous leukaemia (CML) [30].

  • Afatinib (associated with GPRASP1 amplification): Afatinib is a tyrosine kinase inhibitor of human EGF receptor (EGFR) and human epidermal growth factor receptor-2 (HER-2)/neu. EGFR and HER-2/neu used for treatment of solid tumors [18].

  • Lapatinib (associated with CPS1 and MAGI2 amplification): Lapatinib is a HER2-positive breast cancer medicine. It is a dual tyrosine kinase inhibitor that interrupts the epidermal growth factor receptor (EGFR) and HER2/neu pathways [35].

  • Imatinib (associated with GIGYF2 deletion and PPFIA4 mutation): Imatinib is a tyrosine-kinase inhibitor. It is used in the treatment of multiple cancers, most notably Philadelphia chromosome-positive (Ph+) chronic myelogenous leukemia (CML) [4].

  • Dasatinib (associated with PPFIA4 mutation): Dasatinib is an inhibitor of BCR-ABL tyrosine kinase and SRC family tyrosine kinase used to treat patients with chronic myelogenous leukemia (CML) [20].

2.2 Association of biomarker sensitivity with under clinical trial drugs

  • Nutlin-3 (associated with TP53 mutation): Nutlin-3 is a potent inhibitor of MDM2 (mouse double minute 2) binding to p53 that induces the expression of p53-regulated genes [5].

  • PLX-4720 (associated with BRAF mutation): PLX-4720 significantly inhibits the ERK phosphorylation in cell lines bearing B-RafV600E, but not the cells with wild-type B-Raf. PLX-4720 significantly inhibits the growth of tumor cell lines bearing the B-RafV600E oncogene [22].

  • VX-702 (associated with GNAS amplification): VX-702 is an active p38 MAP kinase inhibitor, for the potential treatment of inflammation, rheumatoid arthritis and cardiovascular diseases [7].

  • SB590885 (associated with BRAF mutation): SB590885 is used for treating melanoma. SB590885 is a potent and selective ATP competitive inhibitor of B-Raf kinase [31].

  • LFM-A13 (associated with ANK2 amplification and CELSR1 amplification and deletion): LFM- A13 is a potent and selective inhibitor of Bruton’s tyrosine kinase (BTK) used as an anti-breast cancer drug [2].

  • GW-441756 (associated with SYNE2 amplification): GW-441756 is a potent, selective inhibitor of the NGF receptor tyrosine kinase A (TrkA) [33].

  • GNF-2 (associated with PPFIA4 mutation): GNF-2 inhibits the cellular tyrosine phosphorylation of BCR-ABL in a dose-dependent manner induces a significant decrease in the levels of phospho-Stat5 in Ba/F3.p210 cells. [1].

  • TW-37 (associated with ITGB4 gene copy number): TW-37 is used for pancreatic cancer treatment. It is a small-molecule inhibitor of Bcl-2, inhibits cell growth and induces apoptosis in pancreatic cancer [28].

  • WH.4.023 (associated with PPFIA4 mutation): We do not have enough online data for this drug.

  • Motesanib (associated with PDGFRA copy number): Motesanib is used for treatment of non- small cell lung cancer. an inhibitor of VEGR, cKit and PDGF receptor tyrosine kinases [24].

Detailed results for all the annotated drugs are available in additional file S2. To get deep into our analysis, we looked closely at the interaction between different variants, genomic expression, copy numbers, amplifications, and deletions with drug response. Figure 2 and 3 summarize the significant association and interaction between different genomic features and a particular drug.

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Significant association between different genomic features and a particular drug.
Figure 3:
  • Download figure
  • Open in new tab
Figure 3: Summarizes the significant interaction between different genomic feature and a particular drug. For example one can see that genes ANK2 copy number variation, and CELSER1 copy number variation and CELSR1 amplification are interacting with each other and are highly associated as a group with LFM-A13.

From figure 3 we can see that gene ANK2 copy number variation, gene CELSER1 copy number variation and amplification are highly associated with LFM-A13. We also find that gene NSD1 amplification, gene NUP214 copy number and gene ROS1 copy number are as a group highly associated with Nilotinib. Finally, we find that gene BRAF mutation is interacting with the BRAF-selective inhibitors PLX4720 and SB590885.

3 Materials and Methods

We have screened and integrated a panel of cancer cell lines including mutation, DNA copy number, and gene expression data along with their responses to targeted and cytotoxic drug therapies from different databases. The dataset contained a broad range of drugs (5 categories) which are: targeted agents, cytotoxic chemotherapeutics, approved drugs used in clinical practice, drugs in development undergoing studies in clinical trials and experimental compounds.

3.1 Multi-dimensional genomic data

The data was collated together primarily from the publicly available GDSC data (version 5) [34], and IC50 measurements for a large number of drugs, against a wide range of cell lines [9].

3.2 Construction of genetic alteration profiles of candidate genes

Complete gene expression, mutation and copy number variation (cnv) data was downloaded from the publicly available Catalogue of Somatic Mutations in Cancer (COSMIC) dataset [8]. Mutation data was represented as a binary variable for each gene as mutated/non-mutated. Binary amplification and deletion variables were derived from copy number variation as cnv ≥ 5, and cnv < 2 respectively. A continuous value column, Δcnv, was also included representing amplifications as positive number and deletions as negative numbers to account for the magnitude of copy number variation.

The Broad Institute’s MutSig program [17] was used to identify commonly mutated genes in the publicly available data downloaded from The Cancer Genome Atlas (TCGA). The most commonly mutated 100 genes in the TCGA list were collected. To this list were added the 1000 most commonly mutated genes in the COSMIC dataset, giving a list of 1092 unique genes for analysis. The final data for the analysis contained the mutation status, copy number variation and expression data for these genes, giving a final dataset of 689 cell lines for which we have IC50 values for 141 drugs, mutation statuses for 1091 genes, copy number variation data for 476 genes, and expression data for 1025 genes. The final dataset, containing mutation, cnv and expression data, therefore has 3544 features. All data was normalized as explained next.

3.3 Processing each data set

Data set was first curated to include the samples of interest and all the information for each sample that the algorithm will use, such as drug subtype and cell lines types. Expression values in each data set are normalized and scaled by subtracting the mean and dividing by two standard deviations as in [10]. Categorical variables are binarized. Mutation and copy number variables have been centered by subtracting their mean. Missing expression values (for genes whose expression is present for some samples and not others in the data set) are imputed using nearest neighbor imputation (“knnImputation” in R package) [26].

3.4 Statistical analysis using bastah

An essential component of gathering all the information from different databases as explained is the systematic integration of large-scale genomic and drug sensitivity datasets. To identify genomic markers of drug response, we currently use a comprehensive analytical approach bastah whose goal is to correlate drug sensitivity (IC50 values and slope of the dose/response curve) with genomic alterations in cancer including point mutations, amplifications and deletions of common cancer genes for each drug. The approaches identify individual genomic features associated with drug sensitivity, and for each drug-gene association reports an unbiased size effect and statistical significance of the association with an adjusted p-value as will be briefly explained in this section.

In a linear regression model, we are given n pairs (y1; x1), (y2; x2),…., (yn; xn), with vectors xi ∈ ℝp (p=3544) and response variables yi given by Embedded Image where ε is the error (noise) term with independent and identically distributed (i.i.d.) components having E[ε] = 0 and V ar(ε) = σ2. An intercept term may be implicitly present. In matrix form, letting y = (y1, ….,yn)T and denoting by X the design matrix with rows Embedded Image,we have Embedded Image

The goal is to estimate the unknown vector of parameters β ∈ ℝp.

In the high-dimensional setting where p>> n, the matrix (XT X) is rank deficient and one has to resort to biased estimators. A particularly successful approach is the lasso [25] which promotes sparse reconstructions through an ℓ1 penalization: Embedded Image where λ is the tuning parameter that determines the amount of penalization. A larger value of λ tends to encourage a greater number of the βj to be set exactly to zero. The optimal value of λ can be determined by cross validation on regression error, or via an information-theoretic test based on Bayesian Information Criteria.

Using [29] and [14], we can furthermore establish a distribution of Embedded Image (an unbiased β estimator) that is Embedded Image

From which we derive the hypothesis test about β to test the null hypothesis H0 : βj = 0 versus the alternative H1 : βj≠0 (two sided test statistics) Embedded Image

3.5 Model fitting procedures

The genomic features matrix X contains information about gene mutations, gene copy number variation, and gene expression characteristics for each cell line. The rows of the matrix correspond to the cell lines, there are 689 cell lines, and each column represent a specific genomic feature. There are 3544 genomic features columns in total. These columns are divided into 1091 gene mutation, 1428 gene copy number and 1025 gene expression.

The gene mutations columns in the genomic features matrix are binary, representing two categories (0 →no mutation in gene, 1→gene contains mutation). The gene copy number columns are divided into three types for each gene: gene.cnv, gene amplification (gene.cnv.amp), and gene deletions (gene.cnv.del). The gene.cnv columns are continuous (values < 0 are deletions, values≥3 are amplifications), while the gene amplification and deletion columns are binary (0→no amplification/deletion in gene, 1→gene is amplified/deleted). The last group of columns of the genomic features matrix is the gene expression columns. The expression values are continuous. Each column contains the expression values of a specific gene across all cancer cell lines. The last group of columns of the genomic features matrix is the gene expression columns. The expression values are continuous. Each column contains the expression values of a specific gene across all cancer cell lines

This TCGA and COSMIC input dataset was divided into five non-overlapping sample groups, used as cross-validation folds for training and testing data. For each cross-validation fold, each model was trained on 4/5ths of the samples, and used to make predictions of sensitivity for the held out 1/5th of samples. Within each training step, a separate 5-fold cross-validation procedure was employed for parameter tuning of each model.

3.6 Feature selection

We use our approach of sparsity inducing regularizers embedded within the estimation of linear regression,bastah described before to achieve feature selection. In implementing l1-regularized regression, the choiceof regularization term is crucial. This procedure was repeated 100 times for each drug in order to assessthe stability of the features (as done by [9]) when applying the 10-fold cross validation procedure. For each of the 100 runs, a feature list was built for the drug comprised of different genomic features with weights assigned to each. The final signature of markers for a drug consisted of all features that appear in any of the 100 runs along with significant p-value. The weights (w) and their respective p-values were used to assess effect sizes of features in a drug as a marker signature. The effect size of a feature was calculated by multiplying the feature’s weight by the p-values across the cell line panel. The effect size is therefore a normalization of the feature’s weight to account for the different scales used to measure the different genomic features.

Features with higher stability of correlation in cross-validation were considered of the highest confidence of truly being associated with drug response. The most significant features associated with drug response are those with large frequency and small p-value. In total, 40 features were highly reproduced and significantly associated with IC50 for 139 drugs. It is noticeable that copy number variation features have the wider ranges of average coefficients compared to gene mutation and expression features (Figure 4).

Figure 4:
  • Download figure
  • Open in new tab
Figure 4: Top 30 most common significant genomic features among the panel of drugs

4 Discussion

We have shown that models constructed using variants, genomic expression, copy numbers, amplifications, deletions and drug IC50 values, from a large panel of cell lines, can predict the chemotherapeuticresponse in patients. While acquired resistance to chemotherapeutics or targeted agents is well recognized and is the subject of recently intensive studies [19, 15], the variation in sensitivities to these drugs displayed by the cell lines in culture is not likely to be due to resistance acquired from previous exposure to these drugs; the patients from whose tumors the cell lines were derived would not have been treated with many of these drugs.

Variation of gene expression levels in cancer is known to be brought about by copy number variation, deletion and amplification and in this way the three variables ought to be considered simultaneously when scanning for driver genes. We likewise considered cancer driver gene selection by means of investigation of copy number, mutation, amplification, deletion and expression levels [23] via newly developed generalized lasso techniques. When using gene network analysis on gene expressions only (results we did not report here), no structure was found associating drug sensitivity to gene expression. When variation in gene expression levels was used with penalized lasso, we obtained more information.

It might appear to be astounding that quality and variation in gene expression alone has such exceptional influence in enhancing drug responses, as the approach disregards what are thought to be important factors in drug sensitivity, for example, malignancy type or other specific markers. We demonstrated, through this work, that the good performance may (partially) come from not only the way the unbiased test statistics were used for genomics features prediction but also that the variability in gene expression as a surrogate for unmeasured phenotypes are straightforwardly important to chemotherapeutic sensitivity, therefore, can catch parts of both germline and tumor-specific genome variation.

In summary, we have proposed a novel integrative computational strategy based on a multidimensional multitask penalized regression for identifying driver genes for cancer. To effectively perform high dimensional genomic data analysis, we used recursive strategies in line with the penalized lasso method. Furthermore, we have proposed a parametric multidimensional statistical test for gene selection based on robust regression modeling. These findings have profound implications for personalized medicine and drug development. Future work will focus on improving predictions using more rigorous transcriptome quantification.

References

  1. [1].↵
    Francisco J Adrián, Qiang Ding, Taebo Sim, Anastasia Velentza, Christine Sloan, Yi Liu, Guobao Zhang, Wooyoung Hur, Sheng Ding, Paul Manley, et al. Allosteric inhibitors of bcr-abl–dependent cell proliferation. Nature chemical biology, 2(2):95–102, 2006.
    OpenUrl
  2. [2].↵
    Akintunde Akinleye, Yamei Chen, Nikhil Mukhi, Yongping Song, and Delong Liu. Ibrutinib and novel BTK inhibitors in clinical development. Journal of hematology & oncology, 6(1):59, 2013.
    OpenUrl
  3. [3].↵
    Jordi Barretina, Giordano Caponigro, Nicolas Stransky, Kavitha Venkatesan, Adam A Margolin, Sungjoon Kim, Christopher J Wilson, Joseph Lehár, Gregory V Kryukov, Dmitriy Sonkin, et al. The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483(7391):603–607, 2012.
    OpenUrlCrossRefPubMedWeb of Science
  4. [4].↵
    Harry G Brittain. Profiles of drug substances, excipients and related methodology, volume 39. Academic Press, USA, 2013.
  5. [5].↵
    Yun-Jeong Choe, Sun-Young Lee, Kyung Won Ko, Seok Joon Shin, and Ho-Shik Kim. Nutlin-3 induces HO-1 expression by activating JNK in a transcription-independent manner of p53. International journal of oncology, 44(3):761–768, 2014.
    OpenUrlPubMed
  6. [6].↵
    James C Costello, Laura M Heiser, Elisabeth Georgii, Mehmet Gönen, Michael P Menden, Nicholas J Wang, Mukesh Bansal, Petteri Hintsanen, Suleiman A Khan, John-Patrick Mpindi, et al. A com-munity effort to assess and improve drug sensitivity prediction algorithms. Nature biotechnology, 32(12):1202–1212, 2014.
    OpenUrlCrossRefPubMed
  7. [7].↵
    Changhai Ding. Drug evaluation: Vx-702, a map kinase inhibitor for rheumatoid arthritis and acute coronary syndrome. Current opinion in investigational drugs, 7(11):1020–1025, 2006.
    OpenUrl
  8. [8].↵
    Simon A Forbes, David Beare, Prasad Gunasekaran, Kenric Leung, Nidhi Bindal, Harry Boutselakis, Minjie Ding, Sally Bamford, Charlotte Cole, Sari Ward, et al. COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic acids research, 43(D1):D805–D811, 2015.
    OpenUrlCrossRefPubMed
  9. [9].↵
    Mathew J Garnett, Elena J Edelman, Sonja J Heidorn, Chris D Greenman, Anahita Dastur, King Wai Lau, Patricia Greninger, I Richard Thompson, Xi Luo, Jorge Soares, et al. System-atic identification of genomic markers of drug sensitivity in cancer cells. Nature, 483(7391):570–575, 2012.
    OpenUrlCrossRefPubMedWeb of Science
  10. [10].↵
    Andrew Gelman. Scaling regression inputs by dividing by two standard deviations. Statistics in medicine, 27(15):2865–2873, 2008.
    OpenUrlCrossRefPubMedWeb of Science
  11. [11].↵
    Laura M Heiser, Anguraj Sadanandam, Wen-Lin Kuo, Stephen C Benz, Theodore C Goldstein, Sam Ng, William J Gibb, Nicholas J Wang, Safiyyah Ziyad, Frances Tong, et al. Subtype and pathway specific responses to anticancer compounds in breast cancer. Proceedings of the National Academy of Sciences, 109(8):2724–2729, 2012.
    OpenUrlAbstract/FREE Full Text
  12. [12].↵
    Swen Hoelder, Paul A Clarke, and Paul Workman. Discovery of small molecule cancer drugs: successes, challenges and opportunities. Molecular oncology, 6(2):155–176, 2012.
    OpenUrl
  13. [13].↵
    Francesco Iorio, Theo A Knijnenburg, Daniel J Vis, Graham R Bignell, Michael P Menden, Michael Schubert, Nanne Aben, Emanuel Gonçalves, Syd Barthorpe, Howard Lightfoot, et al. A landscape of pharmacogenomic interactions in cancer. Cell, 166(3):740–754, 2016.
    OpenUrl
  14. [14].↵
    Adel Javanmard and Andrea Montanari. Confidence intervals and hypothesis testing for high- dimensional regression. Journal of Machine Learning Research, 15(1):2869–2909, 2014.
    OpenUrl
  15. [15].↵
    Yuqiu Jiang and Mu Wang. Personalized medicine in oncology: tailoring the right drug to the right patient. Biomarkers in medicine, 4(4):523–533, 2010.
    OpenUrl
  16. [16].↵
    Ian T Jolliffe. A note on the use of principal components in regression. Applied Statistics, pages 300–303, 1982.
  17. [17].↵
    Michael S Lawrence, Petar Stojanov, Paz Polak, Gregory V Kryukov, Kristian Cibulskis, Andrey Sivachenko, Scott L Carter, Chip Stewart, Craig H Mermel, Steven A Roberts, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature, 499(7457):214–218, 2013.
    OpenUrlCrossRefPubMedWeb of Science
  18. [18].↵
    Natalie Minkovsky and Alan Berezov. BIBW-2992, a dual receptor tyrosine kinase inhibitor for the treatment of solid tumors. Current opinion in investigational drugs (London, England: 2000), 9(12):1336–1346, 2008.
    OpenUrlPubMed
  19. [19].↵
    Alok Mishra and Mukesh Verma. Cancer biomarkers: are we ready for the prime time? Cancers, 2(1):190–208, 2010.
    OpenUrlCrossRefPubMed
  20. [20].↵
    A Olivieri and L Manzione. Dasatinib: a new step in molecular target therapy. Annals of Oncology, 18(suppl 6):vi42–vi46, 2007.
    OpenUrlCrossRefPubMed
  21. [21].↵
    Sreenath V Sharma, Daniel A Haber, and Jeff Settleman. Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents. Nature reviews cancer, 10(4):241–253, 2010.
    OpenUrlCrossRefPubMedWeb of Science
  22. [22].↵
    Lindsay C Spender, G John Ferguson, Sijia Liu, Chao Cui, Maria Romina Girotti, Gary Sibbet, Ellen B Higgs, Morven K Shuttleworth, Tom Hamilton, Paul Lorigan, et al. Mutational activation of BRAF confers sensitivity to transforming growth factor beta inhibitors in human cancer cells. Oncotarget, 7(50):81995–82012, 2016.
    OpenUrl
  23. [23].↵
    Michael R Stratton, Peter J Campbell, and P Andrew Futreal. The cancer genome. Nature, 458(7239):719–724, 2009.
    OpenUrlCrossRefPubMedWeb of Science
  24. [24].↵
    Niall Tebbutt, Dusan Kotasek, Howard A Burris, Lee S Schwartzberg, Herbert Hurwitz, Joe Stephenson, Douglas J Warner, Lisa Chen, Cheng-Pang Hsu, and David Goldstein. Motesanib with or without panitumumab plus FOLFIRI or FOLFOX for the treatment of metastatic colorectal cancer. Cancer chemotherapy and pharmacology, 75(5):993–1004, 2015.
    OpenUrl
  25. [25].↵
    Robert Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological), pages 267–288, 1996.
  26. [26].↵
    Luis Torgo and Lúis Torgo. Data mining with R: learning with case studies. Chapman & Hall/CRC Boca Raton, FL:, USA, 2011.
  27. [27].↵
    Sara Van de Geer, Peter Bühlmann, Yaacov Ritov, Ruben Dezeure, et al. On asymptotically optimal confidence regions and tests for high-dimensional models. The Annals of Statistics, 42(3):1166–1202, 2014.
    OpenUrlCrossRef
  28. [28].↵
    Haixia Wang, Zhifeng Zhang, Xiuping Wei, and Ruizhen Dai. Small-molecule inhibitor of Bcl-2 (TW- 37) suppresses growth and enhances cisplatin-induced apoptosis in ovarian cancer cells. Journal of ovarian research, 8(1):3, 2015.
    OpenUrl
  29. [29].↵
    L Wasserman. All of statistics: a concise course in statistical inference. 2004.
  30. [30].↵
    Eea Weisberg, P Manley, J Mestan, S Cowan-Jacob, A Ray, and JD Griffin. Amn107 (nilotinib): a novel and selective inhibitor of bcr-abl. British Journal of Cancer, 94(12):1765–1769, 2006.
    OpenUrlCrossRefPubMedWeb of Science
  31. [31].↵
    Steven Whittaker, Delphine Ménard, Ruth Kirk, Lesley Ogilvie, Douglas Hedley, Alfonso Zambon, Filipa Lopes, Natasha Preece, Helen Manne, Sareena Rana, et al. A novel, selective, and efficacious nanomolar pyridopyrazinone inhibitor of V600EBRAF. Cancer research, 70(20):8036–8044, 2010.
    OpenUrlAbstract/FREE Full Text
  32. [32].↵
    PD Williams, jk Lee, and D Theodorescu. Genomancy: predicting tumour response to cancer therapy based on the oracle of genetics. Curr Oncol, 16(1):56–58, 2009.
    OpenUrlPubMed
  33. [33].↵
    Edgar R Wood, Lee Kuyper, Kimberly G Petrov, Robert N Hunter, Philip A Harris, and Karen Lackey. Discovery and in vitro evaluation of potent TrkA kinase inhibitors: oxindole and aza- oxindoles. Bioorganic & medicinal chemistry letters, 14(4):953–957, 2004.
    OpenUrl
  34. [34].↵
    Wanjuan Yang, Jorge Soares, Patricia Greninger, Elena J Edelman, Howard Lightfoot, Simon Forbes, Nidhi Bindal, Dave Beare, James A Smith, I Richard Thompson, et al. Genomics of Drug Sensitivity in Cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic acids research, 41(D1):D955–D961, 2013.
    OpenUrlCrossRefPubMedWeb of Science
  35. [35].↵
    Wen-Lei Zhuo, Liang Zhang, Qi-Chao Xie, Bo Zhu, and Zheng-Tang Chen. Identifying differentially expressed genes and screening small molecule drugs for lapatinib-resistance of breast cancer by a bioinformatics strategy. Asian Pac J Cancer Prev, 15:10847–53, 2014.
    OpenUrl
  36. [36].↵
    Hui Zou and Trevor Hastie. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301–320, 2005.
    OpenUrlCrossRefWeb of Science
Back to top
PreviousNext
Posted September 27, 2017.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Integrative Statistical Inferences for Drug Sensitivity Biomarkers in Cancer
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Integrative Statistical Inferences for Drug Sensitivity Biomarkers in Cancer
Ehsan Ullah, Saila Shama, Noora Al Muftah, Ian Richard Thompson, Reda Rawi, Raghvendra Mall, Halima Bensmail
bioRxiv 194670; doi: https://doi.org/10.1101/194670
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Integrative Statistical Inferences for Drug Sensitivity Biomarkers in Cancer
Ehsan Ullah, Saila Shama, Noora Al Muftah, Ian Richard Thompson, Reda Rawi, Raghvendra Mall, Halima Bensmail
bioRxiv 194670; doi: https://doi.org/10.1101/194670

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genomics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4399)
  • Biochemistry (9637)
  • Bioengineering (7128)
  • Bioinformatics (24959)
  • Biophysics (12678)
  • Cancer Biology (10003)
  • Cell Biology (14406)
  • Clinical Trials (138)
  • Developmental Biology (7992)
  • Ecology (12155)
  • Epidemiology (2067)
  • Evolutionary Biology (16031)
  • Genetics (10957)
  • Genomics (14785)
  • Immunology (9911)
  • Microbiology (23750)
  • Molecular Biology (9517)
  • Neuroscience (51103)
  • Paleontology (370)
  • Pathology (1547)
  • Pharmacology and Toxicology (2694)
  • Physiology (4038)
  • Plant Biology (8700)
  • Scientific Communication and Education (1512)
  • Synthetic Biology (2406)
  • Systems Biology (6461)
  • Zoology (1350)