Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Percolate: an exponential family JIVE model to design DNA-based predictors of drug response

View ORCID ProfileSoufiane M.C. Mourragui, View ORCID ProfileMarco Loog, View ORCID ProfileMirrelijn van Nee, View ORCID ProfileMark A van de Wiel, View ORCID ProfileMarcel J.T. Reinders, View ORCID ProfileLodewyk F.A. Wessels
doi: https://doi.org/10.1101/2022.09.11.507473
Soufiane M.C. Mourragui
1Computational Cancer Biology, Division of Molecular Carcinogenesis, Oncode Institute, the Netherlands Cancer Institute, Amsterdam, The Netherlands
2Faculty of EEMCS, Delft University of Technology, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Soufiane M.C. Mourragui
Marco Loog
2Faculty of EEMCS, Delft University of Technology, The Netherlands
3Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marco Loog
Mirrelijn van Nee
4Department of Epidemiology and Biostatistics, Amsterdam Public Health research institute, Amsterdam University medical centers, Amsterdam, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mirrelijn van Nee
Mark A van de Wiel
4Department of Epidemiology and Biostatistics, Amsterdam Public Health research institute, Amsterdam University medical centers, Amsterdam, The Netherlands
5MRC Biostatistics Unit, University of Cambridge, Cambridge, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mark A van de Wiel
Marcel J.T. Reinders
2Faculty of EEMCS, Delft University of Technology, The Netherlands
6Leiden Computational Biology Center, Leiden University Medical Center, Leiden, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Marcel J.T. Reinders
  • For correspondence: l.wessels@nki.nl m.j.t.reinders@tudelft.nl
Lodewyk F.A. Wessels
1Computational Cancer Biology, Division of Molecular Carcinogenesis, Oncode Institute, the Netherlands Cancer Institute, Amsterdam, The Netherlands
2Faculty of EEMCS, Delft University of Technology, The Netherlands
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lodewyk F.A. Wessels
  • For correspondence: l.wessels@nki.nl m.j.t.reinders@tudelft.nl
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Motivation Anti-cancer drugs may elicit resistance or sensitivity through mechanisms which involve several genomic layers. Nevertheless, we have demonstrated that gene expression contains most of the predictive capacity compared to the remaining omic data types. Unfortunately, this comes at a price: gene expression biomarkers are often hard to interpret and show poor robustness.

Results To capture the best of both worlds, i.e. the accuracy of gene expression and the robustness of other genomic levels, such as mutations, copy-number or methylation, we developed Percolate, a computational approach which extracts the joint signal between gene expression and the other omic data types. We developed an out-of-sample extension of Percolate which allows predictions on unseen samples without the necessity to recompute the joint signal on all data. We employed Percolate to extract the joint signal between gene expression and either mutations, copy-number or methylation, and used the out-of sample extension to perform response prediction on unseen samples. We showed that the joint signal recapitulates, and sometimes exceeds, the predictive performance achieved with each data type individually. Importantly, molecular signatures created by Percolate do not require gene expression to be evaluated, rendering them suitable to clinical applications where only one data type is available.

Availability Percolate is available as a Python 3.7 package and the scripts to reproduce the results are available here.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • We added a few elements to the methodology: - A more thorough definition of saturated parameters. - A general theorem showing the convergence of our GLM-PCA algorithm. - Generally, a more mathematically-grounded version of the method

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted November 07, 2022.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Percolate: an exponential family JIVE model to design DNA-based predictors of drug response
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Percolate: an exponential family JIVE model to design DNA-based predictors of drug response
Soufiane M.C. Mourragui, Marco Loog, Mirrelijn van Nee, Mark A van de Wiel, Marcel J.T. Reinders, Lodewyk F.A. Wessels
bioRxiv 2022.09.11.507473; doi: https://doi.org/10.1101/2022.09.11.507473
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Percolate: an exponential family JIVE model to design DNA-based predictors of drug response
Soufiane M.C. Mourragui, Marco Loog, Mirrelijn van Nee, Mark A van de Wiel, Marcel J.T. Reinders, Lodewyk F.A. Wessels
bioRxiv 2022.09.11.507473; doi: https://doi.org/10.1101/2022.09.11.507473

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4383)
  • Biochemistry (9599)
  • Bioengineering (7092)
  • Bioinformatics (24864)
  • Biophysics (12615)
  • Cancer Biology (9957)
  • Cell Biology (14354)
  • Clinical Trials (138)
  • Developmental Biology (7949)
  • Ecology (12107)
  • Epidemiology (2067)
  • Evolutionary Biology (15989)
  • Genetics (10925)
  • Genomics (14743)
  • Immunology (9869)
  • Microbiology (23676)
  • Molecular Biology (9485)
  • Neuroscience (50872)
  • Paleontology (369)
  • Pathology (1539)
  • Pharmacology and Toxicology (2683)
  • Physiology (4015)
  • Plant Biology (8657)
  • Scientific Communication and Education (1509)
  • Synthetic Biology (2397)
  • Systems Biology (6436)
  • Zoology (1346)