Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Min-Redundancy and Max-Relevance Multi-view Feature Selection for Predicting Ovarian Cancer Survival using Multi-omics Data

View ORCID ProfileYasser El-Manzalawy, Tsung-Yu Hsieh, Manu Shivakumar, Dokyoon Kim, Vasant Honavar
doi: https://doi.org/10.1101/317982
Yasser El-Manzalawy
Pennsylvania State University;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yasser El-Manzalawy
  • For correspondence: yasser.isu@gmail.com
Tsung-Yu Hsieh
Pennsylvania State University;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Manu Shivakumar
Geisinger Health System
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dokyoon Kim
Geisinger Health System
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vasant Honavar
Pennsylvania State University;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Background: Large-scale collaborative precision medicine initiatives (e.g., The Cancer Genome Atlas (TCGA)) are yielding rich multi-omics data. Integrative analyses of the resulting multi-omics data, such as somatic mutation, copy number alteration (CNA), DNA methylation, miRNA, gene expression, and protein expression, offer the tantalizing possibilities of realizing the potential of precision medicine in cancer prevention, diagnosis, and treatment by substantially improving our understanding of underlying mechanisms as well as the discovery of novel biomarkers for different types of cancers. However, such analyses present a number of challenges, including the heterogeneity of data types, and the extreme high-dimensionality of omics data. Methods: In this study, we propose a novel framework for integrating multi-omics data based on multi-view feature selection, an emerging research problem in machine learning research. We also present a novel multi-view feature selection algorithm, MRMR-mv, which adapts the well-known Min-Redundancy and Maximum-Relevance (MRMR) single-view feature selection algorithm for the multi-view settings. Results: We report results of experiments on the task of building a predictive model of cancer survival from an ovarian cancer multi-omics dataset derived from the TCGA database. Our results suggest that multi-view models for predicting ovarian cancer survival outperform both view-specific models (i.e., models trained and tested using one multi-omics data source) and models based on two baseline data fusion methods. Conclusions: Our results demonstrate the potential of multi-view feature selection in integrative analyses and predictive modeling from multi-omics data.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted May 09, 2018.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Min-Redundancy and Max-Relevance Multi-view Feature Selection for Predicting Ovarian Cancer Survival using Multi-omics Data
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
Share
Min-Redundancy and Max-Relevance Multi-view Feature Selection for Predicting Ovarian Cancer Survival using Multi-omics Data
Yasser El-Manzalawy, Tsung-Yu Hsieh, Manu Shivakumar, Dokyoon Kim, Vasant Honavar
bioRxiv 317982; doi: https://doi.org/10.1101/317982
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
Min-Redundancy and Max-Relevance Multi-view Feature Selection for Predicting Ovarian Cancer Survival using Multi-omics Data
Yasser El-Manzalawy, Tsung-Yu Hsieh, Manu Shivakumar, Dokyoon Kim, Vasant Honavar
bioRxiv 317982; doi: https://doi.org/10.1101/317982

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (999)
  • Biochemistry (1493)
  • Bioengineering (946)
  • Bioinformatics (6830)
  • Biophysics (2429)
  • Cancer Biology (1792)
  • Cell Biology (2530)
  • Clinical Trials (106)
  • Developmental Biology (1700)
  • Ecology (2575)
  • Epidemiology (1495)
  • Evolutionary Biology (5030)
  • Genetics (3623)
  • Genomics (4636)
  • Immunology (1175)
  • Microbiology (4252)
  • Molecular Biology (1629)
  • Neuroscience (10799)
  • Paleontology (83)
  • Pathology (240)
  • Pharmacology and Toxicology (409)
  • Physiology (555)
  • Plant Biology (1458)
  • Scientific Communication and Education (412)
  • Synthetic Biology (542)
  • Systems Biology (1878)
  • Zoology (260)