Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

scCapsNet: a deep learning classifier with the capability of interpretable feature extraction, applicable for single cell RNA data analysis

Lifei Wang, Rui Nie, Ruyue Xin, Jiang Zhang, Jun Cai
doi: https://doi.org/10.1101/506642
Lifei Wang
aKey Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
bUniversity of Chinese Academy of Sciences, Beijing, 100049, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rui Nie
aKey Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
bUniversity of Chinese Academy of Sciences, Beijing, 100049, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ruyue Xin
cSchool of Systems Science, Beijing Normal University, Beijing, 100875, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jiang Zhang
cSchool of Systems Science, Beijing Normal University, Beijing, 100875, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: zhangjiang@bnu.edu.cn juncai@big.ac.cn
Jun Cai
aKey Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
bUniversity of Chinese Academy of Sciences, Beijing, 100049, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: zhangjiang@bnu.edu.cn juncai@big.ac.cn
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Recently deep learning methods have been applied to process biological data and greatly pushed the development of the biological research forward. However, the interpretability of the deep learning methods still needs to improve. Here for the first time, we present scCapsNet, a totally interpretable deep learning model adapted from CapsNet. The scCapsNet model retains the capsule parts of CapsNet but replaces the part of convolutional neural networks with several parallel fully connected neural networks. We apply scCapsNet to scRNA-seq data. The results show that scCapsNet performs well as a classifier and also that the parallel fully connected neural networks function like feature extractors as we supposed. The scCapsNet model provides contribution of each extracted feature to the cell type recognition. Evidences show that some extracted features are nearly orthogonal to each other. After training, through analysis of the internal weights of each neural network connected inputs and primary capsule, and with the information about the contribution of each extracted feature to the cell type recognition, the scCapsNet model could relate gene sets from inputs to cell types. The specific gene set is responsible for the identification of its corresponding cell types but does not affect the recognition of other cell types by the model. Many well-studied cell type markers are in the gene set with corresponding cell type. The internal weights of neural network for those well-studied cell type markers are different for different primary capsules. The internal weights of neural network connected to a primary capsule could be viewed as an embedding for genes, convert genes to real value low dimensional vectors. Furthermore, we mix the RNA expression data of two cells with different cell types and then use the scCapsNet model trained with non-mixed data to predict the cell types in the mixed data. Our scCapsNet model could predict cell types in a cell mixture with high accuracy.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted May 21, 2019.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
scCapsNet: a deep learning classifier with the capability of interpretable feature extraction, applicable for single cell RNA data analysis
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
scCapsNet: a deep learning classifier with the capability of interpretable feature extraction, applicable for single cell RNA data analysis
Lifei Wang, Rui Nie, Ruyue Xin, Jiang Zhang, Jun Cai
bioRxiv 506642; doi: https://doi.org/10.1101/506642
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
scCapsNet: a deep learning classifier with the capability of interpretable feature extraction, applicable for single cell RNA data analysis
Lifei Wang, Rui Nie, Ruyue Xin, Jiang Zhang, Jun Cai
bioRxiv 506642; doi: https://doi.org/10.1101/506642

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4234)
  • Biochemistry (9129)
  • Bioengineering (6782)
  • Bioinformatics (23999)
  • Biophysics (12124)
  • Cancer Biology (9534)
  • Cell Biology (13776)
  • Clinical Trials (138)
  • Developmental Biology (7635)
  • Ecology (11699)
  • Epidemiology (2066)
  • Evolutionary Biology (15509)
  • Genetics (10644)
  • Genomics (14324)
  • Immunology (9480)
  • Microbiology (22836)
  • Molecular Biology (9089)
  • Neuroscience (48987)
  • Paleontology (355)
  • Pathology (1482)
  • Pharmacology and Toxicology (2570)
  • Physiology (3845)
  • Plant Biology (8331)
  • Scientific Communication and Education (1471)
  • Synthetic Biology (2296)
  • Systems Biology (6190)
  • Zoology (1301)