Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Venice: A New Algorithm for Finding Marker Genes in Single-Cell Transcriptomic Data

Hy Vuong, Thao Truong, Tan Phan, Son Pham
doi: https://doi.org/10.1101/2020.11.16.384479
Hy Vuong
1BioTuring Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thao Truong
1BioTuring Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tan Phan
1BioTuring Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Son Pham
1BioTuring Inc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: sonpham@bioturing.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Preview PDF
Loading

Abstract

Most widely used tools for finding marker genes in single cell data (SeuratT/NegBinom/Poisson, CellRanger, EdgeR, limmatrend) use a conventional definition of differentially expressed genes: genes with different mean expression values. However, in single-cell data, a cell population can be a mixture of many cell types/cell states, hence the mean expression of genes cannot represent the whole population. In addition, these tools assume that gene expression of a population belongs to a specific family of distribution. This assumption is often violated in single-cell data. In this work, we define marker genes of a cell population as genes that can be used to distinguish cells in the population from cells in other populations. Besides log-fold change, we devise a new metric to classify genes into up-regulated, down-regulated, and transitional states. In a benchmark for finding up-regulated and down-regulated genes, our tool outperforms all compared methods, including Seurat, ROTS, scDD, edgeR, MAST, limma, normal t-test, Wilcoxon and Kolmogorov–Smirnov test. Our method is much faster than all compared methods, therefore, enables interactive analysis for large single-cell data sets in BioTuring Browser. Venice algorithm is available within Signac package: https://github.com/bioturing/signac 1).

Competing Interest Statement

All authors of the paper are supported by BioTuring Inc, a bioinformatics company.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted November 17, 2020.
Download PDF
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Venice: A New Algorithm for Finding Marker Genes in Single-Cell Transcriptomic Data
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Venice: A New Algorithm for Finding Marker Genes in Single-Cell Transcriptomic Data
Hy Vuong, Thao Truong, Tan Phan, Son Pham
bioRxiv 2020.11.16.384479; doi: https://doi.org/10.1101/2020.11.16.384479
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Venice: A New Algorithm for Finding Marker Genes in Single-Cell Transcriptomic Data
Hy Vuong, Thao Truong, Tan Phan, Son Pham
bioRxiv 2020.11.16.384479; doi: https://doi.org/10.1101/2020.11.16.384479

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (4225)
  • Biochemistry (9101)
  • Bioengineering (6750)
  • Bioinformatics (23941)
  • Biophysics (12087)
  • Cancer Biology (9493)
  • Cell Biology (13738)
  • Clinical Trials (138)
  • Developmental Biology (7614)
  • Ecology (11659)
  • Epidemiology (2066)
  • Evolutionary Biology (15477)
  • Genetics (10616)
  • Genomics (14293)
  • Immunology (9460)
  • Microbiology (22774)
  • Molecular Biology (9069)
  • Neuroscience (48851)
  • Paleontology (354)
  • Pathology (1479)
  • Pharmacology and Toxicology (2564)
  • Physiology (3822)
  • Plant Biology (8308)
  • Scientific Communication and Education (1467)
  • Synthetic Biology (2289)
  • Systems Biology (6171)
  • Zoology (1297)