Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

The Gene Expression Deconvolution Interactive Tool (GEDIT): Accurate Cell Type Quantification from Gene Expression Data

View ORCID ProfileBrian Nadel, David Lopez, View ORCID ProfileDennis J. Montoya, Hannah Waddel, Misha M. Khan, Matteo Pellegrini
doi: https://doi.org/10.1101/728493
Brian Nadel
1Bioinformatics Interdepartmental Degree Program, University of California Los Angeles, Los Angeles, CA
2Molecular Biology Institute, Department of Molecular Cellular and Developmental Biology, and Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Brian Nadel
  • For correspondence: brian.nadel@gmail.com
David Lopez
2Molecular Biology Institute, Department of Molecular Cellular and Developmental Biology, and Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Dennis J. Montoya
2Molecular Biology Institute, Department of Molecular Cellular and Developmental Biology, and Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Dennis J. Montoya
Hannah Waddel
4Department of Mathematics, University of Utah, Salt Lake City, UT
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Misha M. Khan
5Departments of Biology and Computer Science, Swarthmore College, Swarthmore, PA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matteo Pellegrini
2Molecular Biology Institute, Department of Molecular Cellular and Developmental Biology, and Institute for Genomics and Proteomics, University of California Los Angeles, Los Angeles, CA
3Department of Dermatology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

The cell type composition of heterogeneous tissue samples can be a critical variable in both clinical and laboratory settings. However, current experimental methods of cell type quantification (e.g. cell flow cytometry) are costly, time consuming and can introduce bias. Computational approaches that infer cell type abundance from expression data offer an alternate solution. While these methods have gained popularity, most are limited to predicting hematopoietic cell types and do not produce accurate predictions for stromal cell types. Many are also limited to particular platforms, whether RNA-Seq or specific microarray models. To overcome these limitations, we present the Gene Expression Deconvolution Interactive Tool, or GEDIT. Using simulated and experimental data, we demonstrate that GEDIT produces accurate results for both stromal and hematopoietic cell types. Moreover, GEDIT is capable of producing inputs using RNA-Seq data, microarray data, or a combination of the two. Finally, we provide reference data from 7 sources spanning a wide variety of stromal and hematopoietic types. GEDIT also accepts user submitted reference data, thus allowing deconvolution of any cell type, provided that accurate reference data is available.

Author Summary The Gene Expression Deconvolution Interactive Tool (GEDIT) is a software tool that uses gene expression data to estimate cell type abundances. The tool accepts expression data collected from blood or tissue samples and sequenced using either RNA-Seq or microarray technology. GEDIT also requires reference data describing the expression profile of purified cell types. Several reference matrices are provided with this publication and on the tool’s website (webtools.mcdb.ucla.edu), and the user also has the option to supply their own. The tool then applies a linear regression to predict which cell types are present in the tissue sample, and in what proportions. GEDIT applies several novel techniques and outperforms other tools on test data.

Footnotes

  • ImmQuant was previously incorrectly listed as "ImmuneQuant", it's compatible platforms specified, and a newer version of GEDIT included in the supplements

  • http://webtools.mcdb.ucla.edu/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted August 13, 2019.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The Gene Expression Deconvolution Interactive Tool (GEDIT): Accurate Cell Type Quantification from Gene Expression Data
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The Gene Expression Deconvolution Interactive Tool (GEDIT): Accurate Cell Type Quantification from Gene Expression Data
Brian Nadel, David Lopez, Dennis J. Montoya, Hannah Waddel, Misha M. Khan, Matteo Pellegrini
bioRxiv 728493; doi: https://doi.org/10.1101/728493
Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
Citation Tools
The Gene Expression Deconvolution Interactive Tool (GEDIT): Accurate Cell Type Quantification from Gene Expression Data
Brian Nadel, David Lopez, Dennis J. Montoya, Hannah Waddel, Misha M. Khan, Matteo Pellegrini
bioRxiv 728493; doi: https://doi.org/10.1101/728493

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (2633)
  • Biochemistry (5221)
  • Bioengineering (3643)
  • Bioinformatics (15711)
  • Biophysics (7213)
  • Cancer Biology (5593)
  • Cell Biology (8045)
  • Clinical Trials (138)
  • Developmental Biology (4735)
  • Ecology (7462)
  • Epidemiology (2059)
  • Evolutionary Biology (10520)
  • Genetics (7698)
  • Genomics (10082)
  • Immunology (5148)
  • Microbiology (13823)
  • Molecular Biology (5354)
  • Neuroscience (30577)
  • Paleontology (211)
  • Pathology (871)
  • Pharmacology and Toxicology (1519)
  • Physiology (2234)
  • Plant Biology (4983)
  • Scientific Communication and Education (1036)
  • Synthetic Biology (1379)
  • Systems Biology (4130)
  • Zoology (803)