Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Spotless: a reproducible pipeline for benchmarking cell type deconvolution in spatial transcriptomics

View ORCID ProfileChananchida Sang-aram, View ORCID ProfileRobin Browaeys, View ORCID ProfileRuth Seurinck, View ORCID ProfileYvan Saeys
doi: https://doi.org/10.1101/2023.03.22.533802
Chananchida Sang-aram
1Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
2Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Chananchida Sang-aram
  • For correspondence: [email protected]
Robin Browaeys
1Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
2Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Robin Browaeys
Ruth Seurinck
1Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
2Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ruth Seurinck
Yvan Saeys
1Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
2Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yvan Saeys
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Spatial transcriptomics (ST) is an emerging field that aims to profile the transcriptome of a cell while keeping its spatial context. Although the resolution of non-targeted ST technologies has been rapidly improving in recent years, most commercial methods do not yet operate at single-cell resolution. To tackle this issue, computational methods such as deconvolution can be used to infer cell type proportions in each spot by learning cell type-specific expression profiles from reference single-cell RNA-sequencing (scRNA-seq) data. Here, we benchmarked the performance of 11 deconvolution methods using 63 silver standards, three gold standards, and two case studies on liver and melanoma tissues. The silver standards were generated using our novel simulation engine synthspot, where we used seven scRNA-seq datasets to create synthetic spots that followed one of nine different biological tissue patterns. The gold standards were generated using imaging-based ST technologies at single-cell resolution. We evaluated method performance based on the root-mean-squared error, area under the precision-recall curve, and Jensen-Shannon divergence. Our evaluation revealed that method performance significantly decreases in datasets with highly abundant or rare cell types. Moreover, we evaluated the stability of each method when using different reference datasets and found that having sufficient number of genes for each cell type is crucial for good performance. We conclude that while cell2location and RCTD are the top-performing methods, a simple off-the-shelf deconvolution method surprisingly outperforms almost half of the dedicated spatial deconvolution methods. Our freely available Nextflow pipeline allows users to generate synthetic data, run deconvolution methods and optionally benchmark them on their dataset (https://github.com/saeyslab/spotless-benchmark).

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • Method section on evaluation metrics updated to include aggregation procedure

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted January 07, 2024.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Spotless: a reproducible pipeline for benchmarking cell type deconvolution in spatial transcriptomics
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Spotless: a reproducible pipeline for benchmarking cell type deconvolution in spatial transcriptomics
Chananchida Sang-aram, Robin Browaeys, Ruth Seurinck, Yvan Saeys
bioRxiv 2023.03.22.533802; doi: https://doi.org/10.1101/2023.03.22.533802
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Spotless: a reproducible pipeline for benchmarking cell type deconvolution in spatial transcriptomics
Chananchida Sang-aram, Robin Browaeys, Ruth Seurinck, Yvan Saeys
bioRxiv 2023.03.22.533802; doi: https://doi.org/10.1101/2023.03.22.533802

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (6034)
  • Biochemistry (13730)
  • Bioengineering (10460)
  • Bioinformatics (33214)
  • Biophysics (17140)
  • Cancer Biology (14199)
  • Cell Biology (20132)
  • Clinical Trials (138)
  • Developmental Biology (10875)
  • Ecology (16040)
  • Epidemiology (2067)
  • Evolutionary Biology (20365)
  • Genetics (13412)
  • Genomics (18649)
  • Immunology (13777)
  • Microbiology (32205)
  • Molecular Biology (13402)
  • Neuroscience (70166)
  • Paleontology (527)
  • Pathology (2195)
  • Pharmacology and Toxicology (3745)
  • Physiology (5884)
  • Plant Biology (12029)
  • Scientific Communication and Education (1816)
  • Synthetic Biology (3372)
  • Systems Biology (8175)
  • Zoology (1844)