Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Nicheformer: a foundation model for single-cell and spatial omics

View ORCID ProfileAnna C. Schaar, View ORCID ProfileAlejandro Tejada-Lapuerta, View ORCID ProfileGiovanni Palla, View ORCID ProfileRobert Gutgesell, View ORCID ProfileLennard Halle, View ORCID ProfileMariia Minaeva, View ORCID ProfileLarsen Vornholz, View ORCID ProfileLeander Dony, Francesca Drummer, View ORCID ProfileMojtaba Bahrami, View ORCID ProfileFabian J. Theis
doi: https://doi.org/10.1101/2024.04.15.589472
Anna C. Schaar
1TUM School of Computation, Information & Technology, Technical University of Munich, Garching, Germany
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anna C. Schaar
Alejandro Tejada-Lapuerta
1TUM School of Computation, Information & Technology, Technical University of Munich, Garching, Germany
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alejandro Tejada-Lapuerta
Giovanni Palla
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
3TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Giovanni Palla
Robert Gutgesell
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
4Institute for Diabetes and Obesity, Helmholtz Diabetes Center, Helmholtz Munich, 85764, Neuherberg, Germany.
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Robert Gutgesell
Lennard Halle
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lennard Halle
Mariia Minaeva
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
3TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mariia Minaeva
Larsen Vornholz
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
3TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Larsen Vornholz
Leander Dony
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
3TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
5Department Genes and Environment, Max Planck Institute of Psychiatry and International Max Planck Research School for Translational Psychiatry (IMPRS-TP), Munich, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Leander Dony
Francesca Drummer
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
6Institute for Stroke and Dementia Research, Klinikum Der Universität München, Ludwig-Maximilians-Universität, Munich, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mojtaba Bahrami
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
3TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Mojtaba Bahrami
Fabian J. Theis
1TUM School of Computation, Information & Technology, Technical University of Munich, Garching, Germany
2Institute of Computational Biology, Computational Health Center, Helmholtz Munich, Neuherberg, Germany
3TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Fabian J. Theis
  • For correspondence: [email protected]
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Tissue makeup and the corresponding orchestration of vital biological activities, ranging from development and differentiation to immune response and regeneration, rely fundamentally on the cellular microenvironment and the interactions between cells. Spatial single-cell genomics allows probing such interactions in an unbiased and, increasingly, scalable fashion. To learn a unified cell representation that accounts for local dependencies in the cellular microenvironment and the underlying cell interactions, we propose to generalize recent foundation modeling approaches for disassociated single-cell transcriptomics to the spatial omics setting.

Our model, Nicheformer, is a transformer-based foundation model that combines human and mouse dissociated single-cell and targeted spatial transcriptomics data to learn a cellular representation useful for a large variety of downstream tasks. Nicheformer is pretrained on over 57 million dissociated and 53 million spatially resolved cells across 73 tissues from both human and mouse. Subsequently, the model is fine-tuned on spatial tasks for spatial omics data to decode spatially resolved cellular information.

We demonstrate the usefulness of Nicheformer in both zero-shot-like as well as fine-tuning scenarios on a novel set of spatially-relevant downstream tasks such as spatial density prediction or niche and region label prediction. In particular, we show that Nicheformer enables the prediction of the spatial context of dissociated cells, allowing the transfer of rich spatial information to scRNA-seq datasets. We define a series of novel spatial prediction problems and observe consistent top performance of Nicheformer, demonstrating the advantage of the improved model capacity of the underlying transformer. Altogether, our large-scale resource of more than 110 million cells in a partial spatial context, together with the set of novel spatial learning tasks and the Nicheformer model itself, will pave the way for the next generation of machine-learning models for spatial single-cell analysis.

Competing Interest Statement

F.J.T. consults for Immunai Inc., Singularity Bio B.V., CytoReason Ltd, Cellarity, Curie Bio Operations, LLC and has an ownership interest in Dermagnostix GmbH and Cellarity. The other authors declare no conflict of interest.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted April 17, 2024.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Nicheformer: a foundation model for single-cell and spatial omics
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Nicheformer: a foundation model for single-cell and spatial omics
Anna C. Schaar, Alejandro Tejada-Lapuerta, Giovanni Palla, Robert Gutgesell, Lennard Halle, Mariia Minaeva, Larsen Vornholz, Leander Dony, Francesca Drummer, Mojtaba Bahrami, Fabian J. Theis
bioRxiv 2024.04.15.589472; doi: https://doi.org/10.1101/2024.04.15.589472
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Nicheformer: a foundation model for single-cell and spatial omics
Anna C. Schaar, Alejandro Tejada-Lapuerta, Giovanni Palla, Robert Gutgesell, Lennard Halle, Mariia Minaeva, Larsen Vornholz, Leander Dony, Francesca Drummer, Mojtaba Bahrami, Fabian J. Theis
bioRxiv 2024.04.15.589472; doi: https://doi.org/10.1101/2024.04.15.589472

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (6312)
  • Biochemistry
  • Biochemistry (14398)
  • Bioengineering (11012)
  • Bioinformatics (34750)
  • Biophysics (17891)
  • Cancer Biology (15009)
  • Cell Biology (21089)
  • Clinical Trials (138)
  • Developmental Biology (11335)
  • Ecology (16692)
  • Epidemiology (2067)
  • Evolutionary Biology (21045)
  • Genetics (13809)
  • Genomics (19308)
  • Immunology (14439)
  • Microbiology (33677)
  • Molecular Biology (14052)
  • Neuroscience (73400)
  • Paleontology (550)
  • Pathology (2314)
  • Pharmacology and Toxicology (3913)
  • Physiology (6197)
  • Plant Biology (12588)
  • Scientific Communication and Education (1854)
  • Synthetic Biology (3526)
  • Systems Biology (8480)
  • Zoology (1939)