Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

MGPfactXMBD: A Model-Based Factorization Method for scRNA Data Unveils Bifurcating Transcriptional Modules Underlying Cell Fate Determination

Jun Ren, Ying Zhou, Yudi Hu, Jing Yang, Hongkun Fang, Xuejing Lyu, Jintao Guo, Xiaodong Shi, View ORCID ProfileQiyuan Li
doi: https://doi.org/10.1101/2024.04.02.587768
Jun Ren
1School of Informatics, Xiamen University, Xiamen, 361105, China
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
3Department of Hematology, The First Affiliated Hospital of Xiamen University and Institute of Hematology, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ying Zhou
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
3Department of Hematology, The First Affiliated Hospital of Xiamen University and Institute of Hematology, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yudi Hu
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jing Yang
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hongkun Fang
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xuejing Lyu
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jintao Guo
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiaodong Shi
1School of Informatics, Xiamen University, Xiamen, 361105, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Qiyuan Li
2National Institute for Data Science in Health and Medicine, School of Medicine, Xiamen University, Xiamen, 361102, China
3Department of Hematology, The First Affiliated Hospital of Xiamen University and Institute of Hematology, School of Medicine, Xiamen University, Xiamen, 361102, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Qiyuan Li
  • For correspondence: [email protected]
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

Manifold-learning is particularly useful to resolve the complex cellular state space from single-cell RNA sequences. While current manifold-learning methods provide insights into cell fate by inferring graph-based trajectory at cell level, challenges remain to retrieve interpretable biology underlying the diverse cellular states. Here, we described MGPfactXMBD, a model-based manifold-learning framework and capable to factorize complex development trajectories into independent bifurcation processes of gene sets, and thus enables trajectory inference based on relevant features. MGPfactXMBD offers more nuanced understanding of the biological processes underlying cellular trajectories with potential determinants. When bench-tested across 239 datasets, MGPfactXMBD showed advantages in major quantity-control metrics, such as branch division accuracy and trajectory topology, outperforming most established methods. In real datasets, MGPfactXMBD recovered the critical pathways and cell types in microglia development with experimentally valid regulons and markers. Furthermore, MGPfactXMBD discovered evolutionary trajectories of tumor-associated CD8+ T cells and yielded new subtypes of CD8+ T cells with gene expression signatures significantly predictive of the responses to immune checkpoint inhibitor in independent cohorts. In summary, MGPfactXMBD offers a manifold-learning framework in scRNA-seq data which enables feature selection for specific biological processes and contributing to advance our understanding of biological determination of cell fate.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • Real data trajectory comparison charts added to the benchmark analysis to demonstrate MGPfact's performance. Pseudotime acquisition detailed in the methods section. Role of MURP downsampling highlighted through comparative experiments. Additional content on synthetic datasets and consensus trajectory generation included in the introduction and discussion sections. Structure and presentation of the manuscript adjusted according to journal publication standards and reviewers' suggestions.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted October 27, 2024.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
MGPfactXMBD: A Model-Based Factorization Method for scRNA Data Unveils Bifurcating Transcriptional Modules Underlying Cell Fate Determination
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
MGPfactXMBD: A Model-Based Factorization Method for scRNA Data Unveils Bifurcating Transcriptional Modules Underlying Cell Fate Determination
Jun Ren, Ying Zhou, Yudi Hu, Jing Yang, Hongkun Fang, Xuejing Lyu, Jintao Guo, Xiaodong Shi, Qiyuan Li
bioRxiv 2024.04.02.587768; doi: https://doi.org/10.1101/2024.04.02.587768
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
MGPfactXMBD: A Model-Based Factorization Method for scRNA Data Unveils Bifurcating Transcriptional Modules Underlying Cell Fate Determination
Jun Ren, Ying Zhou, Yudi Hu, Jing Yang, Hongkun Fang, Xuejing Lyu, Jintao Guo, Xiaodong Shi, Qiyuan Li
bioRxiv 2024.04.02.587768; doi: https://doi.org/10.1101/2024.04.02.587768

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Bioinformatics
Subject Areas
All Articles
  • Animal Behavior and Cognition (6029)
  • Biochemistry (13714)
  • Bioengineering (10449)
  • Bioinformatics (33189)
  • Biophysics (17124)
  • Cancer Biology (14193)
  • Cell Biology (20129)
  • Clinical Trials (138)
  • Developmental Biology (10869)
  • Ecology (16029)
  • Epidemiology (2067)
  • Evolutionary Biology (20351)
  • Genetics (13401)
  • Genomics (18637)
  • Immunology (13767)
  • Microbiology (32177)
  • Molecular Biology (13395)
  • Neuroscience (70110)
  • Paleontology (527)
  • Pathology (2194)
  • Pharmacology and Toxicology (3745)
  • Physiology (5874)
  • Plant Biology (12024)
  • Scientific Communication and Education (1815)
  • Synthetic Biology (3368)
  • Systems Biology (8170)
  • Zoology (1842)