Abstract
Single-cell RNA sequencing (scRNA-seq) data analysis faces multiple challenges, including high dimensionality, significant noise, and data loss. To effectively address these issues, we introduce AIGS, a robust and transparent single-cell analysis framework. AIGS utilizes an intelligent gene selection method that systematically identifies the most informative genes for clustering based on the normalized mutual information between pre-learned pseudo-labels and quantified genes. Additionally, AIGS incorporates a scale-invariant distance metric to assess cell-to-cell similarity, enhancing connections between homogenous cells and ensuring more accurate and robust results. Through comprehensive comparisons with state-of-the-art techniques, AIGS demonstrates superior performance in both clustering accuracy and multi-resolution visualization quality. Our in-depth analysis of clustering and visualization results further reveals that AIGS can uncover complex, stage-specific gene expression patterns during the same developmental cell stage.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
↵* thni{at}zju.edu.cn, xyzbiocompt{at}gmail.com, jinkaixiu{at}pazhoulab.cn
pgx{at}zhejianglab.com
xuenan{at}ieee.org
gayan{at}g.ucla.edu
lith{at}zhejianglab.com
bjlistat{at}nus.edu.sg
8 List of abbreviations
- scRNA-seq
- Single Cell RNA Sequencing
- AIGS
- Analyzer with Intelligent Gene Selection
- NMI
- Normalizaed Mutual Information
- ARI
- Adjusted Rand Index
- MZT
- Maternal-Zygotic Transition
- ZGA
- Zygotic Genome Activation
- SI
- Silhouette Index
- NF
- Encompassing Neuronal Fibers
- NP
- Non-Peptidergic Nociceptors
- PEP
- Peptidergic Noci-ceptors
- TH
- Cells Containing Tyrosine Hydroxylase
- ACC
- Accuracy
- ARI
- Adjusted Rand Index
- Jaccard
- Jaccard Coefficient
- GEO
- Gene Expression Omnibus.