PT - JOURNAL ARTICLE AU - Wang, Jingtao AU - Fonseca, Gregory AU - Ding, Jun TI - scSemiProfiler: Advancing Large-scale Single-cell Studies through Semi-profiling with Deep Generative Models and Active Learning AID - 10.1101/2023.11.20.567929 DP - 2024 Jan 01 TA - bioRxiv PG - 2023.11.20.567929 4099 - http://biorxiv.org/content/early/2024/05/02/2023.11.20.567929.short 4100 - http://biorxiv.org/content/early/2024/05/02/2023.11.20.567929.full AB - Single-cell sequencing is a crucial tool for dissecting the cellular intricacies of complex diseases. Its prohibitive cost, however, hampers its application in expansive biomedical studies. Traditional cellular deconvolution approaches can infer cell type proportions from more affordable bulk sequencing data, yet they fall short in providing the detailed resolution required for single-cell-level analyses. To overcome this challenge, we introduce “scSemiProfiler”, an innovative computational framework that marries deep generative models with active learning strategies. This method adeptly infers single-cell profiles across large cohorts by fusing bulk sequencing data with targeted single-cell sequencing from a few rigorously chosen representatives. Extensive validation across heterogeneous datasets verifies the precision of our semi-profiling approach, aligning closely with true single-cell profiling data and empowering refined cellular analyses. Originally developed for extensive disease cohorts, “scSemiProfiler” is adaptable for broad applications. It provides a scalable, cost-effective solution for single-cell profiling, facilitating in-depth cellular investigation in various biological domains.Competing Interest StatementThe authors have declared no competing interest.