RT Journal Article SR Electronic T1 scPNMF: sparse gene encoding of single cells to facilitate gene selection for targeted gene profiling JF bioRxiv FD Cold Spring Harbor Laboratory SP 2021.02.09.430550 DO 10.1101/2021.02.09.430550 A1 Dongyuan Song A1 Kexin Aileen Li A1 Zachary Hemminger A1 Roy Wollman A1 Jingyi Jessica Li YR 2021 UL http://biorxiv.org/content/early/2021/02/11/2021.02.09.430550.abstract AB Single-cell RNA sequencing (scRNA-seq) captures whole transcriptome information of individual cells. While scRNA-seq measures thousands of genes, researchers are often interested in only dozens to hundreds of genes for a closer study. Then a question is how to select those informative genes from scRNA-seq data. Moreover, single-cell targeted gene profiling technologies are gaining popularity for their low costs, high sensitivity, and extra (e.g., spatial) information; however, they typically can only measure up to a few hundred genes. Then another challenging question is how to select genes for targeted gene profiling based on existing scRNA-seq data. Here we develop the single-cell Projective Non-negative Matrix Factorization (scPNMF) method to select informative genes from scRNA-seq data in an unsupervised way. Compared with existing gene selection methods, scPNMF has two advantages. First, its selected informative genes can better distinguish cell types. Second, it enables the alignment of new targeted gene profiling data with reference data in a low-dimensional space to facilitate the prediction of cell types in the new data. Technically, scPNMF modifies the PNMF algorithm for gene selection by changing the initialization and adding a basis selection step, which selects informative bases to distinguish cell types. We demonstrate that scPNMF outperforms the state-of-the-art gene selection methods on diverse scRNA-seq datasets. Moreover, we show that scPNMF can guide the design of targeted gene profiling experiments and cell-type annotation on targeted gene profiling data.Competing Interest StatementThe authors have declared no competing interest.