RT Journal Article SR Electronic T1 Probabilistic tensor decomposition extracts better latent embeddings from single-cell multiomic data JF bioRxiv FD Cold Spring Harbor Laboratory SP 2022.08.26.505382 DO 10.1101/2022.08.26.505382 A1 Ruohan Wang A1 Jianping Wang A1 Shuai Cheng Li YR 2022 UL http://biorxiv.org/content/early/2022/08/26/2022.08.26.505382.abstract AB Single-cell sequencing technology enables the simultaneous capture of multiomic data from multiple cells. The captured data can be represented by tensors, i.e., the higher-rank matrices. However, the proposed analysis tools often take the data as a collection of two-order matrices, renouncing the correspondences among the features. Consequently, we propose a probabilistic tensor decomposition framework, SCOIT, to extract embeddings from single-cell multiomic data. To deal with sparse, noisy, and heterogeneous single-cell data, we incorporate various distributions in SCOIT, including Gaussian, Poisson, and negative binomial distributions. Our framework can decompose a multiomic tensor into a cell embedding matrix, a gene embedding matrix, and an omic embedding matrix, allowing for various downstream analyses. We applied SCOIT to seven single-cell multiomic datasets from different sequencing protocols. With cell embeddings, SCOIT achieves superior performance for cell clustering compared to seven state-of-the-art tools under various metrics, demonstrating its ability to dissect cellular heterogeneity. With the gene embeddings, SCOIT enables cross-omics gene expression analysis and integrative gene regulatory network study. Furthermore, the embeddings allow cross-omics imputation simultaneously, outperforming conventional imputation methods with the Pearson correlation coefficient increased by 0.03-0.28.Competing Interest StatementThe authors have declared no competing interest.