Abstract
Multi-sample single-cell multi-omics datasets, which simultaneously measure multiple data modalities in the same cells and in multiple samples, facilitate the study of gene expression and gene regulatory activities on a population scale. Existing integration methods can integrate either multiple samples or multiple modalities, but not both simultaneously. To address this limitation, we developed Smmit, a computational pipeline that leverages existing integration methods to simultaneously integrate both samples and modalities and produces a unified representation of reduced dimensions. We demonstrate Smmit’s capability to integrate information across samples and modalities while preserving cell type differences in two real datasets. Smmit is an R software package that is freely available at Github: https://github.com/zji90/Smmit
Competing Interest Statement
The authors have declared no competing interest.