RT Journal Article SR Electronic T1 Evaluation of Cell Type Deconvolution R Packages on Single Cell RNA-seq Data JF bioRxiv FD Cold Spring Harbor Laboratory SP 827139 DO 10.1101/827139 A1 Qianhui Huang A1 Yu Liu A1 Yuheng Du A1 Lana X. Garmire YR 2019 UL http://biorxiv.org/content/early/2019/11/01/827139.abstract AB Annotating cell types is a critical step in single cell RNA-Seq (scRNA-Seq) data analysis. Some deconvolution methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking to provide practical guidelines. Moreover, it is not clear whether some deconvolution methods originally designed for analyzing other omics data are adaptable to scRNA-Seq analysis. In this study, we evaluated ten cell-type deconvolution methods publicly available as R packages. Eight of them are popular methods developed specifically for single cell research (Seurat, scmap, SingleR, CHETAH, SingleCellNet, scID, Garnett, SCINA). The other two methods are repurposed from deconvoluting DNA methylation data: Linear Constrained Projection (CP) and Robust Partial Correlations (RPC). We conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as simulation data. We assessed the accuracy through intra-dataset and inter-dataset predictions, the robustness over practical challenges such as gene filtering and high similarity among cell types, as well as the capabilities on rare and unknown cell-type detection. Overall, methods such as Seurat, SingleR, CP, RPC and SingleCellNet performed well, with Seurat being the best at annotating major cell types. Also, Seurat, SingleR and CP are more robust against down-sampling. However, Seurat does have a major drawback at predicting rare cell populations, and it is suboptimal at differentiating cell types that are highly similar to each other, while SingleR and CP are much better in these aspects.