PT - JOURNAL ARTICLE AU - Cheng Jia AU - Derek Kelly AU - Junhyong Kim AU - Mingyao Li AU - Nancy R. Zhang TI - Accounting for technical noise in Single-cell RNA sequencing analysis AID - 10.1101/116939 DP - 2017 Jan 01 TA - bioRxiv PG - 116939 4099 - http://biorxiv.org/content/early/2017/03/15/116939.short 4100 - http://biorxiv.org/content/early/2017/03/15/116939.full AB - Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq.