PT - JOURNAL ARTICLE
AU - Min Dai
AU - Xiaobing Pei
AU - Xiu-Jie Wang
TI - Accurate and fast cell marker gene identification with COSG
AID - 10.1101/2021.06.15.448484
DP - 2021 Jan 01
TA - bioRxiv
PG - 2021.06.15.448484
4099 - http://biorxiv.org/content/early/2021/06/16/2021.06.15.448484.short
4100 - http://biorxiv.org/content/early/2021/06/16/2021.06.15.448484.full
AB - Accurate cell classification is the groundwork for downstream analysis of single-cell sequencing data, yet identifying marker genes that distinguish different cell types remains a major challenge. We developed COSG, a cosine similarity-based method for more accurate and scalable marker gene identification. COSG is applicable to single-cell RNA sequencing data, single-cell ATAC sequencing data and spatially resolved transcriptome data. COSG is fast and scales to ultra-large datasets of million-scale cells. Application to both simulated and real experimental datasets demonstrates the superior performance of COSG in both accuracy and efficiency compared with other available methods. Marker genes or genomic regions identified by COSG are more indicative and have greater cell-type specificity. Competing Interest Statement: The authors have declared no competing interest.
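
The abstract describes COSG only as "cosine similarity-based" and does not give its scoring formula. The sketch below is therefore not the authors' algorithm. It illustrates one simple way the general idea could work, under the assumption that each gene's per-cell expression vector is compared with an idealized 0/1 indicator vector for a cell group. The function names (`cosine`, `rank_markers`), the gene names, and the toy values are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_markers(expression, labels, top_n=1):
    """Rank genes per cell group by similarity to the group's indicator vector.

    expression: dict mapping gene name -> list of per-cell expression values
    labels:     list with one group label per cell, in the same cell order
    Returns a dict mapping group -> list of (gene, score), best first.
    """
    ranked = {}
    for group in sorted(set(labels)):
        # Idealized marker (an assumption of this sketch): expressed in every
        # cell of the group and in no cell outside it.
        indicator = [1.0 if lab == group else 0.0 for lab in labels]
        scores = [(gene, cosine(values, indicator))
                  for gene, values in expression.items()]
        scores.sort(key=lambda pair: pair[1], reverse=True)
        ranked[group] = scores[:top_n]
    return ranked


# Toy data (made up for illustration): six cells in two groups.
labels = ["T", "T", "T", "B", "B", "B"]
expression = {
    "gene_a": [5.0, 4.0, 6.0, 0.0, 0.0, 0.0],  # expressed only in group T
    "gene_b": [0.0, 0.0, 1.0, 7.0, 6.0, 8.0],  # expressed mostly in group B
    "gene_c": [3.0, 3.0, 3.0, 3.0, 3.0, 3.0],  # expressed uniformly everywhere
}
markers = rank_markers(expression, labels)
for group, hits in markers.items():
    print(group, hits)
```

In this toy example the uniformly expressed `gene_c` scores lower for either group than the group-restricted genes, which is the specificity property the abstract emphasizes. Cosine similarity depends on the direction of the expression vector and not on its magnitude, so under this assumption a gene is rewarded for where it is expressed, not for how strongly.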