PT - JOURNAL ARTICLE AU - Brian Hie AU - Bryan Bryson AU - Bonnie Berger TI - Panoramic stitching of heterogeneous single-cell transcriptomic data AID - 10.1101/371179 DP - 2018 Jan 01 TA - bioRxiv PG - 371179 4099 - http://biorxiv.org/content/early/2018/07/17/371179.short 4100 - http://biorxiv.org/content/early/2018/07/17/371179.full AB - Researchers are generating single-cell RNA sequencing (scRNA-seq) profiles of diverse biological systems1–4 and every cell type in the human body.5 Leveraging this data to gain unprecedented insight into biology and disease will require assembling heterogeneous cell populations across multiple experiments, laboratories, and technologies. Although methods for scRNA-seq data integration exist6,7, they often naively merge data sets together even when the data sets have no cell types in common, leading to results that do not correspond to real biological patterns. Here we present Scanorama, inspired by algorithms for panorama stitching, that overcomes the limitations of existing methods to enable accurate, heterogeneous scRNA-seq data set integration. Our strategy identifies and merges the shared cell types among all pairs of data sets and is orders of magnitude faster than existing techniques. We use Scanorama to combine 105,476 cells from 26 diverse scRNA-seq experiments across 9 different technologies into a single comprehensive reference, demonstrating how Scanorama can be used to obtain a more complete picture of cellular function across a wide range of scRNA-seq experiments.