PT - JOURNAL ARTICLE AU - Namit Kumar AU - Ryan Golhar AU - Kriti Sen Sharma AU - James L Holloway AU - Srikant Sarangi AU - Isaac Neuhaus AU - Alice M. Walsh AU - Zachary W. Pitluk TI - Rapid Single Cell Evaluation of Human Disease and Disorder Targets Using REVEAL: SingleCell<sup>™</sup> AID - 10.1101/2020.06.24.169730 DP - 2020 Jan 01 TA - bioRxiv PG - 2020.06.24.169730 4099 - http://biorxiv.org/content/early/2020/06/25/2020.06.24.169730.short 4100 - http://biorxiv.org/content/early/2020/06/25/2020.06.24.169730.full AB - Single-cell (sc) sequencing performs unbiased profiling of individual cells and enables evaluation of less prevalent cellular populations, often missed using bulk sequencing. However, the scale and the complexity of the sc datasets poses a great challenge in its utility and this problem is further exacerbated when working with larger datasets typically generated by consortium efforts. As the scale of single cell datasets continues to increase exponentially, there is an unmet technological need to develop database platforms that can evaluate key biological hypothesis by querying extensive single-cell datasets.Large single-cell datasets like human cell atlas and COVID-19 cell atlas (collection of annotated sc datasets from various human organs) are excellent resources for profiling target genes involved in human diseases and disorders ranging from oncology, auto-immunity, as well as infectious diseases like COVID-19 caused by SARS-CoV-2 virus. SARS-CoV-2 infections have led to a worldwide pandemic with massive loss of lives, infections exceeding 7 million cases. The virus uses ACE2 and TMPRSS2 as key viral entry associated proteins expressed in human cells for infections. Evaluating the expression profile of key genes in large single-cell datasets can facilitate testing for diagnostics, therapeutics and vaccine targets; as the world struggles to cope with the on-going spread of COVID-19 infections.In this manuscript we describe, REVEAL: SingleCell which enables storage, retrieval and rapid query of single-cell datasets inclusive of millions of cells. The analytical database described here enables selecting and analyzing cells across multiple studies. Cells can be selected using individual metadata tags, more complex hierarchical ontology filtering, and gene expression threshold ranges, including co-expression of multiple genes. The tags on selected cells can be further evaluated for testing biological hypothesis. One such example includes identifying the most prevalent cell type annotation tag on returned cells.We used REVEAL: SingleCell to evaluate expression of key SARS-CoV-2 entry associated genes, and queried the current database (2.2 Million cells, 32 projects) to obtain the results in &lt;60 seconds. We highlighted cells expressing COVID-19 associated genes are expressed on multiple tissue types, thus in part explains the multi-organ involvement in infected patients observed worldwide during the on-going COVID-19 pandemic.Competing Interest StatementNamit Kumar, Ryan Golhar, James L. Holloway, Brian Kidd, Alice M. Walsh, Isaac Neuhaus are employees of Bristol Myers Squibb. Kriti S. Sharma, Srikant Sarangi and Zachary Pitluk are employees of Paradigm4, Inc.