Abstract
We demonstrate a novel variant calling strategy using barcode-stratified alignments on 25 tumor and normal 10XGenomics scRNA-seq datasets (>200,000 cells). Our approach identified 24,528 exonic non-dbSNP single cell expressed (sce)SNVs, a third of which are shared across multiple samples. The novel sceSNVs include unreported somatic and germline variants, as well as RNA-originating variants; some are expressed in up to 17% of the cells, and many are found in known cancer genes. Our findings suggest that there is an unacknowledged repertoire of expressed genetic variants, possibly recurrent and common across samples, in the normal and cancer transcriptome.
Competing Interest Statement
The authors have declared no competing interest.
Copyright
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.