%0 Journal Article %A Jiami Han %A Raphael Kuhn %A Chrysa Papadopoulou %A Andreas Agrafiotis %A Victor Kreiner %A Danielle Shlesinger %A Raphael Dizerens %A Kai-Lin Hong %A Cédric Weber %A Victor Greiff %A Annette Oxenius %A Sai T. Reddy %A Alexander Yermanos %T Echidna: integrated simulations of single-cell immune receptor repertoires and transcriptomes %D 2021 %R 10.1101/2021.07.17.452792 %J bioRxiv %P 2021.07.17.452792 %X Single-cell sequencing now enables the recovery of full-length immune repertoires [B cell receptor (BCR) and T cell receptor (TCR) repertoires], in addition to gene expression information. The feature-rich datasets produced from such experiments require extensive and diverse computational analyses, each of which can significantly influence the downstream immunological interpretations, such as clonal selection and expansion. Simulations produce validated standard datasets, where the underlying generative model can be precisely defined and furthermore perturbed to investigate specific questions of interest. Currently, there is no tool that can be used to simulate a comprehensive ground truth single-cell dataset that incorporates both immune receptor repertoires and gene expression. Therefore, we developed Echidna, an R package that simulates immune receptors and transcriptomes at single-cell resolution. Our simulation tool generates annotated single-cell sequencing data with user-tunable parameters controlling a wide range of features such as clonal expansion, germline gene usage, somatic hypermutation, and transcriptional phenotypes. Echidna can additionally simulate time-resolved B cell evolution, producing mutational networks with complex selection histories incorporating class-switching and B cell subtype information. Finally, we demonstrate the benchmarking potential of Echidna by simulating clonal lineages and comparing the known simulated networks with those inferred from only the BCR sequences as input. Together, Echidna provides a framework that can incorporate experimental data to simulate single-cell immune repertoires to aid software development and bioinformatic benchmarking of clonotyping, phylogenetics, transcriptomics and machine learning strategies.Competing Interest StatementThe authors have declared no competing interest. %U https://www.biorxiv.org/content/biorxiv/early/2021/07/19/2021.07.17.452792.full.pdf