TY  - JOUR
T1  - Bamgineer: Introduction of simulated allele-specific copy number variants into exome and targeted sequence data sets
JF  - bioRxiv
DO  - 10.1101/119636
SP  - 119636
AU  - Soroush Samadian
AU  - Jeff P. Bruce
AU  - Trevor J. Pugh
Y1  - 2017/01/01
UR  - http://biorxiv.org/content/early/2017/03/22/119636.abstract
N2  - Somatic copy number variations (CNVs) play a crucial role in the development of many human cancers. The broad availability of next-generation sequencing (NGS) data has enabled the development of algorithms to computationally infer CNV profiles from a variety of data types, including exome and targeted sequence data, currently the most prevalent types of cancer genomics data. However, systematic evaluation and comparison of these tools remains challenging due to a lack of ground truth reference sets. To address this need, we have developed Bamgineer, a tool written in Python to introduce user-defined haplotype-phased allele-specific copy number events into an existing Binary Alignment Map (BAM) file, with a focus on targeted and exome sequencing experiments. As input, this tool requires a read alignment file (BAM format), lists of non-overlapping genome coordinates for introduction of gains and losses (BED format), and an optional file defining known haplotypes (VCF format). To improve runtime performance, Bamgineer introduces the desired CNVs using queuing and parallel processing on a local machine or on a high-performance computing cluster. As proof-of-principle, we applied Bamgineer to a single high-coverage (mean: 220X) exome sequence file from a blood sample to simulate copy number profiles of 3 exemplar tumours from each of 10 tumour types at 5 tumour cellularity levels (20-100%, 150 BAM files in total). In addition to these reference sets, we expect Bamgineer to be of use for systematic benchmarking of CNV calling algorithms by users working with their own data and expected tumour content for a variety of applications. The source code and reference datasets are freely available at http://github.org/pughlab/bamgineer.
ER  - 