PT - JOURNAL ARTICLE AU - Yun-Ching Chen AU - Fayaz Seifuddin AU - Cu Nguyen AU - Zhaowei Yang AU - Wanqiu Chen AU - Chunhua Yan AU - Qingrong Chen AU - Charles Wang AU - Wenming Xiao AU - Mehdi Pirooznia AU - Daoud Meerzaman AU - The Somatic Mutation Working Group of the SEQC-II Consortium TI - Comprehensive Assessment of Somatic Copy Number Variation Calling Using Next-Generation Sequencing Data AID - 10.1101/2021.02.18.431906 DP - 2021 Jan 01 TA - bioRxiv PG - 2021.02.18.431906 4099 - http://biorxiv.org/content/early/2021/02/22/2021.02.18.431906.short 4100 - http://biorxiv.org/content/early/2021/02/22/2021.02.18.431906.full AB - Copy number variation (CNV) is a common type of mutation that often drives cancer progression. With advances in next-generation sequencing (NGS), CNVs can be detected in a detailed manner via newly developed computational tools but quality of such CNV calls has not been carefully evaluated. We analyzed CNV calls reported by 6 cutting-edge callers for 91 samples which were derived from the same cancer cell line, prepared and sequenced by varying the following factors: type of tissue sample (Fresh vs. Formalin Fixed Paraffin Embedded (FFPE)), library DNA amount, tumor purity, sequencing platform (Whole-Genome Sequencing (WGS) versus Whole-Exome Sequencing (WES)), and sequencing coverage. We found that callers greatly determined the pattern of CNV calls. Calling quality was drastically impaired by low purity (<50%) and became variable when WES, FFPE, and medium purity (50%-75%) were applied. Effects of low DNA amount and low coverage were relatively minor. Our analysis demonstrates the limitation of benchmarking somatic CNV callers when the real ground truth is not available. Our comprehensive analysis has further characterized each caller with respect to confounding factors and examined the consistency of CNV calls, thereby providing guidelines for conducting somatic CNV analysis.Competing Interest StatementThe authors have declared no competing interest.