PT - JOURNAL ARTICLE AU - Xuefang Zhao AU - Alexandra M. Weber AU - Ryan E. Mills TI - VaPoR: a high-speed validation approach for structural variation using long-read sequencing technology AID - 10.1101/105817 DP - 2017 Jan 01 TA - bioRxiv PG - 105817 4099 - http://biorxiv.org/content/early/2017/02/03/105817.short 4100 - http://biorxiv.org/content/early/2017/02/03/105817.full AB - Summary Although there are numerous algorithms that have been developed to identify structural variation (SVs) in genomic sequences, there is a dearth of approaches that can be used to evaluate their results. The emergence of new sequencing technologies that generate longer sequence reads can, in theory, provide direct evidence for all types of SVs regardless of the length of region through which it spans. However, current efforts to use these data in this manner require the use of large computational resources to assemble these sequences as well as manual inspection of each region. Here, we present VaPoR, a highly efficient algorithm that autonomously validates large SV sets using long read sequencing data. We assess of the performance of VaPoR on both simulated and real SVs with regards to various features including accuracy and sensitivity of breakpoint evaluation and report a high fidelity rate.Availability https://github.com/mills-lab/VaPoRContact remills{at}umich.eduSupplementary information Supplementary data are available at Bioinformatics online.