RT Journal Article SR Electronic T1 A recurrence based approach for validating structural variation using long-read sequencing technology JF bioRxiv FD Cold Spring Harbor Laboratory SP 105817 DO 10.1101/105817 A1 Xuefang Zhao A1 Alexandra M. Weber A1 Ryan E. Mills YR 2017 UL http://biorxiv.org/content/early/2017/02/24/105817.abstract AB Although there are numerous algorithms that have been developed to identify structural variation (SVs) in genomic sequences, there is a dearth of approaches that can be used to evaluate their results. The emergence of new sequencing technologies that generate longer sequence reads can, in theory, provide direct evidence for all types of SVs regardless of the length of region through which it spans. However, current efforts to use these data in this manner require the use of large computational resources to assemble these sequences as well as manual inspection of each region. Here, we present VaPoR, a highly efficient algorithm that autonomously validates large SV sets using long read sequencing data. We assess of the performance of VaPoR on both simulated and real SVs and report a high-fidelity rate for various features including overall accuracy, sensitivity of breakpoint precision, and predicted genotype.