PT - JOURNAL ARTICLE AU - Sedlazeck, Fritz J. AU - Lemmon, Zachary AU - Soyk, Sebastian AU - Salerno, William J. AU - Lippman, Zachary AU - Schatz, Michael C. TI - SVCollector: Optimized sample selection for validating and long-read resequencing of structural variants AID - 10.1101/342386 DP - 2018 Jan 01 TA - bioRxiv PG - 342386 4099 - http://biorxiv.org/content/early/2018/06/08/342386.short 4100 - http://biorxiv.org/content/early/2018/06/08/342386.full AB - Summary Structural Variations (SVs) are increasingly recognized for their importance in genomics. Short-read sequencing is the most widely-used approach for genotyping large numbers of samples for SVs but suffers from relatively poor accuracy. Here we present SVCollector, an open-source method that optimally selects samples to maximize variant discovery and validation using long read resequencing or PCR-based validation. SVCollector has two modes: selecting those samples that are individually the most diverse or those that collectively capture the largest number of variations.Availability https://github.com/fritzsedlazeck/SVCollectorContact fritz.sedlazeck{at}bcm.eduSupplementary information Supplementary data are available at Bioinformatics online.