TY  - JOUR
T1  - DiscoSnp-RAD: de novo detection of small variants for population genomics
JF  - bioRxiv
DO  - 10.1101/216747
SP  - 216747
AU  - Jérémy Gauthier
AU  - Charlotte Mouden
AU  - Tomasz Suchan
AU  - Nadir Alvarez
AU  - Nils Arrigo
AU  - Chloé Riou
AU  - Claire Lemaitre
AU  - Pierre Peterlongo
Y1  - 2019/01/01
UR  - http://biorxiv.org/content/early/2019/10/04/216747.abstract
N2  - We present an original method for de novo variant calling from Restriction site Associated DNA Sequencing (RAD-Seq) data. RAD-Seq is a technique that sequences specific loci along the genome. It is widely used in evolutionary biology because it makes it possible to exploit variant information (mainly SNPs) from entire populations at a reduced cost. Common RAD-dedicated tools, such as STACKS or IPyRAD, are based on all-versus-all read comparisons, which require substantial time and computing resources. Based on the variant caller DiscoSnp, initially designed for shotgun sequencing, DiscoSnp-RAD avoids this pitfall because variants are detected by exploring the De Bruijn Graph built from all the read datasets. We tested the implementation on RAD data from 259 specimens of Chiastocheta flies, morphologically assigned to 7 species. All individuals were successfully assigned to their species using both STRUCTURE and Maximum Likelihood phylogenetic reconstruction. Moreover, the identified variants revealed within-species structure and the existence of two populations linked to their geographic distributions. Furthermore, our results show that DiscoSnp-RAD is at least one order of magnitude faster than state-of-the-art tools.
The overall results show that DiscoSnp-RAD is suitable for identifying variants from RAD data and stands out from other tools because of its completely different principle, which makes it significantly faster, particularly on large datasets. License: GNU Affero General Public License. Availability: https://github.com/GATB/DiscoSnp Contact: jeremy.gauthier{at}inria.fr
ER  - 