TY - JOUR T1 - Assessing Runs of Homozygosity: A comparison of SNP Array and Whole Genome Sequence low coverage data JF - bioRxiv DO - 10.1101/160705 SP - 160705 AU - Francisco C. Ceballos AU - Scott Hazelhurst AU - Michèle Ramsay Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/07/07/160705.abstract N2 - Runs of Homozygosity (ROH) are sequences that arise when identical haplotypes are inherited from each parent. Since their first detection due to technological advances in the late 1990s, ROHs have been shedding light on human population history and deciphering the genetic basis of monogenic and complex traits and diseases. ROH studies have predominantly exploited SNP array data, but are gradually moving to whole genome sequence (WGS) data as it becomes available. WGS data, covering more genetic variability, can add value to ROH studies, but require additional considerations during analysis. Using SNP array and low coverage WGS data from 1885 individuals from 20 world populations, our aims were to compare ROH from the two datasets and to establish software conditions to get comparable results, thus providing guidelines for combining disparate datasets in joint ROH analyses. Using the PLINK Homozygosity functions, we found that by allowing 3 heterozygous SNPs per window when dealing with WGS low coverage data, it is possible to establish meaningful comparisons between data using the two technologies. ER -