TY - JOUR T1 - Copy number variants in the sheep genome detected using multiple approaches JF - bioRxiv DO - 10.1101/041475 SP - 041475 AU - Gemma M Jenkins AU - Michael E Goddard AU - Michael A Black AU - Rudiger Brauning AU - Benoit Auvray AU - Ken G Dodds AU - James W Kijas AU - Noelle Cockett AU - John C McEwan Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/04/26/041475.abstract N2 - Background. Copy number variants (CNVs) are a type of polymorphism found to underlie phenotypic variation, both in humans and livestock. Most surveys of CNV in livestock have been conducted in the cattle genome, and often utilise only a single approach for the detection of copy number differences. Here we performed a study of CNV in sheep, using multiple methods to identify and characterise copy number changes. Comprehensive information from small pedigrees (trios) was collected using multiple platforms (array CGH, SNP chip and whole genome sequence data), with these data then analysed via multiple approaches to identify and verify CNVs.Results. In total, 3,488 autosomal CNV regions (CNVRs) were identified in this study, which substantially builds on an initial survey of the sheep genome that identified 135 CNVRs. The average length of the identified CNVRs was 19kb (range of 1kb to 3.6Mb), with shorter CNVRs being more frequent than longer CNVRs. The total length of all CNVRs was 67.6Mbps, which equates to 2.7% of the sheep autosomes. For individuals this value ranged from 0.24 to 0.55%, and the majority of CNVRs were identified in single animals. Rather than being uniformly distributed throughout the genome, CNVRs tended to be clustered. Application of three independent approaches for CNVR detection facilitated a comparison of validation rates. CNVs identified on the Roche-NimbleGen 2.1M CGH array generally had low validation rates with lower density arrays, while whole genome sequence data had the highest validation rate (>60%).Conclusions. This study represents the first comprehensive survey of the distribution, prevalence and characteristics of CNVR in sheep. Multiple approaches were used to detect CNV regions and it appears that the best method for verifying CNVR on a large scale involves using a combination of detection methodologies. The characteristics of the 3,488 autosomal CNV regions identified in this study are comparable to other CNV regions reported in the literature and provide a valuable and sizeable addition to the small subset of published sheep CNVs.Abbreviations:Absl2r ‐absolute log2 ratiobps ‐base pairsBWA ‐Burrows-Wheeler AlignmentCGH ‐comparative genomic hybridisationCNV ‐Copy number variantsCNVR ‐CNV regionsIMF ‐International Mapping Flockindels ‐INsertions/DELetionSISGC ‐International Sheep Genomics ConsortiumKb ‐kilobaseMb ‐megabaseOligo aCGH ‐oligonucleotide CGH arrayQTL ‐quantitative trait lociSD ‐Segmental duplicationsSLE ‐systemic lupus erythematosus ER -