Analysis of copy number variations among diverse cattle breeds

  1. John W. Keele9,13
  1. 1 USDA-ARS, ANRI, Bovine Functional Genomics Laboratory, Beltsville, Maryland 20705, USA;
  2. 2 Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA;
  3. 3 Department of Bioengineering, University of Maryland, College Park, Maryland 20742, USA;
  4. 4 Department of Genetics and Microbiology, University of Bari, Bari 70126, Italy;
  5. 5 USDA-ARS, LARRL, Fort Keogh Miles City, Montana 59301, USA;
  6. 6 Departamento de Zootecnia, ESALQ-USP, Piracicaba SP 13418-900, Brazil;
  7. 7 Department of Animal Production, Faculty of Biotechnological Sciences, S. Prov. Casamassima, km 3-70010 Valenzano (Bari), Italy;
  8. 8 Bioinformatics and Computational Biology, George Mason University, Manassas, Virginia 20110, USA;
  9. 9 USDA-ARS, US Meat Animal Research Center, Clay Center, Nebraska 68933, USA;
  10. 10 Embrapa Pecuaria Sudeste, Sao Carlos–Sao Paulo, Rodovia Washington Luiz, km 234, Caixa Postal 339, CEP 13560-970, Brazil;
  11. 11 Deparment of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA;
  12. 12 Howard Hughes Medical Institute, Seattle, Washington 98195, USA

    Abstract

    Genomic structural variation is an important and abundant source of genetic and phenotypic variation. Here, we describe the first systematic and genome-wide analysis of copy number variations (CNVs) in modern domesticated cattle using array comparative genomic hybridization (array CGH), quantitative PCR (qPCR), and fluorescent in situ hybridization (FISH). The array CGH panel included 90 animals from 11 Bos taurus, three Bos indicus, and three composite breeds for beef, dairy, or dual purpose. We identified over 200 candidate CNV regions (CNVRs) in total and 177 within known chromosomes, which harbor or are adjacent to gains or losses. These 177 high-confidence CNVRs cover 28.1 megabases or ∼1.07% of the genome. Over 50% of the CNVRs (89/177) were found in multiple animals or breeds and analysis revealed breed-specific frequency differences and reflected aspects of the known ancestry of these cattle breeds. Selected CNVs were further validated by independent methods using qPCR and FISH. Approximately 67% of the CNVRs (119/177) completely or partially span cattle genes and 61% of the CNVRs (108/177) directly overlap with segmental duplications. The CNVRs span about 400 annotated cattle genes that are significantly enriched for specific biological functions, such as immunity, lactation, reproduction, and rumination. Multiple gene families, including ULBP, have gone through ruminant lineage-specific gene amplification. We detected and confirmed marked differences in their CNV frequencies across diverse breeds, indicating that some cattle CNVs are likely to arise independently in breeds and contribute to breed differences. Our results provide a valuable resource beyond microsatellites and single nucleotide polymorphisms to explore the full dimension of genetic variability for future cattle genomic research.

    Footnotes

    • 13 Corresponding authors.

      E-mail John.Keele{at}ars.usda.gov; fax (402) 762-4155.

      E-mail George.Liu{at}ars.usda.gov; fax (301) 504-8414.

    • [Supplemental material is available online at http://www.genome.org. The array CGH data from this study have been submitted to the NCBI Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo) under accession no. GSE19866.]

    • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.105403.110.

      • Received January 19, 2010.
      • Accepted March 4, 2010.
    | Table of Contents

    Preprint Server