Extensive copy-number variation of young genes across stickleback populations

PLoS Genet. 2014 Dec 4;10(12):e1004830. doi: 10.1371/journal.pgen.1004830. eCollection 2014 Dec.

Abstract

Duplicate genes emerge as copy-number variations (CNVs) at the population level, and remain copy-number polymorphic until they are fixed or lost. The successful establishment of such structural polymorphisms in the genome plays an important role in evolution by promoting genetic diversity, complexity and innovation. To characterize the early evolutionary stages of duplicate genes and their potential adaptive benefits, we combine comparative genomics with population genomics analyses to evaluate the distribution and impact of CNVs across natural populations of an eco-genomic model, the three-spined stickleback. With whole genome sequences of 66 individuals from populations inhabiting three distinct habitats, we find that CNVs generally occur at low frequencies and are often only found in one of the 11 populations surveyed. A subset of CNVs, however, displays copy-number differentiation between populations, showing elevated within-population frequencies consistent with local adaptation. By comparing teleost genomes to identify lineage-specific genes and duplications in sticklebacks, we highlight rampant gene content differences among individuals in which over 30% of young duplicate genes are CNVs. These CNV genes are evolving rapidly at the molecular level and are enriched with functional categories associated with environmental interactions, depicting the dynamic early copy-number polymorphic stage of genes during population differentiation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptation, Biological / genetics
  • Animals
  • DNA Copy Number Variations*
  • Evolution, Molecular
  • Female
  • Gene Deletion
  • Gene Dosage
  • Gene Duplication
  • Genes, Duplicate / genetics*
  • Genetic Variation*
  • Male
  • Metagenomics
  • Phylogeny
  • Smegmamorpha / genetics*

Grants and funding

MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.