On the origin of new genes in Drosophila

  1. Qi Zhou1,2,4,
  2. Guojie Zhang1,2,3,4,
  3. Yue Zhang1,4,
  4. Shiyu Xu1,
  5. Ruoping Zhao1,
  6. Zubing Zhan1,2,
  7. Xin Li1,2,
  8. Yun Ding1,2,
  9. Shuang Yang1,3, and
  10. Wen Wang1,5
  1. 1 CAS-Max Planck Junior Research Group, State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China;
  2. 2 Graduate School of Chinese Academy Sciences, Beijing 100086, China;
  3. 3 Beijing Genomics Institute-Shenzhen, Shenzhen 518083, China
  1. 4 These authors contributed equally to this work.

Abstract

Several mechanisms have been proposed to account for the origination of new genes. Despite extensive case studies, the general principles governing this fundamental process are still unclear at the whole-genome level. Here, we unveil genome-wide patterns for the mutational mechanisms leading to new genes and their subsequent lineage-specific evolution at different time nodes in the Drosophila melanogaster species subgroup. We find that (1) tandem gene duplication has generated ∼80% of the nascent duplicates that are limited to single species (D. melanogaster or Drosophila yakuba); (2) the most abundant new genes shared by multiple species (44.1%) are dispersed duplicates, and are more likely to be retained and be functional; (3) de novo gene origination from noncoding sequences plays an unexpectedly important role during the origin of new genes, and is responsible for 11.9% of the new genes; (4) retroposition is also an important mechanism, and had generated ∼10% of the new genes; (5) ∼30% of the new genes in the D. melanogaster species complex recruited various genomic sequences and formed chimeric gene structures, suggesting structure innovation as an important way to help fixation of new genes; and (6) the rate of the origin of new functional genes is estimated to be five to 11 genes per million years in the D. melanogaster subgroup. Finally, we survey gene frequencies among 19 globally derived strains for D. melanogaster-specific new genes and reveal that 44.4% of them show copy number polymorphisms within a population. In conclusion, we provide a panoramic picture for the origin of new genes in Drosophila species.

Footnotes

  • 5 Corresponding author.

    5 E-mail wwang{at}mail.kiz.ac.cn; fax 86-871-5193137.

  • [Supplemental material is available online at www.genome.org.]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.076588.108.

    • Received January 25, 2008.
    • Accepted May 28, 2008.
| Table of Contents

Preprint Server