Expression profile and gene age jointly shaped the genome-wide distribution of premature termination codons in a Drosophila melanogaster population

Mol Biol Evol. 2015 Jan;32(1):216-28. doi: 10.1093/molbev/msu299. Epub 2014 Nov 3.

Abstract

Widespread premature termination codon mutations (PTCs) were recently observed in human and fly populations. We took advantage of the population resequencing data in the Drosophila Genetic Reference Panel to investigate how the expression profile and the evolutionary age of genes shaped the allele frequency distribution of PTCs. After generating a high-quality data set of PTCs, we clustered genes harboring PTCs into three categories: genes encoding low-frequency PTCs (≤ 1.5%), moderate-frequency PTCs (1.5-10%), and high-frequency PTCs (>10%). All three groups show narrow transcription compared with PTC-free genes, with the moderate- and high-PTC frequency groups showing a pronounced pattern. Moreover, nearly half (42%) of the PTC-encoding genes are not expressed in any tissue. Interestingly, the moderate-frequency PTC group is strongly enriched for genes expressed in midgut, whereas genes harboring high-frequency PTCs tend to have sex-specific expression. We further find that although young genes born in the last 60 My compose a mere 9% of the genome, they represent 16%, 30%, and 50% of the genes containing low-, moderate-, and high-frequency PTCs, respectively. Among DNA-based and RNA-based duplicated genes, the child copy is approximately twice as likely to contain PTCs as the parent copy, whereas young de novo genes are as likely to encode PTCs as DNA-based duplicated new genes. Based on these results, we conclude that expression profile and gene age jointly shaped the landscape of PTC-mediated gene loss. Therefore, we propose that new genes may need a long time to become stably maintained after the origination.

Keywords: gene duplication; gene loss; midgut; premature termination codon; young gene.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromosomes
  • Codon, Nonsense*
  • Computational Biology / methods*
  • Drosophila Proteins / genetics*
  • Drosophila melanogaster / genetics*
  • Evolution, Molecular
  • Gene Expression Profiling
  • Sequence Analysis, DNA
  • Tissue Distribution

Substances

  • Codon, Nonsense
  • Drosophila Proteins