The evolution of hexapod engrailed-family genes: evidence for conservation and concerted evolution

Proc Biol Sci. 2006 Jul 22;273(1595):1733-42. doi: 10.1098/rspb.2006.3497.

Abstract

Phylogenetic analyses imply that multiple engrailed-family gene duplications occurred during hexapod evolution, a view supported by previous reports of only a single engrailed-family gene in members of the grasshopper genus Schistocerca and in the beetle Tribolium castaneum. Here, we report the cloning of a second engrailed-family gene from Schistocerca gregaria and present evidence for two engrailed-family genes from four additional hexapod species. We also report the existence of a second engrailed-family gene in the Tribolium genome. We suggest that the engrailed and invected genes of Drosophila melanogaster have existed as a conserved gene cassette throughout holometabolous insect evolution. In total, 11 phylogenetically diverse hexapod orders are now known to contain species that possess two engrailed-family paralogues, and in each case only one paralogue encodes the RS-motif, a characteristic feature of holometabolous insect invected proteins. We propose that the homeoboxes of hexapod engrailed-family paralogues are evolving in a concerted fashion, resulting in gene trees that overestimate the frequency of gene duplication. We present new phylogenetic analyses using non-homeodomain amino acid sequence that support this view. The S. gregaria engrailed-family paralogues provide strong evidence that concerted evolution might in part be explained by recurrent gene conversion. Finally, we hypothesize that the RS-motif is part of a serine-rich domain targeted for phosphorylation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs
  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Cloning, Molecular
  • Conserved Sequence
  • Drosophila / genetics
  • Evolution, Molecular*
  • Gene Duplication
  • Genome, Insect
  • Grasshoppers / genetics*
  • Grasshoppers / metabolism
  • Homeodomain Proteins / chemistry
  • Homeodomain Proteins / genetics*
  • Homeodomain Proteins / metabolism
  • Insect Proteins / chemistry
  • Insect Proteins / genetics*
  • Insect Proteins / metabolism
  • Molecular Sequence Data
  • Multigene Family*
  • Phylogeny
  • Protein Structure, Tertiary
  • Sequence Alignment
  • Transcription Factors / chemistry
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism
  • Tribolium / genetics

Substances

  • Homeodomain Proteins
  • Insect Proteins
  • Transcription Factors