Identification of small RNA pathway genes using patterns of phylogenetic conservation and divergence

Nature. 2013 Jan 31;493(7434):694-8. doi: 10.1038/nature11779. Epub 2012 Dec 23.

Abstract

Genetic and biochemical analyses of RNA interference (RNAi) and microRNA (miRNA) pathways have revealed proteins such as Argonaute and Dicer as essential cofactors that process and present small RNAs to their targets. Well-validated small RNA pathway cofactors such as these show distinctive patterns of conservation or divergence in particular animal, plant, fungal and protist species. We compared 86 divergent eukaryotic genome sequences to discern sets of proteins that show similar phylogenetic profiles with known small RNA cofactors. A large set of additional candidate small RNA cofactors have emerged from functional genomic screens for defects in miRNA- or short interfering RNA (siRNA)-mediated repression in Caenorhabditis elegans and Drosophila melanogaster, and from proteomic analyses of proteins co-purifying with validated small RNA pathway proteins. The phylogenetic profiles of many of these candidate small RNA pathway proteins are similar to those of known small RNA cofactor proteins. We used a Bayesian approach to integrate the phylogenetic profile analysis with predictions from diverse transcriptional coregulation and proteome interaction data sets to assign a probability for each protein for a role in a small RNA pathway. Testing high-confidence candidates from this analysis for defects in RNAi silencing, we found that about one-half of the predicted small RNA cofactors are required for RNAi silencing. Many of the newly identified small RNA pathway proteins are orthologues of proteins implicated in RNA splicing. In support of a deep connection between the mechanism of RNA splicing and small-RNA-mediated gene silencing, the presence of the Argonaute proteins and other small RNA components in the many species analysed strongly correlates with the number of introns in those species.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Caenorhabditis elegans / classification
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans Proteins / genetics
  • Eukaryota / classification
  • Eukaryota / genetics
  • Genetic Variation*
  • Genome / genetics
  • MicroRNAs / genetics
  • Phylogeny*
  • Proteome
  • RNA Splicing
  • RNA, Small Interfering / genetics*

Substances

  • Caenorhabditis elegans Proteins
  • MicroRNAs
  • Proteome
  • RNA, Small Interfering