Analysis of human P[4]G2 rotavirus strains isolated in Brazil reveals codon usage bias and strong compositional constraints

Infect Genet Evol. 2011 Apr;11(3):580-6. doi: 10.1016/j.meegid.2011.01.006. Epub 2011 Jan 19.

Abstract

The Rotavirus genus belongs to the family Reoviridae and its genome consist of 11 segments of double-stranded RNA. Group A rotaviruses (RV-A) are the main etiological agent of acute viral gastroenteritis in infants and young children worldwide. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. However, the factors shaping synonymous codon usage bias and nucleotide composition in human RV-A are currently unknown. In order to gain insight into these matters, we analyzed the codon usage and base composition constraints on the two genes that codify the two outer capsid proteins (VP4 [VP8*] and VP7) of 58 P[4]G2 RV-A strains isolated in Brazil and investigated the possible key evolutionary determinants of codon usage bias. The results of these studies revealed that the frequencies of codon usage in both RV-A proteins studied are significantly different than the ones used by human cells. In order to observe if similar trends of codon usage are found when RV-A complete genomes are considered, we compare these results with results found using a dataset of 10 reference strains for whom the complete codes of the 11 segments are known. Similar results were obtained using capsid proteins or complete genomes. The general correlations found between the position of each sequence on the first axis generated by correspondence analysis and the relative dinucleotide abundances indicate that codon usage in RV-A can also be strongly influenced by underlying biases in dinucleotide frequencies. CpG and GpC containing codons are markedly suppressed. Thus, the results of this study suggest that RV-A genomic biases are the result of the evolution of genome composition in relation to host adaptation and the ability to escape antiviral cell responses.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antigens, Viral / genetics*
  • Base Composition
  • Brazil
  • Capsid Proteins / genetics*
  • Child, Preschool
  • Codon*
  • Feces / virology
  • Genes, Viral*
  • Humans
  • Infant
  • Multivariate Analysis
  • Open Reading Frames
  • RNA, Viral / isolation & purification
  • Rotavirus / genetics*
  • Sequence Analysis, DNA
  • Statistics, Nonparametric

Substances

  • Antigens, Viral
  • Capsid Proteins
  • Codon
  • RNA, Viral
  • VP4 protein, Rotavirus
  • VP7 protein, Rotavirus