Experimental determination of translational start sites resolves uncertainties in genomic open reading frame predictions - application to Mycobacterium tuberculosis

Microbiology (Reading). 2009 Jan;155(Pt 1):186-197. doi: 10.1099/mic.0.022889-0.

Abstract

Correct identification of translational start sites is important for understanding protein function and transcriptional regulation. The annotated translational start sites contained in genome databases are often predicted using bioinformatics and are rarely verified experimentally, and so are not all accurate. Therefore, we devised a simple approach for determining translational start sites using a combination of epitope tagging and frameshift mutagenesis. This assay was used to determine the start sites of three Mycobacterium tuberculosis proteins: LexA, SigC and Rv1955. We were able to show that proteins may begin before or after the predicted site. We also found that a small, non-annotated open reading frame upstream of Rv1955 was expressed as a protein, which we have designated Rv1954A. This approach is readily applicable to any bacterial species for which plasmid transformation can be achieved.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacterial Proteins / chemistry
  • Bacterial Proteins / genetics*
  • Bacterial Proteins / metabolism
  • Base Sequence
  • Codon, Initiator*
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Epitopes
  • Frameshift Mutation
  • Genome, Bacterial
  • Humans
  • Molecular Sequence Data
  • Mycobacterium tuberculosis / genetics*
  • Mycobacterium tuberculosis / metabolism
  • Open Reading Frames / genetics*
  • Open Reading Frames / physiology
  • Plasmids / genetics
  • Protein Biosynthesis*
  • Serine Endopeptidases / chemistry
  • Serine Endopeptidases / genetics
  • Serine Endopeptidases / metabolism
  • Sigma Factor / chemistry
  • Sigma Factor / genetics
  • Sigma Factor / metabolism

Substances

  • Bacterial Proteins
  • Codon, Initiator
  • DNA-Binding Proteins
  • Epitopes
  • LexA protein, Bacteria
  • Sigma Factor
  • sigC protein, Bacteria
  • Serine Endopeptidases