Comparative analysis of regulatory elements between Escherichia coli and Klebsiella pneumoniae by genome-wide transcription start site profiling

PLoS Genet. 2012;8(8):e1002867. doi: 10.1371/journal.pgen.1002867. Epub 2012 Aug 9.

Abstract

Genome-wide transcription start site (TSS) profiles of the enterobacteria Escherichia coli and Klebsiella pneumoniae were experimentally determined through modified 5' RACE followed by deep sequencing of intact primary mRNA. This identified 3,746 and 3,143 TSSs for E. coli and K. pneumoniae, respectively. Experimentally determined TSSs were then used to define promoter regions and 5' UTRs upstream of coding genes. Comparative analysis of these regulatory elements revealed the use of multiple TSSs and identical sequence motifs for promoters and Shine-Dalgarno sequences, reflecting a conserved gene expression apparatus in the two species. In both species, over 70% of primary transcripts were expressed from operons having orthologous genes during exponential growth. However, expressed orthologous genes in E. coli and K. pneumoniae showed a strikingly different organization of upstream regulatory regions, with only 20% of promoters being identical and having TSSs in both species. Over 40% of promoters had TSSs identified in only one species, despite conserved promoter sequences existing in the other species. The 662 conserved promoters having TSSs in both species yielded the same number of comparable 5' UTR pairs, and the 5' UTR was found to be the most variable in sequence among promoter, 5' UTR, and ORF. In K. pneumoniae, 48 sRNAs were predicted and 36 of them were expressed during exponential growth. Among them, 34 sRNAs orthologous between the two species were analyzed in depth, and the analysis showed that many sRNAs of K. pneumoniae, including pleiotropic sRNAs such as rprA, arcZ, and sgrS, may work in the same way as in E. coli. These results reveal a new dimension of comparative genomics: a comparison of two genomes needs to be comprehensive across all levels of genome organization.
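
As an illustration of how a mapped TSS can delimit a 5' UTR and a promoter region upstream of a coding gene, the following is a minimal sketch and not the authors' pipeline. It assumes 1-based inclusive genome coordinates; the function names and the 50 bp promoter window are hypothetical choices made for this example, since the abstract does not state the window used.

```python
from typing import Optional, Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive, start <= end


def five_prime_utr(tss: int, cds_start: int, strand: str) -> Optional[Interval]:
    """Return the 5' UTR interval between a TSS and the first base of the CDS.

    Returns None for a leaderless transcript (TSS coincides with the CDS start).
    """
    if strand == "+":
        if tss > cds_start:
            raise ValueError("TSS lies downstream of the CDS start")
        return None if tss == cds_start else (tss, cds_start - 1)
    if strand == "-":
        # On the minus strand the CDS start has the higher coordinate
        # of the gene, and the TSS lies at an even higher coordinate.
        if tss < cds_start:
            raise ValueError("TSS lies downstream of the CDS start")
        return None if tss == cds_start else (cds_start + 1, tss)
    raise ValueError("strand must be '+' or '-'")


def promoter_window(tss: int, strand: str, length: int = 50) -> Interval:
    """Return a fixed-length window immediately upstream of the TSS.

    The default length of 50 bp is an assumption for illustration only.
    """
    if strand == "+":
        return (max(1, tss - length), tss - 1)
    if strand == "-":
        return (tss + 1, tss + length)
    raise ValueError("strand must be '+' or '-'")


# Hypothetical coordinates, not taken from the study.
utr_plus = five_prime_utr(tss=100, cds_start=130, strand="+")
utr_minus = five_prime_utr(tss=500, cds_start=470, strand="-")
prom_plus = promoter_window(tss=100, strand="+")
prom_minus = promoter_window(tss=500, strand="-")
```

With intervals defined this way for both genomes, the promoter, 5' UTR, and ORF of an orthologous gene pair could each be extracted and aligned separately, which is the kind of region-by-region comparison the abstract describes.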

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions
  • Amino Acid Sequence
  • Base Sequence
  • Conserved Sequence
  • Escherichia coli / genetics*
  • Gene Expression Profiling
  • Genome, Bacterial*
  • Genomics
  • Klebsiella pneumoniae / genetics*
  • Molecular Sequence Data
  • Operon / genetics
  • Promoter Regions, Genetic*
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Transcription Initiation Site*
  • Transcription, Genetic

Substances

  • 5' Untranslated Regions

Grants and funding

National Health Research Institutes in Taiwan funded this work through PH-099-SP-10. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.