Regular article
Leaderless transcripts of the crenarchaeal hyperthermophile Pyrobaculum aerophilum1

https://doi.org/10.1006/jmbi.2001.4669Get rights and content

Abstract

We mapped transcription start sites for ten unrelated protein-encoding Pyrobaculum aerophilum genes by primer extension and S1 nuclease mapping. All of the mapped transcripts start at the computationally predicted translation start codons, two of which were supported by N-terminal protein sequencing. A whole genome computational analysis of the regions from −50 to +50 nt around the predicted translation starts codons revealed a clear upstream pattern matching the consensus sequence of the archaeal TATA box located unusually close to the translation starts. For genes with the TATA boxes that best matched the consensus sequence, the distance between the TATA box and the translation start codon appears to be shorter than 30 nt. Two other promoter elements distinguished were also found unusually close to the translation start codons: a transcription initiator element with significant elevation of C and T frequencies at the −1 position and a BRE element with more frequent A bases at position −29 to −32 (counting from the translation start site). We also show that one of the mapped genes is transcribed as the first gene of an operon. For a set of genes likely to be internal in operons the upstream signal extracted by computer analysis was a Shine-Dalgarno pattern matching the complementary sequence of P. aerophilum 16 S rRNA. Together these results suggest that the translation of proteins encoded by single genes or genes that are first in operons in the hyperthermophilic crenarchaeon P. aerophilum proceeds mostly, if not exclusively, through leaderless transcripts. Internal genes in operons are likely to undergo translation via a mechanism that is facilitated by ribosome binding to the Shine-Dalgarno sequence.

Introduction

Archaea (archaebacteria) constitute one of the three major evolutionary lineages of life on Earth. They are globally widespread, populating soil1 and other ecological niches; it has been found that Archaea may constitute up to 30% of the oceanic pikoplankton in coastal Antarctic surface water.2 Hyperthermophilic archaea represent the organisms at the upper temperature border of life, with upper growth limits between 80°C-113°C.3 They are an interesting source for the study of phylogeny and organization of ancestral life. Archaea are currently divided into three kingdoms: Euryarchaeota, Crenarchaeota and Korarchaeota.4

Pyrobaculum aerophilum is a rod-shaped crenarchaeon that grows optimally at 100°C. The species was isolated by Karl Stetter and co-workers from a boiling marine water hole.5 It is facultatively aerobic and has a plating efficiency of up to 100%.5 These features make it a good candidate for development as a model hyperthermophilic organism. The entire genome of P. aerophilum has recently been sequenced. We have reported the results from sampling small cDNA and genomic DNA libraries,6 and described the development of a genome map and identification of 489 genes.7 Here we present the next step in our study of P. aerophilum. In order to get some information on the mechanisms of P. aerophilum transcription and translation we mapped the transcription initiation sites by primer extension and in some cases additionally by S1 nuclease mapping of ten randomly chosen P. aerophilum genes. Surprisingly, all mapped transcripts start just before the computationally predicted translation start codons, two of which were confirmed by N-terminal sequencing. By Northern blot analysis we have shown that one of the mapped genes is transcribed as the first gene in an operon. We also performed whole genome analysis and found that the predominant start context signal, located unusually close to the translation start codons, was similar to an archaeal promoter sequence with consensus TTTA{A,T}A. We did not detect a Shine-Dalgarno pattern while dealing with the whole set of genes from P. aerophilum, but found this pattern for a set of genes likely to be integral in operons. The experimental and computational results together suggest that the translation of proteins encoded by single or operon-leading genes in P. aerophilum proceeds mostly through leaderless transcripts, whereas genes that are inside operons are likely to utilize Shine-Dalgarno sequences for initiation of translation. We discuss this phenomenon in light of the available data on leaderless transcripts and possible translation mechanisms.

Section snippets

Genes chosen for analysis

We chose a few unrelated P. aerophilum genes with high BLAST scores to sequences from a variety of organisms and obtained BESTFIT8 pairwise alignment of the encoded protein sequences with their homologs. Genes chosen for the study, their accession numbers, and homology to known genes are shown in Table 1. Initially we designated the AUG or GUG near the 5′ end of an open reading frame, and close to the sequence resembling the consensus sequence for the archaeal TATA box, as the putative

Discussion

Results presented here suggest that transcription of single genes and polycistronic mRNA in P. aerophilum proceeds mostly with the formation of leaderless transcripts, which do not contain 5′ untranslated sequences (5′-UTR). Two lines of evidence support this conclusion: (i) the experimental mapping of the transcripts of randomly picked genes; and (ii) results of computational analysis of the genomic DNA sequence.

The experimental mapping of ten randomly picked genes, with primer extension and S1

Strains and media

P. aerophilum strain IM2 growth conditions were as described5 except that 5% oxygen was used for anaerobic growth.

DNA synthesis and sequencing

Oligonucleotides were synthesized on a Beckman oligo 1000 DNA synthesizer by using solid-phase cyanoethyl phosphoramidite chemistry. All oligonucleotides were deprotected in ammonium hydroxide and used without further purification.

DNA sequencing was done by using [α-32P]dATP and the SequiTherm Cycle Sequencing Kit (Epicentre Technologies, Madison, WI) with reagents supplied by the

Acknowledgements

The authors thank Claudia Baikalov and Ju-Huei Chiang for help with DNA sequencing, Dr Jorg Soppa for helpful references and Alex Lomsadze his help and advice on using GeneMark.hmm 2.0.

This work was supported by the grant from the National Institutes of Health (GM 57917 to J.H.M.) and by the grant from the National Institutes of Health (HG 00783 to M.B.).

References (52)

  • S.B Bintrim et al.

    Molecular phylogeny of Archaea from soil

    Proc. Natl Acad. Sci. USA

    (1997)
  • E.F DeLong et al.

    High abundance of Archaea in Antarctic marine picoplankton

    Nature

    (1994)
  • K.O Stetter

    Extremophiles and their adaptation to hot environments

    FEBS Letters

    (1999)
  • S.M Barns et al.

    Perspectives on archaeal diversity, thermophily and monophyly from environmental rRNA sequences

    Proc. Natl Acad. Sci. USA

    (1996)
  • P Völkl et al.

    Pyrobaculum aerophilum sp. nov., a novel nitrate-reducing hyperthermophilic archaeum

    Appl. Environ. Microbiol.

    (1993)
  • P Völkl et al.

    Genomic and cDNA sequence tags of the hyperthermophilic archaeon Pyrobaculum aerophilum

    Nucl. Acids Res.

    (1996)
  • S Fitz-Gibbon et al.

    A fosmid-based genomic map and identification of 474 genes of the hyperthermophilic archaeon Pyrobaculum aerophilum

    Extremophiles

    (1997)
  • Wisconsin Package 9.1 edit

    (1997)
  • J Hain et al.

    Elements of an archaeal promoter defined by mutational analysis

    Nucl. Acids Res.

    (1992)
  • J.R Palmer et al.

    In vivo definition of an archaeal promoter

    J. Bacteriol.

    (1995)
  • J Soppa

    Normalized nucleotide frequencies allow the definition of archaeal promoter elements for different archaeal groups and reveal base-specific TFB contacts upstream of the TATA box [letter]

    Mol. Microbiol.

    (1999)
  • W.-D Reiter et al.

    Transcription termination in the archaebacterium Sulolobussignal structures and linkage to transcription initiation

    Nucl. Acids Res.

    (1988)
  • J.N Reeve

    Molecular biology of methanogens

    Annu. Rev. Microbiol.

    (1992)
  • J Soppa

    Transcription initiation in Archaeafacts, factors and future aspects

    Mol. Microbiol.

    (1999)
  • M Hensel et al.

    The genetic basis of tetrathionate respiration in Salmonella typhimurium

    Mol. Microbiol.

    (1999)
  • C.E Lawrence et al.

    Detecting subtle sequence signalsa Gibbs sampling strategy for multiple alignment

    Science

    (1993)
  • M Borodovsky et al.

    Detection of new genes in the bacterial genome using Markov models for three gene classes

    Nucl. Acids Res.

    (1995)
  • M Thomm

    Archaeal transcription factors and their role in transcription initiation

    FEMS Microbiol. Rev.

    (1996)
  • W.D Reiter et al.

    Mutational analysis of an archaebacterial promoteressential role of a TATA box for transcription efficiency and start-site selection in vitro

    Proc. Natl Acad. Sci. USA

    (1990)
  • S.T Smale

    Core promoter architecture for eucariotic protein-coding genes

  • S.A Qureshi et al.

    Sequence - specific DNA binding by the S. shibatae TFIIB homolog, TFB and its effect on promoter strength

    Mol. Cell

    (1998)
  • O Littlefield et al.

    The structural basis for the oriented assembly of a TBP/TFB/promoter complex

    Proc. Natl Acad. Sci. USA

    (1999)
  • G.R Janssen

    Eubacterial, archaebacterial, and eucaryotic genes that encode leaderless mRNA

  • C.J Wu et al.

    Translation of vph mRNA in Streptomyces lividans and Escherichia coli after removal of the 5′ untranslated leader

    Mol. Microbiol.

    (1996)
  • C Sensen et al.

    Organizational characteristics and information content of an archaeal genome156 kb of sequence from Sulfolobus solfataricus P2

    Mol. Microbiol.

    (1996)
  • N Tolstrup et al.

    Two different and highly organized mechanisms of translation initiation in the archaeon Sulfolobus solfataricus

    Extermophiles

    (2000)
  • Cited by (72)

    • The folA gene from the Rickettsia endosymbiont of Ixodes pacificus encodes a functional dihydrofolate reductase enzyme

      2018, Ticks and Tick-borne Diseases
      Citation Excerpt :

      Previous studies reported that 5′UTR of prokaryotic genes are highly diverse (Chang et al., 2006; Sakai et al., 2001). This is actually a common phenomenon, as many non-SD-led genes and leadless genes have been detected in the genomes of Archaea and Bacteria (Slupska et al., 2001). In fact, less than 20% of genes from some microbial genomes were found to be SD-led (Chang et al., 2006).

    • Biochemical and structural insights into RNA binding by Ssh10b, a member of the highly conserved Sac10b protein family in archaea

      2014, Journal of Biological Chemistry
      Citation Excerpt :

      Therefore, we speculate that the Sac10b proteins serve a role in RNA transactions such as translation in hyperthermophilic archaea. It is known that leaderless mRNAs are common in Archaea (42, 43). A gene encoded by a monocistronic transcript and that located at the 5′-proximal end of an operon often lack a Shine-Dalgarno sequence and generally have little or no sequence upstream of a translational start point in Sulfolobus species (42, 44, 45).

    • Genome-wide Mapping of Transcriptional Start Sites Defines an Extensive Leaderless Transcriptome in Mycobacterium tuberculosis

      2013, Cell Reports
      Citation Excerpt :

      In the classical Escherichia coli paradigm, translation of bacterial mRNA is initiated by binding of a Shine-Dalgarno sequence in the 5′ UTR to the complementary region of 16S rRNA (Shine and Dalgarno, 1974). However, recent bioinformatic analyses suggest that the frequency of Shine-Dalgarno sequences is lower in some eubacterial genera (Nakagawa et al., 2010; Zheng et al., 2011) and that they may resemble Archaea in having a greater reliance on translation of leaderless mRNAs that lack a 5′ UTR (Tolstrup et al., 2000; Slupska et al., 2001; Torarinsson et al., 2005; Brenneis et al., 2007). Leaderless mRNAs are translated with lower efficiency by E. coli ribosomes (Moll et al., 2002), though this preference can be altered during exposure to stress (Moll and Engelberg-Kulka, 2012; Moll et al., 2004) or as a result of changes to the ribosome induced by antibiotics or bacterial toxins (Kaberdina et al., 2009; Vesper et al., 2011).

    • Archaeal translation initiation factor aIF2 can substitute for eukaryotic eIF2 in ribosomal scanning during mammalian 48s complex formation

      2011, Journal of Molecular Biology
      Citation Excerpt :

      As in Bacteria, many of archaeal mRNAs are polycistronic, suggesting internal ribosome binding to proximal cistrons.16,17,27 Bioinformatic analysis of archaeal genomes revealed that the majority of transcripts from different species start directly from the AUG codon25,27,28 and likely utilize an ancient mechanism of factorless 70S/80S ribosome binding to the 5′-terminal AUGs (for review, see Ref. 5). For the rest of the transcripts, neither Shine–Dalgarno nor any other bacterial-like initiation signals were detected in their 5′ untranslated regions (5′ UTRs),29 suggesting that either a eukaryotic-like scanning or a novel mechanism must operate on these transcripts.

    View all citing articles on Scopus
    1

    Edited by M. Gottesman

    2

    Present address: A. G. King, Department of Pathology and Developmental Biology, B259 Beckman Center, Stanford University School of Medicine, Stanford, CA 94305, USA.

    View full text