Bacteriophage genomics

https://doi.org/10.1016/j.mib.2003.09.004Get rights and content

Abstract

Comparative genomic studies of bacteriophages, especially the tailed phages, together with environmental studies, give a dramatic new picture of the size, genetic structure and dynamics of this population. Sequence comparisons reveal some of the detailed mechanisms by which these viruses evolve and influence the evolution of their bacterial and archaeal hosts. We see rampant horizontal exchange of sequences among genomes, mediated by both homologous and nonhomologous recombination. High frequency exchange among phages occupying similar ecological niches leads to a high degree of mosaic diversity in local populations. Horizontal exchange also takes place at lower frequency across the entire span of phage sequence space.

Introduction

Suppose we were to stop an average molecular biologist on the street and ask what kinds of bacteriophages there are on Earth. The answer would certainly include T4, the T. rex of the phage world, as well as the well studied temperate phage λ. Most respondents would remember that there are other similar temperate phages, including some such as Salmonella phage P22 that infect non-Escherichia coli bacteria, and most would also know that there is a phage called T7 whose raison d’être is to supply promoters and a special RNA polymerase for expression vectors. The more sophisticated molecular biologists might recall that there are actually dozens of other phages that infect bacteria such as Bacillus and Vibrio, and that there are also some curious small phages, such as φX174, MS2 and M13, that lack tails and have genomes that are not dsDNA. Such a response would not be very different from what we would have heard had we conducted our survey a decade ago. Yet as readers who have been paying attention will know, over the past 10 years, and especially over the past 3–5 years, there has been a revolutionary change in our understanding of the nature and size of the global phage population, of the mechanisms and dynamics by which the phages evolve, and of the enormous impact phages have on terrestrial biology and biogeochemistry, and on the ecology and evolution of their bacterial and archaeal hosts.

Our new view of these matters comes primarily from two advances: first, a new appreciation of the abundance of tailed phages in the environment and of the dynamic nature of that population, and second, the recent availability of a significant number of phage genome sequences. Direct measurements of the abundance of tailed phages in the environment indicate ∼107 particles/ml in coastal seawater and even higher numbers in some freshwater sources [1] (see Figure 1). The global population is estimated to be on the order of 1031 individuals: if they were laid end to end, they would span between the Earth and the Sun 1013 times. This population turns over every few days [2], from which it can be calculated that the number of phages initiating an infection somewhere on Earth every second is on the order of Avagadro’s number. Numbers like these give a new impression of the opportunities for genetic exchange in the phage population, particularly because most bacterial cells contain prophage DNA with which the infecting phage can interact. At the laboratory end of things there are now about 150 complete genome sequences of tailed phages in the databases, and this number is increasing rapidly. At first blush, 150 genome sequences appears to be a hopelessly miniscule sampling of the population, but as described below, there are beginning to be indications that, as a result of extensive horizontal exchange of sequences across the population, we have already seen an appreciable fraction — if still a small minority — of all phage gene sequence families.

Section snippets

Mechanisms of evolution

The hallmark of tailed phage genome structure as seen by genome comparisons is genetic mosaicism, evidently arising from nonhomologous recombination between ancestral sequences. This was first seen in the ‘lambdoid’ phages of enteric hosts at low resolution in the 1960s, and it has been confirmed in detail for this group 3., 4., 5., 6., 7. through genome sequencing (Figure 2). The modules of mosaicism are most often individual genes, although they can be parts of genes corresponding to protein

Phage population structure

Our picture of the genetic structure of the tailed phage population is still very sketchy because our data come from a highly biased sample of ∼150 genomes from the population of ∼1031. The sequences of presumably homologous genes have frequently diverged beyond recognition. (“Presumably homologous genes” include those encoding the major capsid subunit. Although these proteins cannot always be shown by sequence analysis to be homologous, a recent structural analysis shows that the P22 capsid

Prophages and prophage-like elements

Bacterial genome sequences contain a large number of intact or defective prophages. The recent count of 230 prophages residing in 51 of the 82 available bacterial genome sequences is almost certainly an underestimate owing to our incomplete methods for recognizing prophages [26]. Several recent papers discuss the growing evidence for a major role of prophages in bacterial evolution, as a source of new genetic information 26., 27. and as a selective force acting on host genome structure [28].

Other phages, other viruses

The tailed, dsDNA phages discussed above may be the most abundant phages on the planet, but there are of course several other phage types known that differ in chromosome type, virion structure, and life style. For the most part genomic analysis of these viruses, although informative, has not yet provided the same level of insights as has been the case for the tailed phages. However, at a level of divergence beyond sequence recognizability, there are clear indications of elements of shared

Conclusions and prospects

Comparative analyses of phage genome sequences have begun to reveal mechanisms by which these genomes evolve, through revealing ‘fossils’ of past evolutionary events in the sequences. Although not emphasized in this review, these studies also often suggest hypotheses about novel functional aspects of the biology of the phages (for example, see 3., 9.••). These are frequently totally unpredicted but once revealed can be tested experimentally. Broader scale questions — for example, questions

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • of special interest

  • ••

    of outstanding interest

Acknowledgements

Work in the author’s laboratory on this topic is supported by grant GM51975 from the National Institutes of Health.

References (33)

  • G.E Allison et al.

    Complete genomic sequence of SfV, a serotype-converting temperate bacteriophage of Shigella flexneri

    J. Bacteriol.

    (2002)
  • J Recktenwald et al.

    The nucleotide sequence of Shiga toxin (Stx) 2e-encoding phage phiP27 is not related to other Stx phage genomes, but the modular genetic structure is conserved

    Infect Immun.

    (2002)
  • C Proux et al.

    The dilemma of phage taxonomy illustrated by comparative genomics of Sfi21-like Siphoviridae in lactic acid bacteria

    J. Bacteriol.

    (2002)
  • R.W Hendrix

    Bacteriophages: evolution of the majority

    Theor. Popul. Biol.

    (2002)
  • P.L Wagner et al.

    Bacteriophage control of Shiga toxin 1 production and release by Escherichia coli

    Mol. Microbiol.

    (2002)
  • R.W Hendrix et al.

    The origins and ongoing evolution of viruses

    Trends Microbiol.

    (2000)
  • Cited by (255)

    • Mechanisms of horizontal gene transfer and DNA recombination

      2023, Molecular Medical Microbiology, Third Edition
    • The small genome, virulent, non-contractile tailed bacteriophages that infect Enterobacteriales hosts

      2022, Virology
      Citation Excerpt :

      Tailed bacteriophages are among the most abundant organisms on Earth (Hambly and Suttle, 2005; Hendrix, 2003; Suttle, 2005; Wilhelm et al., 2002).

    View all citing articles on Scopus
    View full text