Bacteriophage genomics
Introduction
Suppose we were to stop an average molecular biologist on the street and ask what kinds of bacteriophages there are on Earth. The answer would certainly include T4, the T. rex of the phage world, as well as the well studied temperate phage λ. Most respondents would remember that there are other similar temperate phages, including some such as Salmonella phage P22 that infect non-Escherichia coli bacteria, and most would also know that there is a phage called T7 whose raison d’être is to supply promoters and a special RNA polymerase for expression vectors. The more sophisticated molecular biologists might recall that there are actually dozens of other phages that infect bacteria such as Bacillus and Vibrio, and that there are also some curious small phages, such as φX174, MS2 and M13, that lack tails and have genomes that are not dsDNA. Such a response would not be very different from what we would have heard had we conducted our survey a decade ago. Yet as readers who have been paying attention will know, over the past 10 years, and especially over the past 3–5 years, there has been a revolutionary change in our understanding of the nature and size of the global phage population, of the mechanisms and dynamics by which the phages evolve, and of the enormous impact phages have on terrestrial biology and biogeochemistry, and on the ecology and evolution of their bacterial and archaeal hosts.
Our new view of these matters comes primarily from two advances: first, a new appreciation of the abundance of tailed phages in the environment and of the dynamic nature of that population, and second, the recent availability of a significant number of phage genome sequences. Direct measurements of the abundance of tailed phages in the environment indicate ∼107 particles/ml in coastal seawater and even higher numbers in some freshwater sources [1] (see Figure 1). The global population is estimated to be on the order of 1031 individuals: if they were laid end to end, they would span between the Earth and the Sun 1013 times. This population turns over every few days [2], from which it can be calculated that the number of phages initiating an infection somewhere on Earth every second is on the order of Avagadro’s number. Numbers like these give a new impression of the opportunities for genetic exchange in the phage population, particularly because most bacterial cells contain prophage DNA with which the infecting phage can interact. At the laboratory end of things there are now about 150 complete genome sequences of tailed phages in the databases, and this number is increasing rapidly. At first blush, 150 genome sequences appears to be a hopelessly miniscule sampling of the population, but as described below, there are beginning to be indications that, as a result of extensive horizontal exchange of sequences across the population, we have already seen an appreciable fraction — if still a small minority — of all phage gene sequence families.
Section snippets
Mechanisms of evolution
The hallmark of tailed phage genome structure as seen by genome comparisons is genetic mosaicism, evidently arising from nonhomologous recombination between ancestral sequences. This was first seen in the ‘lambdoid’ phages of enteric hosts at low resolution in the 1960s, and it has been confirmed in detail for this group 3., 4., 5., 6., 7. through genome sequencing (Figure 2). The modules of mosaicism are most often individual genes, although they can be parts of genes corresponding to protein
Phage population structure
Our picture of the genetic structure of the tailed phage population is still very sketchy because our data come from a highly biased sample of ∼150 genomes from the population of ∼1031. The sequences of presumably homologous genes have frequently diverged beyond recognition. (“Presumably homologous genes” include those encoding the major capsid subunit. Although these proteins cannot always be shown by sequence analysis to be homologous, a recent structural analysis shows that the P22 capsid
Prophages and prophage-like elements
Bacterial genome sequences contain a large number of intact or defective prophages. The recent count of 230 prophages residing in 51 of the 82 available bacterial genome sequences is almost certainly an underestimate owing to our incomplete methods for recognizing prophages [26]. Several recent papers discuss the growing evidence for a major role of prophages in bacterial evolution, as a source of new genetic information 26., 27. and as a selective force acting on host genome structure [28].
Other phages, other viruses
The tailed, dsDNA phages discussed above may be the most abundant phages on the planet, but there are of course several other phage types known that differ in chromosome type, virion structure, and life style. For the most part genomic analysis of these viruses, although informative, has not yet provided the same level of insights as has been the case for the tailed phages. However, at a level of divergence beyond sequence recognizability, there are clear indications of elements of shared
Conclusions and prospects
Comparative analyses of phage genome sequences have begun to reveal mechanisms by which these genomes evolve, through revealing ‘fossils’ of past evolutionary events in the sequences. Although not emphasized in this review, these studies also often suggest hypotheses about novel functional aspects of the biology of the phages (for example, see 3., 9.••). These are frequently totally unpredicted but once revealed can be tested experimentally. Broader scale questions — for example, questions
References and recommended reading
Papers of particular interest, published within the annual period of review, have been highlighted as:
- •
of special interest
- ••
of outstanding interest
Acknowledgements
Work in the author’s laboratory on this topic is supported by grant GM51975 from the National Institutes of Health.
References (33)
- et al.
Genomic sequences of bacteriophages HK97 and HK022: pervasive genetic mosaicism in the lambdoid bacteriophages
J. Mol. Biol.
(2000) - et al.
Genomic sequence and analysis of the atypical temperate bacteriophage N15
J. Mol. Biol.
(2000) - et al.
Nucleotide sequence of coliphage HK620 and the evolution of lambdoid phages
J. Mol. Biol.
(2001) - et al.
Origins of highly mosaic mycobacteriophage genomes
Cell
(2003) - et al.
Role for a phage promoter in Shiga toxin 2 expression from a pathogenic Escherichia coli strain
J. Bacteriol.
(2001) - et al.
Coat protein fold and maturation transition of bacteriophage P22 seen at subnanometer resolutions
Nat. Struct. Biol.
(2003) - et al.
A conserved genetic module that encodes the major virion components in both the coliphage T4 and the marine cyanophage S-PM2
Proc. Natl. Acad. Sci. U.S.A.
(2001) - et al.
The R-type pyocin of Pseudomonas aeruginosa is related to P2 phage, and the F-type is related to lambda phage
Mol. Microbiol.
(2000) - et al.
Virioplankton: viruses in aquatic ecosystems
Microbiol. Mol. Biol. Rev.
(2000) - et al.
A dilution technique for the direct measurement of viral production: a comparison in stratified and tidally mixed coastal waters
Microb. Ecol.
(2002)
Complete genomic sequence of SfV, a serotype-converting temperate bacteriophage of Shigella flexneri
J. Bacteriol.
The nucleotide sequence of Shiga toxin (Stx) 2e-encoding phage phiP27 is not related to other Stx phage genomes, but the modular genetic structure is conserved
Infect Immun.
The dilemma of phage taxonomy illustrated by comparative genomics of Sfi21-like Siphoviridae in lactic acid bacteria
J. Bacteriol.
Bacteriophages: evolution of the majority
Theor. Popul. Biol.
Bacteriophage control of Shiga toxin 1 production and release by Escherichia coli
Mol. Microbiol.
The origins and ongoing evolution of viruses
Trends Microbiol.
Cited by (255)
Mechanisms of horizontal gene transfer and DNA recombination
2023, Molecular Medical Microbiology, Third EditionThe small genome, virulent, non-contractile tailed bacteriophages that infect Enterobacteriales hosts
2022, VirologyCitation Excerpt :Tailed bacteriophages are among the most abundant organisms on Earth (Hambly and Suttle, 2005; Hendrix, 2003; Suttle, 2005; Wilhelm et al., 2002).
T7 phage-assisted evolution of riboswitches using error-prone replication and dual selection
2024, Scientific ReportsStructure and proposed DNA delivery mechanism of a marine roseophage
2023, Nature Communications