Highly diverse and unknown viruses may enhance Antarctic endoliths’ adaptability

Cassandra L. Ettinger; Morgan Saunders; Laura Selbmann; Manuel Delgado-Baquerizo; Claudio Donati; Davide Albanese; Simon Roux; Susannah Tringe; Christa Pennacchio; Tijana G. del Rio; Jason E. Stajich; Claudia Coleine

doi:10.1101/2022.12.02.518905

Abstract

Rock-dwelling microorganisms are key players in ecosystem functioning of Antarctic ice free-areas. Yet, little is known about their diversity and ecology. Here, we performed metagenomic analyses on rocks from across Antarctica comprising >75,000 viral operational taxonomic units (vOTUS). We found largely undescribed, highly diverse and spatially structured virus communities potentially influencing bacterial adaptation and biogeochemistry. This catalog lays the foundation for expanding knowledge of the virosphere in extreme environments.

Main

Viruses are among the most prevalent entities on our planet, with the ability to infect organisms across all Domains¹. Sequencing advances are reshaping understanding of viral diversity across Earth’s diverse ecosystems, leading to a remarkable expansion of viral catalogs^2–4. It is becoming clear that viruses play key roles in global biogeochemical cycles through the modulation of host population dynamics, and that the better-studied pathogenic viruses represent only a small fraction of the virosphere^5–7. Further, through auxiliary metabolic genes (AMGs), some viruses can directly impact host metabolism to improve fitness⁸, including in extreme ecosystems.

Antarctic ice-free areas include several of the most inhospitable regions on Earth, among which is the Mars counterpart: the McMurdo Dry Valleys. In these locations, where rocks represent the main substratum, active life is possible for only a few specialized microorganisms; they survive by dwelling in porous rocks, forming self-sustaining ecosystems called endolithic communities^9,10are the primary life-forms present assuring the balance and functionality of these otherwise inert ecosystems. Recent studies have shed light on their biodiversity and adaptation, particularly the evolution of new and peculiar taxa spanning bacteria, fungi and archaea^10–13. However, the ecology and distribution of viral diversity from these communities remain wholly unknown and, to date, viral studies have instead focused on Antarctic freshwater lakes^14–16, surrounding oceans^17–19, and soils^20–23.

Here, we provide a large-scale viral catalog from 191 Antarctic endolith metagenomes. We sampled 37 localities across a broad range of environmental (e.g. 4 rock typologies, different altitudes and sun exposure) and spatial conditions (i.e. Antarctic Peninsula, Northern Victoria Land, and McMurdo Dry Valleys) (Table S1; Fig. 2A). We aimed to (i) untangle viral diversity in these communities, (ii) predict AMGs and how they may drive the fitness of their hosts, and (iii) explore ecological patterns (e.g., biogeography). This catalog is the first step toward understanding the role of viruses in regulating biogeochemical cycling in the coldest and driest region on Earth. This information is also critical for elucidating the role of viruses in whole community adaptation in a scenario of global warming and expanding desertification²⁴.

Figure 1. Antarctica is an underappreciated source of phage novelty.

(A) Bar charts displaying the number of viral sequences placed in VCs colored by rock type and divided by whether the VC is clustered with reference genomes. (B) Bar chart displaying host predictions colored by predicted host phylum. (C) Bar chart showing the number of predicted phage AMGs summarized by DRAM-v distilled metabolic categories.

Figure 2. Spatial structuring of viral communities in Antarctic rocks.

(A) Map showing collection sites with shapes and colors representative of the broad geographic area. (B) Stacked bar charts displaying the mean relative abundance of phage vOTUs at each site colored by predicted viral families. Sequences that were clustered into VCs with reference data are labeled by their taxonomy, sequences clustered without reference genomes are labeled “Unique VC”, while the rest are labeled based on their VContact2 status (i.e., singleton [share few or no genes with other genomes], overlap [share genes with genomes in multiple VCs], or outlier [share genes, but cannot confidently be placed in a VC]). (C) Principal-coordinate analysis (PCoA) visualization of Hellinger distances of viral communities. Samples are colored by site, with sites ordered by latitude, and have shapes based on geographic areas. (D) A scatter plot depicting a significant positive distance-decay relationship between sandstone viral community beta diversity (Hellinger distance) and geographical distance (km) between sites.

Using VirSorter2²⁵, we predicted 101,085 viral sequences. We clustered these at 95% average nucleotide identity into 76,984 viral operations taxonomic units (vOTUS)²⁶; we further used VContact2²⁷ with INPHARED²⁸ reference genomes to cluster phage vOTUs into 7,598 viral clusters (VCs), which approximate genus-level groupings based on gene-sharing networks. To keep analysis focused on the most robust catalog, we filtered this collection using community thresholds for length, detection, and quality (See Online Methods)^29–31. The final viral catalog represented 14,797 viral sequences, including 2,695 prophage, which clustered into 11,806 vOTUs, of which 5,743 phage vOTUs (7,309 sequences) were successfully placed in 2,286 VCs; the final catalog was predicted to predominantly be dsDNA phage, though 15.2% of vOTUs may represent eukaryotic viruses (i.e. NCLDVs).

Our findings indicate that Antarctic rock communities host highly diverse and novel phage populations, with only 1.8% (41 out of 2,286) of the VCs including reference sequences. The remaining 98.2% were unique VCs (i.e., did not include reference genomes), and could represent novel phage genera, greatly expanding the known diversity of viruses. Of the 41 VCs that did include reference genomes, the majority were assigned to the Caudoviricetes class (formerly Caudovirales order) of tailed double-stranded DNA bacteriophage (Fig. S1). Many genomes have not yet been reclassified, leaving viral taxonomy in flux; under the new schema, most of the 41 VCs are unclassified³². The majority of unique VCs are represented by viral sequences from sandstone communities (Fig. 1A), which represents an optimum substratum, in terms of rock traits (e.g. porosity), for endolithic colonization³³, but is also the most represented substratum in this work.

We further established host–virus linkages using NCBI BLAST against complete bacteria and archaea genomes from RefSeq, and Antarctic endolithic bacterial and archaeal metagenome-assembled genomes (MAGs) (see Online Methods)^34–36 to explore the potential effects of viruses on host fitness, such as host-cell reprogramming through AMGs³⁷. While we were unable to predict hosts for the majority of vOTUS, we observed that Proteobacteria, Actinobacteriota, and Chloroflexota were the most commonly predicted host phyla (Fig. 1B), which are thought to be core members of these communities^11,38,39. Using predictions against the Antarctic MAGs, we predicted hosts for an additional 16.5% of viral sequences (Fig. S2).

We then sought to improve understanding on the functional profiles of retrieved phages using DRAM-v⁴⁰. Notably, this catalog, which comprises metabolic novelty (39.3% of DRAM-v predicted AMGs had no distilled classification), may complement other available resources, which have largely been limited to coverage of human-related microbiomes (e.g. Li et al.⁴¹). Within identified functions, we found putative phage AMGs related to carbon, energy and nitrogen metabolisms (Fig. 1C). Specifically, within carbohydrate metabolism, glycoside hydrolases, glycosyltransferases, and carbohydrate-binding domains predominated. Within nitrogen metabolism, methionine degradation was the most prevalent module, and within energy, the dominant modules were related to electron transport and photosynthesis. This highlights the utility to connect vOTUs to Antarctic MAGs¹¹ and to implement complementary techniques (e.g. single-cell genomics) to provide a deeper understanding of virus-bacteria dynamics. More importantly, these findings underscore the complexity of virus-driven element biogeochemical cycles in the rocks of Antarctica, which have traditionally been considered devoid of life.

Given the geographic spread of sampling (see Online Methods and Table S1; Fig. 2A), we assessed whether this catalog could be useful to answer ecological questions related to viral community dynamics. While the dominant vOTUs at each site were taxonomically unclassified and largely members of unique VCs and thus possible novel genera (Fig. 2B), when investigating between-sample diversity (beta diversity) we observed a significant pattern related to site specificity (Fig. 2C; PERMANOVA, p < 0.001), and further detected significant distance decay across sandstone communities (Fig. 2D; r = 0.197, p < 0.001), indicating clear latitudinal spatial structuring of viral communities. In further support of this, we were able to detect only 41.0% of vOTUs at more than one site, with 29.4% of vOTUs detected across two or more geographic regions and only 1.45% detected across all regions. Of the vOTUs detected across all regions, the majority were in unique VCs (66.7%) and none were in VCs with reference data. We hypothesize that this viral spatial structuring reflects the reported dispersion limitation and local composition and adaptation of hosts in these communities^11,12. Similar spatial structuring has also been observed in grassland soil viromes, purportedly as a result of local assembly dynamics^42,43.

This study represents the most exhaustive geographic endeavor to date to capture the viral genomic diversity across ice-free regions of Antarctica and the first large-scale effort to explore the virosphere in endolithic communities. This catalog is a comprehensive repository for exploring the diversity, function, spatial ecology, and host-virus dynamics of this enigmatic continent. We also unveiled a possible influence of some viruses on carbon, energy and nitrogen metabolisms under conditions of oligotrophy up to the limit for life sustainability; this may be a key role for the resilience of these communities. This work is a good model for exploring adaptability of microbial communities in a scenario of global warming and expanding desertification.

Methods

Study area

191 rocks colonized by endolithic communities were collected in thirty-eight sites in Antarctica including Antarctic Peninsula (n = 3), McMurdo Dry Valleys, Southern Victoria Land (n = 80), and Northern Victoria Land (n = 108) during more than 20 years of Italian Antarctic Expeditions. Different rock typologies (sandstone n = 141, granite n = 43, quartz n = 5, and basalt/dolerite n = 2) were sampled. Samples were collected along a latitudinal transect ranging from -62.10008 -58.51664 to -77.874 160.739 at different environmental conditions namely sun exposure (northern sun exposed and southern shady rocks) and an altitudinal transect from sea level to 3,100 m above sea level (a.s.l.) to provide a comprehensive overview of Antarctic endolithic diversity (Supplementary Table 1). The presence of endolithic colonization was assessed by direct observation in situ. Rocks were excised using a geologic hammer and sterile chisel, and rock samples, preserved in sterile plastic bags, transported, and stored at −20 °C in the Culture Collection of Antarctic fungi of the Mycological Section of the Italian Antarctic National Museum (MNA-CCFEE), until downstream analysis.

Study data

In total, the dataset included 191 metagenomes, of which 100 have been assembled as described in Albanese et al.¹¹. The remaining metagenomes were generated, sequenced, and assembled as described below. The final metagenomic set represented 149,585,625 metagenomic contigs.

DNA extraction, library preparation, and sequencing

Total community DNA was extracted from 1 g of crushed rocks using DNeasy PowerSoil Pro Kit (Qiagen, German), quality checked by electrophoresis using a 1.5% agarose gel and Nanodrop spectrophotometer (Thermofisher, USA) and quantified using the Qubit dsDNA HS Assay Kit (Life Technologies, USA) according to Coleine et al.¹⁰. Shotgun metagenomic sequencing paired-end libraries were constructed and sequenced as 2×150 bp using the Illumina NovaSeq platform (Illumina Inc, San Diego, CA) at the Edmund Mach Foundation (San Michele all’Adige, Italy) and at the DOE Joint Genome Institute (JGI).

Sequencing reads preparation and assembly

The metashot/mag-illumina v2.0.0 workflow (https://github.com/metashot/mag-illumina, parameters: --metaspades_k 21,33,55,77,99) was used to perform raw reads quality trimming and filtering, assembly and contigs binning on the metagenomic samples. In brief, adapter trimming, contaminant (artifacts and and spike-ins) and quality filtering were performed using BBDuk (BBMap/BBTools v38.79, https://sourceforge.net/projects/bbmap/). During the quality filtering procedure i) raw reads were quality-trimmed to Q6 using the Phred algorithm; ii) reads that contained 4 or more “N” bases, had an average quality below 10, shorter than 50 bp or under 50% of the original length were removed. Samples were then assembled individually with SPAdes v3.15.1⁴⁴ (parameters –meta -k 21,33,55,77,99).

Identification and clustering of viral genomes

Using a workflow similar to Guo et al.⁴⁵, viral sequences were identified in metagenomic assemblies using VirSorter2 v. 2.2.3²⁵ using --min-length 5000, --min-score 0.5, and --include-groups dsDNAphage,NCLDV,RNA,ssDNA,lavidaviridae. CheckV v0.8.1³¹ was run on the VirSorter2 predicted viral sequences using the “end_to_end” workflow VirSorter2 was then run again on the viral sequences from CheckV workflow with the --prep-for-dramv option. DRAM-v v. 1.2.2⁴⁰ was then used to “annotate” sequences and then “distill” annotations into predicted auxiliary metabolic genes (AMGs) for phage.

Viral sequences were clustered into 95% similarity viral operational taxonomic units (vOTUs) using CD-HIT v. 4.8.1²⁶ with the following parameters: -c 0.95 -aS 0.85 -M 0 -d 0. Prodigal v. 2.6.3⁴⁶ was used to predict open reading frames in vOTUs using the -p meta option. VContact2 v. 0.9.19 was then run on predicted proteins from phage vOTUs and predicted proteins from the INPHARED August 2022 viral reference database to generate viral clusters (VCs) based on gene-sharing networks^27,28. We assigned taxonomy to phage vOTUs based on VC membership as in Santos-Medellin et al⁴². Predicted viral sequences and 95% similarity vOTUS are archived on Zenodo⁴⁷.

Viral host-prediction

Hosts were predicted for the phage sequences identified using (i) a database of complete genomes from NCBI RefSeq, and (ii) a previously published database of representative metagenome-assembled genomes (MAGs) from Antarctic endolith samples. To produce (i), we used “ncbi-genome-download” to download all complete bacterial (n = 25,984) and archaeal (n = 416) genomes, as of April 7, 2022, from NCBI RefSeq⁴⁸. For (ii), we downloaded MAGs from Zenodo (DOI: 10.5281/zenodo.7313591). We then used NCBI BLAST 2.12.0+ to convert these two databases into blast databases using “makeblastdb” and used “blastn” to compare vOTUs to these databases³⁴. We filtered the blastn results in R based on existing thresholds^35,36,49. Briefly, database matches had to share ≥ 2000 bp region with ≥ 70% sequence identity to the viral sequence and needed to have a bit score of ≥ 50 and minimum e-value of 0.001. Further to ensure matches did not represent partial or entirely viral contigs when searching against the MAG database, matches had to cover < 50% of the total MAG sequence length. As in Korthari et al.³⁶, only the top five hits matching these thresholds were considered, with host predictions made at each taxonomic level only if the taxonomy of all hits were in agreement. Discrepancies resulted in no host prediction for that taxonomic level. We then combined host predictions from both the RefSeq and MAG databases together; if there were discrepancies between the two databases, we defaulted to the MAG-based prediction.

Ecological analysis of vOTUs

We mapped reads from each metagenome to vOTUs using BBMap with a minid=0.90 to quantify vOTU relative abundance⁵⁰. We then used SAMtools to convert resulting sam files to bam files and genomecov from BEDTools to obtain coverage information for each vOTU across each metagenome^51,52. We then used bamM to parse bam files and calculate the trimmed pileup coverage (tpmean), which we used here in our analysis of viral relative abundance ⁵³. We removed vOTUs which displayed < 75% coverage over the length of the viral sequence and viral sequences < 10 kbp in length prior to downstream analyses in R⁵⁴. Thresholds for analysis of vOTUs were based on community guidelines for length (i.e. ≥ 10 kbp), similarity (i.e. ≥ 95% similarity), and detection (i.e. ≥ 75% of the viral genome length covered ≥ 1x by reads at ≥ 90% average nucleotide identity)^29,30. To be conservative, we also removed vOTUs with a CheckV quality score of “not-determined” prior to downstream analysis. The viral abundance (tpmean), quality, taxonomy and annotation results were imported, analyzed, and visualized in R using many packages including tidyverse and phyloseq^55,56. Analysis scripts associated with this study are on GitHub and archived in Zenodo⁵⁷.

To compare viral diversity between metagenomes (i.e., beta diversity), we calculated the Hellinger distance, the Euclidean distance of Hellinger transformed abundance data. We performed Hellinger transformations using the transform function in the microbiome R package, calculated the Hellinger distance using the ordinate function in phyloseq, and then visualized these distances using principal-coordinate analysis (PCoA). We performed permutational multivariate analyses of variance (PERMANOVAs) with 9,999 permutations to test for significant differences in mean centroids using the model: Distance ∼ Site + Rock type. Models were tested with “by = margins” and “by = terms” with all sequential combinations. We ran the ordistep and ordiR2step functions to help assess optimal parameters to include in the model. Since PERMANOVA tests are sensitive to differences in group dispersion, we also tested for significant differences in mean dispersions using the betadisper and permutest functions from the vegan package in R with 9,999 permutations.

To test for correlations between viral community distances (Hellinger distances) and geographic distances, we first subset the data to exclude metagenomes from the Antarctic Peninsula, and to account for variation between rock types, subset the data to include only metagenomes representing sandstone samples. We calculated geographical distances between metagenomes using the distm function in the geosphere package in R. We performed Mantel tests in the vegan R package to assess correlations between the community and geographic distances using 9,999 permutations. Mantel tests were repeated with exclusion of community distances when the geographic distance was zero to assess if patterns persisted in the absence of data from the same site.

Data availability

Metagenomes raw data are available under the NCBI accession numbers listed in Supplementary Table 1. Analysis scripts and intermediate data files associated with this study are on GitHub (https://github.com/stajichlab/Antarctic_Virus_Discovery) and archived in Zenodo (https://doi.org/10.5281/zenodo.7374327). Fasta files representing the entire catalog of predicted viral sequences and 95% similarity vOTUS are archived and available on Zenodo (https://doi.org/10.5281/zenodo.7245811).

Authors contribution

C.L.E., J.E.S., and C.C. conceived and designed the study. L.S. collected the samples. C.P, T.G.R. and S.T. oversaw and managed the metagenome sequencing and standard analysis. C.L.E. performed bioinformatic and statistical analysis with contributions from J.E.S. and M.S.. C.L.E. and C.C. interpreted results with contributions from M.S. and S.R.. C.L.E. and C.C. wrote the paper with contributions from all co-authors.

Acknowledgements

C.L.E. is supported by the National Science Foundation (NSF) under a NSF Ocean Sciences Postdoctoral Fellowship (Award No. 2205744). M.S. was supported by the NSF REU The National Summer Undergraduate Research Project, Award No. 2149582. C.C. is supported by the European Commission under the Marie Sklodowska-Curie Grant Agreement No. 702057 (DRYLIFE). C.C. and L.S. wish to thank the Italian National Antarctic Research Program for funding sampling campaigns and research activities in Italy in the frame of PNRA projects. The Italian Antarctic National Museum (MNA) is kindly acknowledged for financial support to the Mycological Section of the MNA and for providing rock samples used in this study stored in the Culture Collection of Antarctic fungi (MNA-CCFEE), University of Tuscia, Italy. M.D-B. is supported by a project from the Spanish Ministry of Science and Innovation (PID2020-115813RA-I00), and a project of the Fondo Europeo de Desarrollo Regional (FEDER) and the Consejería de Transformación Económica, Industria, Conocimiento y Universidades of the Junta de Andalucía (FEDER Andalucía 2014-2020 Objetivo temático ‘01 – Refuerzo de la investigación, el desarrollo tecnológico y la innovación’) associated with the research project P20_00879 (ANDABIOMA). J.E.S. is a CIFAR fellow in the Fungal Kingdom: Threats and Opportunities program. Data analyses performed at the High-Performance Computing Cluster at the University of California Riverside in the Institute of Integrative Genome Biology were supported by NSF grant DBI-1429826 and NIH grant S10-OD016290. Part of this work (proposals 10.46936/10.25585/60000791 and 10.46936/fics.proj.2020.51548/60000213) was conducted by the U.S. Department of Energy Joint Genome Institute (https://ror.org/04xm1d337), a DOE Office of Science User Facility, supported by the Office of Science of the US Department of Energy under Contract No. DE-AC02-05CH11231. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Footnotes

References

1.↵
Paez-Espino, D. et al. Uncovering Earth’s virome. Nature 536, 425–430 (2016).
OpenUrl CrossRef PubMed
2.↵
Kristensen, D. M., Mushegian, A. R., Dolja, V. V. & Koonin, E. V. New dimensions of the virus world discovered through metagenomics. Trends Microbiol. 18, 11–19 (2010).
OpenUrl CrossRef PubMed Web of Science
3.
Jin, M. et al. Diversities and potential biogeochemical impacts of mangrove soil viruses. Microbiome 7, 58 (2019).
OpenUrl CrossRef
4.↵
Hurwitz, B. L., Westveld, A. H., Brum, J. R. & Sullivan, M. B. Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses. Proc. Natl. Acad. Sci. U. S. A. 111, 10714–10719 (2014).
OpenUrl Abstract/FREE Full Text
5.↵
Kristensen, D. M., Mushegian, A. R., Dolja, V. V. & Koonin, E. V. New dimensions of the virus world discovered through metagenomics. Trends Microbiol. 18, 11–19 (2010).
OpenUrl CrossRef PubMed Web of Science
6.
Jin, M. et al. Diversities and potential biogeochemical impacts of mangrove soil viruses. Microbiome 7, 58 (2019).
OpenUrl CrossRef
7.↵
Hurwitz, B. L., Westveld, A. H., Brum, J. R. & Sullivan, M. B. Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses. Proc. Natl. Acad. Sci. U. S. A. 111, 10714–10719 (2014).
OpenUrl Abstract/FREE Full Text
8.↵
Breitbart, M., Thompson, L., Suttle, C. & Sullivan, M. Exploring the Vast Diversity of Marine Viruses. Oceanography vol. 20 135–139 Preprint at https://doi.org/10.5670/oceanog.2007.58 (2007).
OpenUrl
9.↵
Friedmann, E. I. Endolithic Microorganisms in the Antarctic Cold Desert. Science 215, 1045–1053 (1982).
OpenUrl Abstract/FREE Full Text
10.↵
Coleine, C., Biagioli, F., de Vera, J. P., Onofri, S. & Selbmann, L. Endolithic microbial composition in Helliwell Hills, a newly investigated Mars-like area in Antarctica. Environ. Microbiol. 23, 4002–4016 (2021).
OpenUrl
11.↵
Albanese, D. et al. Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages. Microbiome 9, 63 (2021).
OpenUrl
12.↵
Archer, S. D. J. et al. Endolithic microbial diversity in sandstone and granite from the McMurdo Dry Valleys, Antarctica. Polar Biology vol. 40 997–1006 Preprint at https://doi.org/10.1007/s00300-016-2024-9 (2017).
OpenUrl
13.↵
de la Torre, J. R., Goebel, B. M., Friedmann, E. I. & Pace, N. R. Microbial diversity of cryptoendolithic communities from the McMurdo Dry Valleys, Antarctica. Appl. Environ. Microbiol. 69, 3858–3867 (2003).
OpenUrl Abstract/FREE Full Text
14.↵
López-Bueno, A. et al. High diversity of the viral community from an Antarctic lake. Science 326, 858–861 (2009).
OpenUrl Abstract/FREE Full Text
15.
Prado, T. et al. Virome analysis in lakes of the South Shetland Islands, Antarctica - 2020. Sci. Total Environ. 852, 158537 (2022).
OpenUrl
16.↵
Zawar-Reza, P. et al. Diverse small circular single-stranded DNA viruses identified in a freshwater pond on the McMurdo Ice Shelf (Antarctica). Infection, Genetics and Evolution vol. 26 132–138 Preprint at https://doi.org/10.1016/j.meegid.2014.05.018 (2014).
OpenUrl
17.↵
Alarcón-Schumacher, T., Guajardo-Leiva, S., Antón, J. & Díez, B. Elucidating Viral Communities During a Phytoplankton Bloom on the West Antarctic Peninsula. Front. Microbiol. 10, 1014 (2019).
OpenUrl
18.
Miranda, J. A., Culley, A. I., Schvarcz, C. R. & Steward, G. F. RNA viruses as major contributors to Antarctic virioplankton. Environ. Microbiol. 18, 3714–3727 (2016).
OpenUrl CrossRef
19.↵
Gong, Z. et al. Viral Diversity and Its Relationship With Environmental Factors at the Surface and Deep Sea of Prydz Bay, Antarctica. Front. Microbiol. 9, 2981 (2018).
OpenUrl
20.↵
Zablocki, O. et al. High-level diversity of tailed phages, eukaryote-associated viruses, and virophage-like elements in the metaviromes of antarctic soils. Appl. Environ. Microbiol. 80, 6888–6897 (2014).
OpenUrl Abstract/FREE Full Text
21.
Adriaenssens, E. M. et al. Environmental drivers of viral community composition in Antarctic soils identified by viromics. Microbiome 5, 83 (2017).
OpenUrl CrossRef
22.
Zablocki, O. et al. Niche-dependent genetic diversity in Antarctic metaviromes. Bacteriophage 4, e980125 (2014).
OpenUrl CrossRef PubMed
23.↵
Bezuidt, O. K. I. et al. Phages Actively Challenge Niche Communities in Antarctic Soils. mSystems 5, (2020).
24.↵
Jansson, J. K. & Wu, R. Soil viral diversity, ecology and climate change. Nat. Rev. Microbiol. 1–16 (2022) doi:10.1038/s41579-022-00811-z.
OpenUrl CrossRef
25.↵
Guo, J. et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome 9, 37 (2021).
OpenUrl CrossRef
26.↵
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
OpenUrl CrossRef PubMed Web of Science
27.↵
Bin Jang, H. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 37, 632–639 (2019).
OpenUrl
28.↵
Cook, R. et al. INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes. PHAGE 2, 214–223 (2021).
OpenUrl CrossRef
29.↵
Roux, S. et al. Minimum Information about an Uncultivated Virus Genome (MIUViG). Nat. Biotechnol. 37, 29–37 (2018).
OpenUrl
30.↵
Roux, S., Emerson, J. B., Eloe-Fadrosh, E. A. & Sullivan, M. B. Benchmarking viromics: an evaluation of metagenome-enabled estimates of viral community composition and diversity. PeerJ 5, e3817 (2017).
OpenUrl CrossRef
31.↵
Nayfach, S. et al. CheckV assesses the quality and completeness of metagenomeassembled viral genomes. Nat. Biotechnol. 39, 578–585 (2021).
OpenUrl CrossRef
32.↵
Turner, D., Kropinski, A. M. & Adriaenssens, E. M. A Roadmap for Genome-Based Phage Taxonomy. Viruses 13, (2021).
33.↵
Selbmann, L. et al. Effect of environmental parameters on biodiversity of the fungal component in lithic Antarctic communities. Extremophiles 21, 1069–1080 (2017).
OpenUrl CrossRef
34.↵
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 1–9 (2009).
OpenUrl CrossRef PubMed
35.↵
Edwards, R. A., McNair, K., Faust, K., Raes, J. & Dutilh, B. E. Computational approaches to predict bacteriophage-host relationships. FEMS Microbiol. Rev. 40, 258–272 (2016).
OpenUrl CrossRef PubMed
36.↵
Kothari, A. et al. Ecogenomics of Groundwater Phages Suggests Niche Differentiation Linked to Specific Environmental Tolerance. mSystems e0053721 (2021) doi:10.1128/mSystems.00537-21.
OpenUrl CrossRef
37.↵
Jarett, J. K. et al. Insights into the dynamics between viruses and their hosts in a hot spring microbial mat. The ISME Journal vol. 14 2527–2541 Preprint at https://doi.org/10.1038/s41396-020-0705-4 (2020).
OpenUrl
38.↵
Coleine, C. et al. Altitude and fungal diversity influence the structure of Antarctic cryptoendolithic Bacteria communities. Environ. Microbiol. Rep. 11, 718–726 (2019).
OpenUrl CrossRef
39.↵
Coleine, C. et al. Antarctic Cryptoendolithic Fungal Communities Are Highly Adapted and Dominated by Lecanoromycetes and Dothideomycetes. Front. Microbiol. 9, 1392 (2018).
OpenUrl CrossRef
40.↵
Shaffer, M. et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 48, 8883–8900 (2020).
OpenUrl CrossRef
41.↵
Li, S. et al. A catalog of 48,425 nonredundant viruses from oral metagenomes expands the horizon of the human oral virome. iScience 25, 104418 (2022).
OpenUrl
42.↵
Santos-Medellin, C. et al. Viromes outperform total metagenomes in revealing the spatiotemporal patterns of agricultural soil viral communities. ISME J. 15, 1956–1970 (2021).
OpenUrl
43.↵
Santos-Medellín, C. et al. Spatial turnover of soil viral populations and genotypes overlain by cohesive responses to moisture in grasslands. PNAS 119, e2209132119 (2022).
OpenUrl
44.↵
Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
OpenUrl CrossRef PubMed
45.↵
Guo, J., Vik, D., Pratama, A. A., Roux, S. & Sullivan, M. Viral sequence identification SOP with VirSorter2. protocols.io https://www.protocols.io/view/viral-sequence-identification-sop-with-virsorter2-btv8nn9w (2021).
46.↵
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
OpenUrl CrossRef PubMed
47.↵
Ettinger, C., Stajich, J. & Coleine, C. Antarctic Rock Viral Catalog. (2022) doi:10.5281/zenodo.7245811.
OpenUrl CrossRef
48.↵
kblin/ncbi-genome-download: Scripts to download genomes from the NCBI FTP servers. GitHub https://github.com/kblin/ncbi-genome-download.
49.↵
Nayfach, S. et al. A genomic catalog of Earth’s microbiomes. Nat. Biotechnol. 39, 499–509 (2020).
OpenUrl
50.↵
Bushnell, B. BBMap. SourceForge https://sourceforge.net/projects/bbmap/(2022).
51.↵
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
OpenUrl CrossRef PubMed Web of Science
52.↵
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
OpenUrl CrossRef PubMed Web of Science
53.↵
Ecogenomics/BamM: Metagenomics-focused BAM file manipulation. GitHub https://github.com/Ecogenomics/BamM.
54.↵
R Core Team. R: A Language and Environment for Statistical Computing. (2021).
55.↵
McMurdie, P. J. & Holmes, S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLoS One 8, e61217 (2013).
OpenUrl CrossRef PubMed
56.↵
Wickham, H. et al. Welcome to the Tidyverse. Journal of Open Source Software 4, 1686 (2019).
OpenUrl
57.↵
Ettinger, C., Stajich, J. & Coleine, C. stajichlab/Antarctic_Virus_Discovery: v2. (2022). doi:10.5281/zenodo.7374327.
OpenUrl CrossRef

View the discussion thread.

Posted December 03, 2022.

Download PDF

Supplementary Material

Data/Code

Citation Tools

Subject Area

Microbiology

Subject Areas

All Articles

Animal Behavior and Cognition (5197)
Biochemistry (11699)
Bioengineering (8715)
Bioinformatics (29119)
Biophysics (14927)
Cancer Biology (12047)
Cell Biology (17347)
Clinical Trials (138)
Developmental Biology (9405)
Ecology (14138)
Epidemiology (2067)
Evolutionary Biology (18261)
Genetics (12216)
Genomics (16760)
Immunology (11839)
Microbiology (27996)
Molecular Biology (11549)
Neuroscience (60781)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3228)
Physiology (4937)
Plant Biology (10382)
Scientific Communication and Education (1679)
Synthetic Biology (2876)
Systems Biology (7332)
Zoology (1642)

[1] 1.↵
Paez-Espino, D. et al. Uncovering Earth’s virome. Nature 536, 425–430 (2016).
OpenUrl CrossRef PubMed

[2] 2.↵
Kristensen, D. M., Mushegian, A. R., Dolja, V. V. & Koonin, E. V. New dimensions of the virus world discovered through metagenomics. Trends Microbiol. 18, 11–19 (2010).
OpenUrl CrossRef PubMed Web of Science

[3] 3.
Jin, M. et al. Diversities and potential biogeochemical impacts of mangrove soil viruses. Microbiome 7, 58 (2019).
OpenUrl CrossRef

[4] 4.↵
Hurwitz, B. L., Westveld, A. H., Brum, J. R. & Sullivan, M. B. Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses. Proc. Natl. Acad. Sci. U. S. A. 111, 10714–10719 (2014).
OpenUrl Abstract/FREE Full Text

[5] 5.↵
Kristensen, D. M., Mushegian, A. R., Dolja, V. V. & Koonin, E. V. New dimensions of the virus world discovered through metagenomics. Trends Microbiol. 18, 11–19 (2010).
OpenUrl CrossRef PubMed Web of Science

[6] 6.
Jin, M. et al. Diversities and potential biogeochemical impacts of mangrove soil viruses. Microbiome 7, 58 (2019).
OpenUrl CrossRef

[7] 7.↵
Hurwitz, B. L., Westveld, A. H., Brum, J. R. & Sullivan, M. B. Modeling ecological drivers in marine viral communities using comparative metagenomics and network analyses. Proc. Natl. Acad. Sci. U. S. A. 111, 10714–10719 (2014).
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Breitbart, M., Thompson, L., Suttle, C. & Sullivan, M. Exploring the Vast Diversity of Marine Viruses. Oceanography vol. 20 135–139 Preprint at https://doi.org/10.5670/oceanog.2007.58 (2007).
OpenUrl

[9] 9.↵
Friedmann, E. I. Endolithic Microorganisms in the Antarctic Cold Desert. Science 215, 1045–1053 (1982).
OpenUrl Abstract/FREE Full Text

[10] 10.↵
Coleine, C., Biagioli, F., de Vera, J. P., Onofri, S. & Selbmann, L. Endolithic microbial composition in Helliwell Hills, a newly investigated Mars-like area in Antarctica. Environ. Microbiol. 23, 4002–4016 (2021).
OpenUrl

[11] 11.↵
Albanese, D. et al. Pre-Cambrian roots of novel Antarctic cryptoendolithic bacterial lineages. Microbiome 9, 63 (2021).
OpenUrl

[12] 12.↵
Archer, S. D. J. et al. Endolithic microbial diversity in sandstone and granite from the McMurdo Dry Valleys, Antarctica. Polar Biology vol. 40 997–1006 Preprint at https://doi.org/10.1007/s00300-016-2024-9 (2017).
OpenUrl

[13] 13.↵
de la Torre, J. R., Goebel, B. M., Friedmann, E. I. & Pace, N. R. Microbial diversity of cryptoendolithic communities from the McMurdo Dry Valleys, Antarctica. Appl. Environ. Microbiol. 69, 3858–3867 (2003).
OpenUrl Abstract/FREE Full Text

[14] 14.↵
López-Bueno, A. et al. High diversity of the viral community from an Antarctic lake. Science 326, 858–861 (2009).
OpenUrl Abstract/FREE Full Text

[15] 15.
Prado, T. et al. Virome analysis in lakes of the South Shetland Islands, Antarctica - 2020. Sci. Total Environ. 852, 158537 (2022).
OpenUrl

[16] 16.↵
Zawar-Reza, P. et al. Diverse small circular single-stranded DNA viruses identified in a freshwater pond on the McMurdo Ice Shelf (Antarctica). Infection, Genetics and Evolution vol. 26 132–138 Preprint at https://doi.org/10.1016/j.meegid.2014.05.018 (2014).
OpenUrl

[17] 17.↵
Alarcón-Schumacher, T., Guajardo-Leiva, S., Antón, J. & Díez, B. Elucidating Viral Communities During a Phytoplankton Bloom on the West Antarctic Peninsula. Front. Microbiol. 10, 1014 (2019).
OpenUrl

[18] 18.
Miranda, J. A., Culley, A. I., Schvarcz, C. R. & Steward, G. F. RNA viruses as major contributors to Antarctic virioplankton. Environ. Microbiol. 18, 3714–3727 (2016).
OpenUrl CrossRef

[19] 19.↵
Gong, Z. et al. Viral Diversity and Its Relationship With Environmental Factors at the Surface and Deep Sea of Prydz Bay, Antarctica. Front. Microbiol. 9, 2981 (2018).
OpenUrl

[20] 20.↵
Zablocki, O. et al. High-level diversity of tailed phages, eukaryote-associated viruses, and virophage-like elements in the metaviromes of antarctic soils. Appl. Environ. Microbiol. 80, 6888–6897 (2014).
OpenUrl Abstract/FREE Full Text

[21] 21.
Adriaenssens, E. M. et al. Environmental drivers of viral community composition in Antarctic soils identified by viromics. Microbiome 5, 83 (2017).
OpenUrl CrossRef

[22] 22.
Zablocki, O. et al. Niche-dependent genetic diversity in Antarctic metaviromes. Bacteriophage 4, e980125 (2014).
OpenUrl CrossRef PubMed

[23] 23.↵
Bezuidt, O. K. I. et al. Phages Actively Challenge Niche Communities in Antarctic Soils. mSystems 5, (2020).

[24] 24.↵
Jansson, J. K. & Wu, R. Soil viral diversity, ecology and climate change. Nat. Rev. Microbiol. 1–16 (2022) doi:10.1038/s41579-022-00811-z.
OpenUrl CrossRef

[25] 25.↵
Guo, J. et al. VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses. Microbiome 9, 37 (2021).
OpenUrl CrossRef

[26] 26.↵
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Bin Jang, H. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Nat. Biotechnol. 37, 632–639 (2019).
OpenUrl

[28] 28.↵
Cook, R. et al. INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes. PHAGE 2, 214–223 (2021).
OpenUrl CrossRef

[29] 29.↵
Roux, S. et al. Minimum Information about an Uncultivated Virus Genome (MIUViG). Nat. Biotechnol. 37, 29–37 (2018).
OpenUrl

[30] 30.↵
Roux, S., Emerson, J. B., Eloe-Fadrosh, E. A. & Sullivan, M. B. Benchmarking viromics: an evaluation of metagenome-enabled estimates of viral community composition and diversity. PeerJ 5, e3817 (2017).
OpenUrl CrossRef

[31] 31.↵
Nayfach, S. et al. CheckV assesses the quality and completeness of metagenomeassembled viral genomes. Nat. Biotechnol. 39, 578–585 (2021).
OpenUrl CrossRef

[32] 32.↵
Turner, D., Kropinski, A. M. & Adriaenssens, E. M. A Roadmap for Genome-Based Phage Taxonomy. Viruses 13, (2021).

[33] 33.↵
Selbmann, L. et al. Effect of environmental parameters on biodiversity of the fungal component in lithic Antarctic communities. Extremophiles 21, 1069–1080 (2017).
OpenUrl CrossRef

[34] 34.↵
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinformatics 10, 1–9 (2009).
OpenUrl CrossRef PubMed

[35] 35.↵
Edwards, R. A., McNair, K., Faust, K., Raes, J. & Dutilh, B. E. Computational approaches to predict bacteriophage-host relationships. FEMS Microbiol. Rev. 40, 258–272 (2016).
OpenUrl CrossRef PubMed

[36] 36.↵
Kothari, A. et al. Ecogenomics of Groundwater Phages Suggests Niche Differentiation Linked to Specific Environmental Tolerance. mSystems e0053721 (2021) doi:10.1128/mSystems.00537-21.
OpenUrl CrossRef

[37] 37.↵
Jarett, J. K. et al. Insights into the dynamics between viruses and their hosts in a hot spring microbial mat. The ISME Journal vol. 14 2527–2541 Preprint at https://doi.org/10.1038/s41396-020-0705-4 (2020).
OpenUrl

[38] 38.↵
Coleine, C. et al. Altitude and fungal diversity influence the structure of Antarctic cryptoendolithic Bacteria communities. Environ. Microbiol. Rep. 11, 718–726 (2019).
OpenUrl CrossRef

[39] 39.↵
Coleine, C. et al. Antarctic Cryptoendolithic Fungal Communities Are Highly Adapted and Dominated by Lecanoromycetes and Dothideomycetes. Front. Microbiol. 9, 1392 (2018).
OpenUrl CrossRef

[40] 40.↵
Shaffer, M. et al. DRAM for distilling microbial metabolism to automate the curation of microbiome function. Nucleic Acids Res. 48, 8883–8900 (2020).
OpenUrl CrossRef

[41] 41.↵
Li, S. et al. A catalog of 48,425 nonredundant viruses from oral metagenomes expands the horizon of the human oral virome. iScience 25, 104418 (2022).
OpenUrl

[42] 42.↵
Santos-Medellin, C. et al. Viromes outperform total metagenomes in revealing the spatiotemporal patterns of agricultural soil viral communities. ISME J. 15, 1956–1970 (2021).
OpenUrl

[43] 43.↵
Santos-Medellín, C. et al. Spatial turnover of soil viral populations and genotypes overlain by cohesive responses to moisture in grasslands. PNAS 119, e2209132119 (2022).
OpenUrl

[44] 44.↵
Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
OpenUrl CrossRef PubMed

[45] 45.↵
Guo, J., Vik, D., Pratama, A. A., Roux, S. & Sullivan, M. Viral sequence identification SOP with VirSorter2. protocols.io https://www.protocols.io/view/viral-sequence-identification-sop-with-virsorter2-btv8nn9w (2021).

[46] 46.↵
Hyatt, D. et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11, 119 (2010).
OpenUrl CrossRef PubMed

[47] 47.↵
Ettinger, C., Stajich, J. & Coleine, C. Antarctic Rock Viral Catalog. (2022) doi:10.5281/zenodo.7245811.
OpenUrl CrossRef

[48] 48.↵
kblin/ncbi-genome-download: Scripts to download genomes from the NCBI FTP servers. GitHub https://github.com/kblin/ncbi-genome-download.

[49] 49.↵
Nayfach, S. et al. A genomic catalog of Earth’s microbiomes. Nat. Biotechnol. 39, 499–509 (2020).
OpenUrl

[50] 50.↵
Bushnell, B. BBMap. SourceForge https://sourceforge.net/projects/bbmap/(2022).

[51] 51.↵
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
OpenUrl CrossRef PubMed Web of Science

[52] 52.↵
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
OpenUrl CrossRef PubMed Web of Science

[53] 53.↵
Ecogenomics/BamM: Metagenomics-focused BAM file manipulation. GitHub https://github.com/Ecogenomics/BamM.

[54] 54.↵
R Core Team. R: A Language and Environment for Statistical Computing. (2021).

[55] 55.↵
McMurdie, P. J. & Holmes, S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLoS One 8, e61217 (2013).
OpenUrl CrossRef PubMed

[56] 56.↵
Wickham, H. et al. Welcome to the Tidyverse. Journal of Open Source Software 4, 1686 (2019).
OpenUrl

[57] 57.↵
Ettinger, C., Stajich, J. & Coleine, C. stajichlab/Antarctic_Virus_Discovery: v2. (2022). doi:10.5281/zenodo.7374327.
OpenUrl CrossRef