HydDB: A web tool for hydrogenase classification and analysis

Dan Søndergaard; Christian N. S. Pedersen; Chris Greening

doi:10.1101/061994

Abstract

H₂ metabolism is the most ancient and diverse mechanism of energy-generation. The metalloenzymes mediating this metabolism, hydrogenases, are encoded by over 60 microbial phyla and are present in all major ecosystems. We developed a classification system and web tool, HydDB, for the structural and functional analysis of these enzymes. We show that hydrogenase function can be predicted by primary sequence alone using an expanded classification scheme (comprising 29 [NiFe]-hydrogenase subgroups, 8 [FeFe]-hydrogenase subtypes, [Fe]-hydrogenases). Using this scheme, we built a web tool that rapidly and reliably classifies hydrogenase primary sequences using a combination of k-nearest neighbors’ algorithms and CDD referencing. Demonstrating its capacity, the tool reliably predicted hydrogenase content and function in 12 newly-sequenced bacteria, archaea, and eukaryotes. HydDB also provides the capacity to browse 3248 annotated sequences and contains a detailed repository of physiological, biochemical, and structural information about the 38 hydrogenase classes defined here. The database and classifier are freely and publicly available at http://services.birc.au.dk/hyddb/

Introduction

Microorganisms conserve energy by metabolizing H₂. Oxidation of this high-energy fuel yields electrons that can be used for respiration and carbon-fixation. This diffusible gas is also produced in diverse fermentation and anaerobic respiratory processes ¹. H₂ metabolism contributes to the growth and survival of microorganisms across the three domains of life: chemotrophs and phototrophs, lithotrophs and heterotrophs, aerobes and anaerobes, mesophiles and extremophiles alike ^1,2. On the ecosystem scale, H₂ supports microbial communities in most terrestrial, aquatic, and host-associated ecosystems ^1,3. It is also generally accepted that H₂ was the primordial electron donor ⁴. In biological systems, metalloenzymes known as hydrogenases are responsible for oxidizing and evolving H₂ ^1,5. Our recent survey showed there is a far greater number and diversity of hydrogenases than previously thought ². It is predicted over 55 microbial phyla and up to half of all microorganisms harbor hydrogenases ^2,6. Better understanding H₂ metabolism and the enzymes that mediate it also has wider implications, particularly in relation to human health and disease ^3,7, biogeochemical cycling ⁸, and renewable energy ^9,10.

There are three classes of hydrogenase, the [NiFe], [FeFe], and [Fe] hydrogenases, that are distinguished by their metal composition. Whereas the [Fe]-hydrogenases are a small methanogenic-specific family ¹¹, the [NiFe] and [FeFe] classes are widely distributed and functionally diverse. They comprise numerous different groups and subgroups/subtypes with distinct biochemical features (e.g. directionality, affinity, redox partners, and localization) and physiological roles (i.e. respiration, fermentation, bifurcation, sensing) ^1,5. For example, while Group 2a and 2b [NiFe]-hydrogenases share > 35% sequence identity, they have distinct roles as respiratory uptake hydrogenases and H₂ sensors respectively ^12,13. Building on previous work ^14,15, we recently created a comprehensive hydrogenase classification scheme predictive of biological function ². This scheme was primarily based on amino acid sequence phylogeny, but also factored in genetic organization, metal-binding motifs, and functional information. This analysis identified 22 subgroups (within four groups) of [NiFe]-hydrogenases and six subtypes (within three groups) of [FeFe]-hydrogenases, each with unique physiological roles ².

In this work, we build on these findings to develop the first web database for the classification and analysis of hydrogenases. We developed an expanded classification scheme that captures the full sequence diversity of hydrogenase enzymes and predicts their biological function. Using this information, we developed a classification tool based on the k-nearest neighbors’ (k-NN) method. This tool is a more reliable, efficient, and user-friendly method for hydrogenase classification than standard approaches involved phylogenetic tree construction, with a precision of more than 99.8%.

Results and Discussion

A sequence-based classification scheme for hydrogenases

We initially developed a classification scheme to enable prediction of hydrogenase function by primary sequence alone. To do this, we visualized the relationships between all hydrogenases in sequence similarity networks ¹⁶, in which nodes represent individual proteins and the distances between them reflect BLAST E-values. As reflected by our analysis of other protein superfamilies ^17,18, SSNs allow robust inference of sequence-structure-function relationships for large datasets without the problems associated with phylogenetic trees (e.g. long-branch attraction). Consistent with previous phylogenetic analyses ^2,14,15, this analysis showed the hydrogenase sequences clustered into eight major groups (Groups 1 to 4 [NiFe]-hydrogenases, Groups A to C [FeFe]-hydrogenases, [Fe]-hydrogenases), six of which separate into multiple functionally-distinct subgroups or subtypes at narrower logE filters (Figure 1; Figure S1). The SSNs demonstrated that all [NiFe]-hydrogenase subgroups defined through phylogenetic trees in our previous work ² separated into distinct clusters, which is consistent with our evolutionary model that such hydrogenases diverged from a common ancestor to adopt multiple distinct functions ². The only exception were the Group A [FeFe]-hydrogenases, which as previously-reported ^2,15, cannot be classified by sequence alone as they have principally diversified through changes in domain architecture and quaternary structure. It remains strictly necessary to analyze the organization of the genes encoding these enzymes to determine their specific function, e.g. whether they serve fermentative or electron-bifurcating roles.

Figure 1.

Sequence similarity network of hydrogenase sequences. Nodes represent individual proteins and the edges show the BLAST E-values between them at the logE filter defined at the bottom-left of each panel. The sequences are colored by class as defined in the legends. Figure S1 shows the further delineation of the encircled [NiFe] hydrogenase classes.

The SSN analysis revealed that several groups and subgroups that clustered together in the phylogenetic tree analysis ² separate into several subclades of probable distinct function (Figure 1). On this basis, we refined and expanded the hydrogenase classification scheme to reflect the sequence diversification observed (Table 1). Three lineages originally classified as Group 1a [NiFe]-hydrogenases were reclassified as new subgroups, the Coriobacteria (Group 1 i), Archaeoglobi (Group 1j), and Methanosarcinales (Group 1i). The previously-defined 4b and 4d subgroups ² were dissolved, as the SSN analysis confirmed they were highly polyphyletic. These sequences are reclassified here into five new subgroups: the formate-and carbon monoxide-respiring Mrp-linked complexes (Group 4b) ¹⁹, the ferredoxin-coupled Mrp-linked complexes (Group 4d) ²⁰, the well-described methanogenic Eha (Group 4h) and Ehb (Group 4i) supercomplexes ²¹, and a more loosely clustered class of unknown function (Group 4g). Three crenarchaeotal hydrogenases were also classified as their own family (Group 2e); these enzymes enable certain crenarchaeotes to grow aerobically on O₂ ^22,23 and hence may represent a unique lineage of aerobic uptake hydrogenases currently underrepresented in genome databases. The Group C [FeFe]-hydrogenases were also separated into three main subtypes given they separate into distinct clusters even at relatively broad logE values (Figure 1); these enzymes likely have a sensory role ^2,15 and are each co-transcribed with different regulatory elements (Table 1).

View this table:

Table 1.

Expanded classification scheme for hydrogenase enzymes. The majority of the classes were defined in previous work ^2,14,15,40. The [NiFe] Group 1 i, 1j, 1j, 2e, 4d, 4g, 4h, and 4i enzymes and [FeFe] Groups C1, C2, and C3 enzymes were defined in this work based on their separation into distinct clusters in the SSN analysis (Figure 1). HydDB contains detailed information on each of these classes, including their taxonomic distribution, genetic organization, biochemistry, and structures, as well a list of primary references.

HydDB reliably predicts hydrogenase class using the k-NN method and CDD referencing

Using this information, we built a web tool to classify hydrogenases. Hydrogenase classification is determined through a two-step process following input of the catalytic subunit sequence. In the first, the Conserved Domain Database (CDD) ²⁴ is referenced to confirm that the inputted sequence has a hydrogenase catalytic domain, i.e. “Complex1_49kDa superfamily” (cl21493) (for NiFe-hydrogenases), “Fe_hyd_lg_C superfamily” (cl14953) (for FeFe-hydrogenases), and “HMD” (pfam03201) (for Fe-hydrogenases). The sequence is subsequently classified through the k-NN method that determines the most similar sequences listed in the HydDB reference database. To determine the optimal k for the dataset, we performed a 5-fold cross-validation for k = 1…10 and computed the accuracy for each k. The results are shown in Figure 2. The classifier predicted the classes of the 3248 hydrogenase sequences with 99.8% accuracy and high robustness when performing a 5-fold cross-validation (as described in the Methods section) for k = 4. The six sequences where there were discrepancies between the SSN and k-NN predictions are shown in Table S1. The classifier has also been trained to detect and exclude protein families that are homologous to hydrogenases but do not metabolize H₂ (Nuo, Ehr, NARF, HmdII ^1,2) using reference sequences of these proteins.

Figure 2.

Evaluation of the k-NN classifier for k = 1‖10. For each k, a 5-fold crossvalidation was performed. The mean accuracy ± two standard deviations of the folds is shown in the figure (note the y-axis). k = 1 provides the most accurate classifier. However, k = 4 provides almost the same accuracy and is more robust to errors in the training set (reflected by the lower standard deviation). In general, the standard deviation is very small, indicating that the predictions are robust to changes in the training data.

Sequences of the [FeFe] Group A can be classified into functionally-distinct subtypes (A1, A2, A3, A4) based on genetic organization ². The classifier can classify such hydrogenases if the protein sequence immediately downstream from the catalytic subunit sequence is provided. The classifier references the CDD to search for conserved domains in the downstream protein sequence. A sequence is classified as [FeFe] Group A2 if one of the domains “GltA”, “GltD”, “glutamate synthase small subunit” or “putative oxidoreductase”, but not “NuoF”, is found in the sequence. Sequences are classified as [FeFe] Group A3 if the domain “NuoF” is found and [FeFe] Group A4 if the domain “HycB” is present. If none of the domains are found, the sequence is classified as A1. These classification rules were determined by collecting 69 downstream protein sequences. The sequences were then submitted to the CDD and the domains which most often occurred in each subtype were extracted.

In addition to its accuracy, the classifier is superior to other approaches due to its usability (Figure S2). It is accessible as a free web service at http://services.birc.au.dk/hyddb/ HydDB allows the users to paste or upload sequences of hydrogenase catalytic subunit sequences in FASTA format and run the classification. When analysis has completed, results are presented in a table that can be downloaded as a CSV file. This provides an efficient and user-friendly way to classify hydrogenases, in contrast to the previous standard which requires visualization of multiple sequence alignments in phylogenetic trees ²⁵.

HydDB infers the physiological roles of H₂ metabolism

As summarized in Table 1, hydrogenase class is strongly correlated with physiological role. As a result, the classifier is capable of predicting both the class and function of a sequenced hydrogenase. To demonstrate this capacity, we used HydDB to analyze the hydrogenases present in 12 newly-sequenced bacteria, archaea, and eukaryotes of major ecological significance. The classifier correctly classified all 24 hydrogenases identified in the sequenced genomes, as validated with SSNs (Table 2). On the basis of these classifications, the physiological roles of H₂ metabolism were predicted (Table 2). For five of the organisms, these predictions are confirmed or supported by previously published data ^{23,26–⇓⇓29}. Other predictions are in line with metabolic models derived from metagenome surveying ^30–⇓32. In some cases, the capacity for organisms to metabolize H₂ was not tested or inferred in previous studies despite the presence of hydrogenases in the sequenced genomes ^{27,33–⇓35}.

While HydDB serves as a reliable initial predictor of hydrogenase class and function, further analysis is recommended to verify predictions. Hydrogenase sequences only provide organism with the genetic capacity to metabolise H₂; their function is ultimately modulated by their expression and integration within the cell ^1,36. In addition, some classifications are likely to be overgeneralized due to lack of functional and biochemical characterization of certain lineages and sublineages. For example, it is not clear if two distant members of the Group 1h [NiFe]-hydrogenases (Robiginitalea biformata, Sulfolobus islandicus) perform the same H₂-scavenging functions as the core group ⁸. Likewise, it seems probable that the Group 3a [NiFe]-hydrogenases of Thermococci and Aquificae use a distinct electron donor to the main class ³⁷. Prominent cautions are included in the enzyme pages in cases such as these. HydDB will be updated when literature is published that influences functional assignments.

HydDB contains interfaces for hydrogenase browsing and analyzing

In addition to its classification function, HydDB is designed to be a definitive repository for hydrogenase retrieval and analysis. The database presently contains entries for 3248 hydrogenases, including their NCBI accession numbers, amino acid sequence, hydrogenase class, taxonomic affiliation, and predicted behavior (Figure S2). To enable easy exploration of the data set, the database also provides access to an interface for searching, filtering, and sorting the data, as well as the capacity to download the results in CSV or FASTA format. There are individual pages for the 38 hydrogenase classes defined here (Table 1), including descriptions of their physiological role, genetic organization, taxonomic distribution, and biochemical features. This is supplemented with a compendium of structural information about the hydrogenases, which is integrated with the Protein Databank (PDB), as well as a library of over 1000 literature references (Figure S5).

View this table:

Table 2.

Predictive capacity of the HydDB. HydDB accurately determined hydrogenase content and predicted the physiological roles of H₂ metabolism in 12 newly-sequenced archaeal and bacterial species.

Conclusions

To summarize, HydDB is a definitive resource for hydrogenase classification and analysis. The classifier described here provides a reliable, efficient, and convenient tool for hydrogenase classification and functional prediction. HydDB also provides browsing tools for the rapid analysis and retrieval of hydrogenase sequences. Finally, the manually-curated repository of class descriptions, hydrogenase structures, and literature references provide a deep but accessible resource for understanding hydrogenases.

Materials and Methods

Sequence datasets

The database was constructed using the amino acid sequences of all curated non-redundant 3248 hydrogenase catalytic subunits represented in the NCBI RefSeq database in August 2014 ² (Dataset S1). In order to test the classification tool, additional sequences from newly-sequenced archaeal and bacteria phyla were retrieved from the Joint Genome Institute's Integrated Microbial Genomes database ³⁸.

Sequence similarity networks

Sequence similarity networks (SSNs) ¹⁶ were used to visualize the distribution and diversity of the 3248 retrieved hydrogenase sequences. In this analysis, nodes represent individual proteins and edges represent the all-versus-all BLAST E-values. Three networks were constructed using Cytoscape, namely for the [NiFe]-hydrogenase large subunit sequences, [FeFe]-hydrogenase catalytic domain sequences, and [Fe]-hydrogenase sequences. The relationships between them were viewed at different logE cutoffs using different subsets of sequences.

Classification method

The k-NN method is a well-known machine learning method for classification ³⁹. Given a set of data points x₁,x₂, …x_N (e.g. sequences) with known labels y₁,y₂, …,y_N (e.g. type annotations), the label of a point, x, is predicted by computing the distance from x to x₁,x₂, …x_N and extracting the k labeled points closest to x, i.e. the neighbors. The predicted label is then determined by majority vote of the labels of the neighbors. The distance measure applied here is that of a BLAST search. Thus, the classifier corresponds to a homology search where the types of the top k results are considered. However, formulating the classification method as a machine learning problem allows the use of common evaluation methods to estimate the accuracy of the method and perform model selection. The classifier was evaluated using fc-fold cross-validation. The dataset is first split in to k parts of equal size. k − 1 parts (the training set) are then used for training the classifier and the labels of the data points in the remaining part (the test set) are then predicted. This process, called a fold, is repeated k times. The predicted labels of each fold are then compared to the known labels and an accuracy can be computed.

Author Contributions

CG and DS designed experiments. DS and CG performed experiments. CG, DS, and CNSP analysed data. CNSP supervised students. CG and DS wrote the paper.

The authors declare no conflict of interest.

Acknowledgements

We thank A/Prof Colin J. Jackson, Dr Hafna Ahmed, Dr Andrew Warden, and Dr Stephen Pearce for their helpful advice and comments regarding this manuscript. This work was supported by a PUMPkin Centre of Excellence PhD Scholarship awarded to DS, an Australian National University PhD Scholarship awarded to FHA, and a CSIRO Office of the Chief Executive Postdoctoral Fellowship awarded to CG.

References

1.↵
Schwartz, E., Fritsch, J. & Friedrich, B. H₂-metabolizing prokaryotes. (Springer Berlin Heidelberg, 2013).
2.↵
Greening, C. et al. Genome and metagenome surveys of hydrogenase diversity indicate H₂ is a widely-utilised energy source for microbial growth and survival. ISME J. 10, 761–777 (2016).
OpenUrl CrossRef PubMed
3.↵
1. Poole, R. K.
Cook, G. M., Greening, C., Hards, K. & Berney, M. in Advances in Bacterial Pathogen Biology (ed. Poole, R. K.) 65, 1–62 (Academic Press, 2014).
OpenUrl
4.↵
Lane, N., Allen, J. F. & Martin, W. How did LUCA make a living? Chemiosmosis in the origin of life. BioEssays 32, 271–280 (2010).
OpenUrl CrossRef PubMed Web of Science
5.↵
Lubitz, W., Ogata, H., Rudiger, O. & Reijerse, E. Hydrogenases. Chem. Rev. 114, 4081–148 (2014).
OpenUrl CrossRef PubMed Web of Science
6.↵
Peters, J. W. et al. [FeFe]- and [NiFe]-hydrogenase diversity, mechanism, and maturation. Biochim. Biophys. Acta – Mol. Cell Res. (2014). doi:10.1016/j.bbamcr.2014.11.021
OpenUrl CrossRef PubMed
7.↵
Carbonero, F., Benefiel, A. C. & Gaskins, H. R. Contributions of the microbial hydrogen economy to colonic homeostasis. Nat Rev Gastroenterol Hepatol 9, 504–518 (2012).
OpenUrl CrossRef PubMed
8.↵
Greening, C. et al. Atmospheric hydrogen scavenging: from enzymes to ecosystems. Appl. Environ. Microbiol. 81, 1190–1199 (2015).
OpenUrl Abstract/FREE Full Text
9.↵
Levin, D. B., Pitt, L. & Love, M. Biohydrogen production: prospects and limitations to practical application. Int. J. Hydrogen Energy 29, 173–185 (2004).
OpenUrl CrossRef Web of Science
10.↵
Cracknell, J. A., Vincent, K. A. & Armstrong, F. A. Enzymes as working or inspirational catalysts for fuel cells and electrolysis. Chem. Rev. 108, 2439–2461 (2008).
OpenUrl CrossRef PubMed Web of Science
11.↵
Shima, S. et al. The crystal structure of [Fe]-Hydrogenase reveals the geometry of the active site. Science 321, 572–575 (2008).
OpenUrl Abstract/FREE Full Text
12.↵
Lenz, O. & Friedrich, B. A novel multicomponent regulatory system mediates H₂ sensing in Alcaligenes eutrophus. Proc. Natl. Acad. Sci. U. S. A. 95, 12474–12479 (1998).
13.↵
Greening, C., Berney, M., Hards, K., Cook, G. M. & Conrad, R. A soil actinobacterium scavenges atmospheric H₂ using two membrane-associated, oxygen-dependent [NiFe] hydrogenases. Proc. Natl. Acad. Sci. U. S. A. 111, 4257–4261 (2014).
14.↵
Vignais, P. M., Billoud, B. & Meyer, J. Classification and phylogeny of hydrogenases. FEMS Microbiol. Rev. 25, 455–501 (2001).
OpenUrl CrossRef PubMed Web of Science
15.↵
Calusinska, M., Happe, T., Joris, B. & Wilmotte, A. The surprising diversity of clostridial hydrogenases: a comparative genomic perspective. Microbiology 156, 1575–1588 (2010).
OpenUrl CrossRef PubMed
16.↵
Atkinson, H. J., Morris, J. H., Ferrin, T. E. & Babbitt, P. C. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies. PLoS One 4, e4345 (2009).
OpenUrl CrossRef PubMed
17.↵
Ahmed, F. H. et al. Sequence-structure-function classification of a catalytically diverse oxidoreductase superfamily in mycobacteria. J. Mol. Biol. 427, 3554–3571 (2015).
OpenUrl CrossRef PubMed
18.↵
Ney, B. et al. The methanogenic redox cofactor F420 is widely synthesized by aerobic soil bacteria. ISME J. In press (2016).
19.↵
Kim, Y. J. et al. Formate-driven growth coupled with H₂ production. Nature 467, 352–5 (2010).
OpenUrl CrossRef PubMed Web of Science
20.↵
McTernan, P. M. et al. Intact functional fourteen-subunit respiratory membrane-bound [NiFe]-hydrogenase complex of the hyperthermophilic archaeon Pyrococcus furiosus. J. Biol. Chem. 289, 19364–19372 (2014).
OpenUrl Abstract/FREE Full Text
21.↵
Lie, T. J. et al. Essential anaplerotic role for the energy-converting hydrogenase Eha in hydrogenotrophic methanogenesis. Proc. Natl. Acad. Sci. U. S. A. 109, 15473–8 (2012).
22.↵
Auernik, K. S. & Kelly, R. M. Physiological versatility of the extremely thermoacidophilic archaeon Metallosphaera sedula supported by transcriptomic analysis of heterotrophic, autotrophic, and mixotrophic growth. Appl. Environ. MicroBiol. 76, 931–935 (2010).
OpenUrl Abstract/FREE Full Text
23.↵
Giaveno, M. A., Urbieta, M. S., Ulloa, J. R., Gonzalez Toril, E. & Donati, E. R. Physiologic versatility and growth flexibility as the Main characteristics of a novel thermoacidophilic Acidianus strain isolated from Copahue geothermal area in Argentina. Microb. Ecol. 65, 336346 (2012).
OpenUrl
24.↵
Marchler-Bauer, A. & Bryant, S. H. CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 32, W327–31 (2004).
OpenUrl CrossRef PubMed Web of Science
25.↵
Berney, M., Greening, C., Hards, K., Collins, D. & Cook, G. M. Three different [NiFe] hydrogenases confer metabolic flexibility in the obligate aerobe Mycobacterium smegmatis. Environ. Microbiol. 16, 318–330 (2014).
OpenUrl CrossRef PubMed Web of Science
26.↵
Greening, C. et al. Persistence of the dominant soil phylum Acidobacteria by trace gas scavenging. Proc. Natl. Acad. Sci. 112, 10497–10502 (2015).
27.↵
Chen, Z. et al. Phaeodactylibacterxiamenensis gen. nov., sp. nov., a member of the family Saprospiraceae isolated from the marine alga Phaeodactylum tricornutum. Int. J. Syst. Evol. Microbiol. 64, 3496–3502 (2014).
OpenUrl CrossRef PubMed
28.↵
Koch, H. et al. Growth of nitrite-oxidizing bacteria by aerobic hydrogen oxidation. Science 345, 1052–1054 (2014).
OpenUrl Abstract/FREE Full Text
29.↵
Carere, C. R. et al. Growth and persistence of methanotrophic bacteria by aerobic hydrogen respiration. Proc. Natl. Acad. Sci. U. S. A. (2016).
30.↵
Haroon, M. F. et al. Anaerobic oxidation of methane coupled to nitrate reduction in a novel archaeal lineage. Nature 500, 567–70 (2013).
OpenUrl CrossRef PubMed Web of Science
31.↵
Evans, P. N. et al. Methane metabolism in the archaeal phylum Bathyarchaeota revealed by genome-centric metagenomics. Science 350, 434–438 (2015).
OpenUrl Abstract/FREE Full Text
32.↵
Brown, C. T. et al. Unusual biology across a group comprising more than 15% of domain Bacteria. Nature 523, 208–211 (2015).
OpenUrl CrossRef PubMed
33.↵
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
OpenUrl CrossRef PubMed
34.↵
Eloe-Fadrosh, E. A. et al. Global metagenomic survey reveals a new bacterial candidate phylum in geothermal springs. Nat Commun 7, (2016).
35.↵
Wilson, M. C. et al. An environmental bacterial taxon with a large and distinct metabolic repertoire. Nature 506, 58–62 (2014).
OpenUrl CrossRef PubMed Web of Science
36.↵
Greening, C. & Cook, G. M. Integration of hydrogenase expression and hydrogen sensing in bacterial cell physiology. Curr. Opin. Microbiol. 18, 30–8 (2014).
OpenUrl CrossRef PubMed
37.↵
Greening, C. et al. Physiology, biochemistry, and applications of F₄₂₀-and F₀-dependent redox reactions. Microbiol. Mol. Biol. Rev. 80, 451–493 (2016).
OpenUrl Abstract/FREE Full Text
38.↵
Markowitz, V. M. et al. IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Research 40, D115–22 (2012).
OpenUrl CrossRef PubMed Web of Science
39.↵
Cover, T. & Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13, (1967).
40.↵
Constant, P., Chowdhury, S. P., Pratscher, J. & Conrad, R. Streptomycetes contributing to atmospheric molecular hydrogen soil uptake are widespread and encode a putative high-affinity [NiFe]-hydrogenase. Environ. MicroBiol. 12, 821–829 (2010).
OpenUrl CrossRef PubMed Web of Science
41.
Stetter, K. O. Archaeoglobus fulgidus gen. nov., sp. nov.: a new taxon of extremely thermophilic archaebacteria. Syst. Appl. MicroBiol. 10, 172–173 (1988).
OpenUrl CrossRef Web of Science
42.
Deppenmeier, U. & Blaut, M. Analysis of the vhoGAC and vhtGAC operons from Methanosarcina mazei strain Go1, both encoding a membrane-bound hydrogenase and a cytochrome b. Eur. J. BioChem. 269, 261–269 (1995).
OpenUrl
43.
Hamann, E. et al. Environmental Breviatea harbour mutualistic Arcobacter epibionts. Nature 534, 254–258 (2016).
OpenUrl CrossRef
44.
Sousa, F. L., Neukirchen, S., Allen, J. F., Lane, N. & Martin, W. F. Lokiarchaeon is hydrogen dependent. Nat. MicroBiol. 1, 16034 (2016).
OpenUrl

View the discussion thread.

Posted July 04, 2016.

Download PDF

Citation Tools

Subject Area

Bioinformatics

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11752)
Bioengineering (8752)
Bioinformatics (29200)
Biophysics (14974)
Cancer Biology (12096)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18308)
Genetics (12245)
Genomics (16803)
Immunology (11869)
Microbiology (28097)
Molecular Biology (11594)
Neuroscience (60969)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] 1.↵
Schwartz, E., Fritsch, J. & Friedrich, B. H₂-metabolizing prokaryotes. (Springer Berlin Heidelberg, 2013).

[2] 2.↵
Greening, C. et al. Genome and metagenome surveys of hydrogenase diversity indicate H₂ is a widely-utilised energy source for microbial growth and survival. ISME J. 10, 761–777 (2016).
OpenUrl CrossRef PubMed

[3] 3.↵
Poole, R. K.
Cook, G. M., Greening, C., Hards, K. & Berney, M. in Advances in Bacterial Pathogen Biology (ed. Poole, R. K.) 65, 1–62 (Academic Press, 2014).
OpenUrl

[4] Poole, R. K.

[5] 4.↵
Lane, N., Allen, J. F. & Martin, W. How did LUCA make a living? Chemiosmosis in the origin of life. BioEssays 32, 271–280 (2010).
OpenUrl CrossRef PubMed Web of Science

[6] 5.↵
Lubitz, W., Ogata, H., Rudiger, O. & Reijerse, E. Hydrogenases. Chem. Rev. 114, 4081–148 (2014).
OpenUrl CrossRef PubMed Web of Science

[7] 6.↵
Peters, J. W. et al. [FeFe]- and [NiFe]-hydrogenase diversity, mechanism, and maturation. Biochim. Biophys. Acta – Mol. Cell Res. (2014). doi:10.1016/j.bbamcr.2014.11.021
OpenUrl CrossRef PubMed

[8] 7.↵
Carbonero, F., Benefiel, A. C. & Gaskins, H. R. Contributions of the microbial hydrogen economy to colonic homeostasis. Nat Rev Gastroenterol Hepatol 9, 504–518 (2012).
OpenUrl CrossRef PubMed

[9] 8.↵
Greening, C. et al. Atmospheric hydrogen scavenging: from enzymes to ecosystems. Appl. Environ. Microbiol. 81, 1190–1199 (2015).
OpenUrl Abstract/FREE Full Text

[10] 9.↵
Levin, D. B., Pitt, L. & Love, M. Biohydrogen production: prospects and limitations to practical application. Int. J. Hydrogen Energy 29, 173–185 (2004).
OpenUrl CrossRef Web of Science

[11] 10.↵
Cracknell, J. A., Vincent, K. A. & Armstrong, F. A. Enzymes as working or inspirational catalysts for fuel cells and electrolysis. Chem. Rev. 108, 2439–2461 (2008).
OpenUrl CrossRef PubMed Web of Science

[12] 11.↵
Shima, S. et al. The crystal structure of [Fe]-Hydrogenase reveals the geometry of the active site. Science 321, 572–575 (2008).
OpenUrl Abstract/FREE Full Text

[13] 12.↵
Lenz, O. & Friedrich, B. A novel multicomponent regulatory system mediates H₂ sensing in Alcaligenes eutrophus. Proc. Natl. Acad. Sci. U. S. A. 95, 12474–12479 (1998).

[14] 13.↵
Greening, C., Berney, M., Hards, K., Cook, G. M. & Conrad, R. A soil actinobacterium scavenges atmospheric H₂ using two membrane-associated, oxygen-dependent [NiFe] hydrogenases. Proc. Natl. Acad. Sci. U. S. A. 111, 4257–4261 (2014).

[15] 14.↵
Vignais, P. M., Billoud, B. & Meyer, J. Classification and phylogeny of hydrogenases. FEMS Microbiol. Rev. 25, 455–501 (2001).
OpenUrl CrossRef PubMed Web of Science

[16] 15.↵
Calusinska, M., Happe, T., Joris, B. & Wilmotte, A. The surprising diversity of clostridial hydrogenases: a comparative genomic perspective. Microbiology 156, 1575–1588 (2010).
OpenUrl CrossRef PubMed

[17] 16.↵
Atkinson, H. J., Morris, J. H., Ferrin, T. E. & Babbitt, P. C. Using sequence similarity networks for visualization of relationships across diverse protein superfamilies. PLoS One 4, e4345 (2009).
OpenUrl CrossRef PubMed

[18] 17.↵
Ahmed, F. H. et al. Sequence-structure-function classification of a catalytically diverse oxidoreductase superfamily in mycobacteria. J. Mol. Biol. 427, 3554–3571 (2015).
OpenUrl CrossRef PubMed

[19] 18.↵
Ney, B. et al. The methanogenic redox cofactor F420 is widely synthesized by aerobic soil bacteria. ISME J. In press (2016).

[20] 19.↵
Kim, Y. J. et al. Formate-driven growth coupled with H₂ production. Nature 467, 352–5 (2010).
OpenUrl CrossRef PubMed Web of Science

[21] 20.↵
McTernan, P. M. et al. Intact functional fourteen-subunit respiratory membrane-bound [NiFe]-hydrogenase complex of the hyperthermophilic archaeon Pyrococcus furiosus. J. Biol. Chem. 289, 19364–19372 (2014).
OpenUrl Abstract/FREE Full Text

[22] 21.↵
Lie, T. J. et al. Essential anaplerotic role for the energy-converting hydrogenase Eha in hydrogenotrophic methanogenesis. Proc. Natl. Acad. Sci. U. S. A. 109, 15473–8 (2012).

[23] 22.↵
Auernik, K. S. & Kelly, R. M. Physiological versatility of the extremely thermoacidophilic archaeon Metallosphaera sedula supported by transcriptomic analysis of heterotrophic, autotrophic, and mixotrophic growth. Appl. Environ. MicroBiol. 76, 931–935 (2010).
OpenUrl Abstract/FREE Full Text

[24] 23.↵
Giaveno, M. A., Urbieta, M. S., Ulloa, J. R., Gonzalez Toril, E. & Donati, E. R. Physiologic versatility and growth flexibility as the Main characteristics of a novel thermoacidophilic Acidianus strain isolated from Copahue geothermal area in Argentina. Microb. Ecol. 65, 336346 (2012).
OpenUrl

[25] 24.↵
Marchler-Bauer, A. & Bryant, S. H. CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 32, W327–31 (2004).
OpenUrl CrossRef PubMed Web of Science

[26] 25.↵
Berney, M., Greening, C., Hards, K., Collins, D. & Cook, G. M. Three different [NiFe] hydrogenases confer metabolic flexibility in the obligate aerobe Mycobacterium smegmatis. Environ. Microbiol. 16, 318–330 (2014).
OpenUrl CrossRef PubMed Web of Science

[27] 26.↵
Greening, C. et al. Persistence of the dominant soil phylum Acidobacteria by trace gas scavenging. Proc. Natl. Acad. Sci. 112, 10497–10502 (2015).

[28] 27.↵
Chen, Z. et al. Phaeodactylibacterxiamenensis gen. nov., sp. nov., a member of the family Saprospiraceae isolated from the marine alga Phaeodactylum tricornutum. Int. J. Syst. Evol. Microbiol. 64, 3496–3502 (2014).
OpenUrl CrossRef PubMed

[29] 28.↵
Koch, H. et al. Growth of nitrite-oxidizing bacteria by aerobic hydrogen oxidation. Science 345, 1052–1054 (2014).
OpenUrl Abstract/FREE Full Text

[30] 29.↵
Carere, C. R. et al. Growth and persistence of methanotrophic bacteria by aerobic hydrogen respiration. Proc. Natl. Acad. Sci. U. S. A. (2016).

[31] 30.↵
Haroon, M. F. et al. Anaerobic oxidation of methane coupled to nitrate reduction in a novel archaeal lineage. Nature 500, 567–70 (2013).
OpenUrl CrossRef PubMed Web of Science

[32] 31.↵
Evans, P. N. et al. Methane metabolism in the archaeal phylum Bathyarchaeota revealed by genome-centric metagenomics. Science 350, 434–438 (2015).
OpenUrl Abstract/FREE Full Text

[33] 32.↵
Brown, C. T. et al. Unusual biology across a group comprising more than 15% of domain Bacteria. Nature 523, 208–211 (2015).
OpenUrl CrossRef PubMed

[34] 33.↵
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
OpenUrl CrossRef PubMed

[35] 34.↵
Eloe-Fadrosh, E. A. et al. Global metagenomic survey reveals a new bacterial candidate phylum in geothermal springs. Nat Commun 7, (2016).

[36] 35.↵
Wilson, M. C. et al. An environmental bacterial taxon with a large and distinct metabolic repertoire. Nature 506, 58–62 (2014).
OpenUrl CrossRef PubMed Web of Science

[37] 36.↵
Greening, C. & Cook, G. M. Integration of hydrogenase expression and hydrogen sensing in bacterial cell physiology. Curr. Opin. Microbiol. 18, 30–8 (2014).
OpenUrl CrossRef PubMed

[38] 37.↵
Greening, C. et al. Physiology, biochemistry, and applications of F₄₂₀-and F₀-dependent redox reactions. Microbiol. Mol. Biol. Rev. 80, 451–493 (2016).
OpenUrl Abstract/FREE Full Text

[39] 38.↵
Markowitz, V. M. et al. IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Research 40, D115–22 (2012).
OpenUrl CrossRef PubMed Web of Science

[40] 39.↵
Cover, T. & Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 13, (1967).

[41] 40.↵
Constant, P., Chowdhury, S. P., Pratscher, J. & Conrad, R. Streptomycetes contributing to atmospheric molecular hydrogen soil uptake are widespread and encode a putative high-affinity [NiFe]-hydrogenase. Environ. MicroBiol. 12, 821–829 (2010).
OpenUrl CrossRef PubMed Web of Science

[42] 41.
Stetter, K. O. Archaeoglobus fulgidus gen. nov., sp. nov.: a new taxon of extremely thermophilic archaebacteria. Syst. Appl. MicroBiol. 10, 172–173 (1988).
OpenUrl CrossRef Web of Science

[43] 42.
Deppenmeier, U. & Blaut, M. Analysis of the vhoGAC and vhtGAC operons from Methanosarcina mazei strain Go1, both encoding a membrane-bound hydrogenase and a cytochrome b. Eur. J. BioChem. 269, 261–269 (1995).
OpenUrl

[44] 43.
Hamann, E. et al. Environmental Breviatea harbour mutualistic Arcobacter epibionts. Nature 534, 254–258 (2016).
OpenUrl CrossRef

[45] 44.
Sousa, F. L., Neukirchen, S., Allen, J. F., Lane, N. & Martin, W. F. Lokiarchaeon is hydrogen dependent. Nat. MicroBiol. 1, 16034 (2016).
OpenUrl