Supplemental Data
Files in this Data Supplement:
- Supplementary Methods
- Supplementary Note
- Protein families of the PDM consensus modules
- PDM assignments to genomes of the learning set
- Hemicellulolytic gene cluster in Fibrobacter succinogenes S85
- Co-occurrence profiles of the M1 protein families and GH6/GH48 across the learning set
- Gene cluster in the cow rumen draft genome AGa
- Microbial isolate strains (lignocellulose degraders and non-degraders) that were used as the learning set
- Single predictions of the consensus modules on the learning set of genomes
- Single predictions of the consensus modules on the remaining set of genomes and metagenome bins
- Venn diagram of the predicted occurrences of the modules M1-M4
- Protein sequences of the identified gene cluster in the draft genome AGa from the cow rumen metagenome