Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

Assessing anaerobic gut fungal (Neocalliamstigomycota) diversity using PacBio D1/D2 LSU rRNA amplicon sequencing and multi-year isolation

Radwa A. Hanafy, Britny Johnson, Noha H. Youssef, Mostafa S. Elshahed
doi: https://doi.org/10.1101/2020.03.24.005967
Radwa A. Hanafy
Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Britny Johnson
Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Noha H. Youssef
Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mostafa S. Elshahed
Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, OK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: Mostafa@okstate.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Preview PDF
Loading

Abstract

The anaerobic gut fungi (AGF, Neocallimastigomycota) reside in the alimentary tracts of herbivores where they play a central role in the breakdown of ingested plant material. Accurate assessment of AGF diversity has been hampered by inherent deficiencies of the internal transcribed spacer1 (ITS1) region as a phylogenetic marker. Here, we report on the development and implementation of the D1/D2 region of the large ribosomal subunit (D1/D2 LSU) as a novel marker for assessing AGF diversity in culture-independent surveys. Sequencing a 1.4-1.5 Kbp amplicon encompassing the ITS1-5.8S rRNA-ITS2-D1/D2 LSU region in the ribosomal RNA locus from fungal strains and environmental samples generated a reference D1/D2 LSU database for all cultured AGF genera, as well as the majority of candidate genera encountered in prior ITS1-based diversity surveys. Subsequently, a D1/D2 LSU-based diversity survey using long read PacBio SMRT sequencing technology was conducted on fecal samples from 21 wild and domesticated herbivores. Twenty-eight genera and candidate genera were identified in the 17.7 K sequences obtained, including multiple novel lineages that were predominantly, but not exclusively, identified in wild herbivores. Association between certain AGF genera and animal lifestyles, or animal host family was observed. Finally, to address the current paucity of AGF isolates, concurrent isolation efforts utilizing multiple approaches to maximize recovery yielded 216 isolates belonging to twelve different genera, several of which have no prior cultured-representatives. Our results establish the utility of D1/D2 LSU and PacBio sequencing for AGF diversity surveys, and the culturability of a wide range of AGF taxa, and demonstrate that wild herbivores represent a yet-untapped reservoir of AGF diversity.

Introduction

Members of the anaerobic gut fungi (AGF) are strict anaerobes that inhabit the rumen and alimentary tract of a wide range of foregut and hindgut herbivores. The AGF play an important role in the breakdown of ingested plant biomass via enzymatic and physical disruption in the herbivorous gut1. AGF represent a distinct basal fungal phylum (Neocallimastigomycota) that evolved 66 (±10) million years ago coinciding, and possibly enabling, mammalian transition from insectivory to herbivory2.

Culture independent amplicon-based diversity surveys have been widely utilized to gauge anaerobic fungal diversity and community structure in herbivores3, 4, 5, 6. The internal transcribed spacer1 (ITS1) region within the ribosomal operon has been almost exclusively utilized as the phylogenetic marker of choice in culture-independent sequence-based phylogenetic assessments of AGF diversity7 Such choice is a reflection of its wider popularity as a marker within the kingdom Mycota8, 9, the high sequence similarity and limited discriminatory power of the 18S rRNA gene between various AGF taxa10, and its relatively shorter length, allowing high throughput pyrosequencing- and Illumina-based diversity assessments3, 6. However, concerns for the use of ITS1 in diversity assessment for the Mycota11, basal fungi12, and the Neocallimastigomycota7 have been voiced. The ITS1 region is polymorphic, exhibiting considerable secondary structure (number and organization of helices13), and length14 variability. Such polymorphism renders automated alignments, reproducible sequence divergence estimates, and classification of sequence data unreliable and highly dependent on alignment strategies and parameters specified. In addition, significant sequence divergence between copies of the ITS1 region within a single strain have been reported (up to 12.9% in15), values that exceed cutoffs utilized for species (even genus in some instances) level delineation from sequence data3, 16, 17, 18. Such limitations often necessitate laborious subjective manual curation and secondary structure incorporation into alignment strategies13, although it is well recognized that these efforts only partially alleviate, rather than completely address, such fundamental limitations.

The 28S large ribosomal subunit (LSU) is one of the original genes proposed for fungal barcoding12. Hypervariable domains 1 and 219 within the LSU molecule (D1/D2 LSU) have previously been utilized for differentiating strains of AGF via molecular typing20, 21, 22, or sequencing23, 24 Compared to ITS1, D1/D2 LSU region exhibits much lower levels of length heterogeneity and intra-strain sequence divergence in fungi25, including the AGF20. Identification and taxonomic assignment of AGF strains based on D1/D2 LSU have gathered momentum; and D1/D2 LSU-based phylogenetic analysis has been reported in all manuscripts describing novel taxa since 201515, 26, 27, 28, 29, 30, 50. The potential use of D1/D2 LSU as a marker in culture-independent AGF diversity surveys has been proposed as a logical alternative for ITS17, 14 The lack of specific AGF primers and the relatively large size of the region (approximately 750 bp) has been viewed as a barrier to the wide utilization of short read, high-throughput, Illumina-based amplicon sequencing in such surveys. However, the recent development of AGF LSU-specific primers24, 31, as well as the standardization and adoption of PacBio long-read sequencing for amplicon-based diversity surveys32, 33 could enable this process.

Theoretically, a comprehensive assessment of diversity and community structure of a host-associated lineage necessitates sampling all (or the majority) of hosts reported to harbor such lineage. However, to date, the majority of AGF diversity surveys conducted have targeted a few domesticated herbivores, e.g. cows, sheep, and goats4, 5, 34. “Exotic” animals have been sampled from zoo settings only sporadically, and on an opportunistic basis3, 35.

Isolation of AGF taxa enables taxonomic, metabolic, physiological, and ultrastructural characterization of individual taxa. As well, cultures availability enables subsequent –omics, synthetic and system-biology, and biogeography-based investigations36, 37, 38, 39, 40, 41, 42, as well as evaluation of evolutionary processes underpinning speciation in the AGF2, 43. However, efforts to isolate and maintain AGF strains have lagged behind their aerobic counterparts mainly due to their strict anaerobic nature and the lack of reliable long-term storage procedures. Due to these difficulties, many historic isolates are no longer available, and most culture-based studies report on the isolation of a single or few strains using a single substrate/enrichment condition from one or few hosts29, 44. Indeed, a gap currently exists between the rate of discovery (via amplicon-based diversity surveys) and the rate of isolation of new taxa of AGF, and several yet-uncultured AGF lineages have been identified in culture-independent diversity surveys17. Whether yet-uncultured AGF taxa are refractory to isolation, or simply not yet cultured due to inadequate sampling and isolation efforts remains to be seen.

The current study aims to expand our understanding of the diversity of AGF while addressing all three impediments described above. First, we sought to develop D1/D2 LSU as a more robust marker for AGF diversity assessment by building a reference sequence database correlating ITS1 and D1/D2 LSU sequence data from cultured strains and environmental samples. Second, we sought to expand on AGF diversity by examining a wide range of animal hosts, including multiple previously unsampled wild herbivores. Third, we sought to demonstrate the utility of intensive sampling and utilization of various isolation strategies in recovering AGF strains and testing the hypothesis that many yet-uncultured AGF lineages are indeed amenable to cultivation. Collectively, these efforts provide an established framework for future utilization of D1/D2 LSU amplification and PacBio sequencing for AGF community assessment, highlight the value of sampling wild herbivores, and establish the culturability of a wide range of AGF taxa.

Results

A reference D1/D2-LSU dataset for the Neocallimastigomycota

A 1.4-1.5 Kbp amplicon product encompassing the ITS1-5.8S-ITS2-D1/D2 LSU region was amplified and sequenced from AGF pure cultures and environmental samples to correlate the D1/D2 LSU region to the corresponding ITS1 region, and to provide a reference D1/D2 database for future utilization in high-throughput diversity surveys. Using this approach, representative D1/D2 LSU of all the previously cultured AGF genera Agriosomyces, Aklioshbomyces, Anaeromyces, Buwchfawromyces, Caecomyces, Capellomyces, Cyllamyces, Ghazallomyces, Joblinomyces, Feramyces, Khyollomyces, Liebetanzomyces, Neocallimastix, Orpinomyces, Pecoramyces, Piromyces, and Tahromyces were obtained (Table 1). Representatives of the genus Oontomyces were not encountered in this study, but reference LSU and ITS1 sequences were obtained from prior publication26. In addition, representative sequences of D1/D2 LSU of candidate genera AL3, AL4, AL8, MN3, MN4, SK3, and SK4, previously identified in ITS1 culture-independent datasets were also obtained (Table 1, Datasets 1-3). Finally, representatives of six completely novel AGF candidate genera (RH1-RH6) were also identified (Table 1, Datasets 1-3) and confirmed as novel independent clades in ITS1 and D1/D2 LSU-based phylogenetic analysis. It should be noted that multiple previously reported yet-uncultured (candidate) genera have recently been successfully isolated, e.g. AL1 (Khyollomyces), AL5 (Joblinomyces), AL6 (Feramyces), AL7 (Piromyces finnis), MN1 (Cyllamyces), SP4 (Liebetanzomyces), and SK2 (Buwchfawromyces). In addition, some previously proposed candidate genera clustered as members of already existing genera in our analysis, e.g. SP8 with Cyllamyces, and SP6 with Neocallimastix (Table 1). As such, we estimate that only representatives of candidate genera BlackRhino, SP1, SP217, and the relatively rare AL2, DA1, DT1, JH1/SP5 (ITS1 sequence representatives of these two candidate genera are 99.6% similar and so they should be considered as one candidate genus), KF1, MN2, SK1, SP3, and SP73, 5, 13, 17, 45, 46, 47, 48 were not encountered in this study, and hence no reference LSU sequence data for these candidate genera are currently available (Table 1).

View this table:
  • View inline
  • View popup
Table 1.

Representatives full-length sequences spanning the region “ITS1-5.8S rRNA-ITS2-D1/D2 LSU”. GenBank accession numbers are shown for all clone sequences obtained from representative AGF isolates in our culture collection. For yet-uncultured AGF taxa, accession numbers refer to the SMRT generated sequence name in Datasets 1-3. Start and end positions of ITS1, 5.8S rRNA gene, ITS2, and the D1/D2 region of the LSU are shown.

D1/D2 LSU versus ITS1 as a taxonomic marker

Intra-genus length variability

The ITS1 and D1/D2 LSU regions were bioinformatically extracted from the 116 Sanger-generated clone sequences from this study and previous studies23, 28, 29, 49 (accession numbers in Table 1), the rRNA loci from two Neocallimastigomycota genomes (Pecoramyces ruminantium strain C1A, and Neocallimastix californiae strain G1)36, 43 in which the entire rrn operon sequence data is available, and from the PacBio-generated environmental amplicons in this study. The ITS1 region displayed a high level of length heterogeneity, ranging in size between 141 and 250 bp (median 191 bp, Figure 1a), with 75% of sequences ranging between 182-208 bp in length. Some genera had shorter than median ITS1 region length, e.g. Cyllamyces (range 141-173 bp), Buwchfawromyces (range 155-169 bp), and candidate genus AL3 (range 145-148 bp), while others exhibited a longer than median ITS1 region length, e.g. Liebetanzomyces (range 198-225 bp), and candidate genus RH5 (range 191-224 bp) (Figure 1a). A third group of genera displayed a wide range of length heterogeneity, e.g. Neocallimastix (range 160-244 bp), Caecomyces (range 192-250), and Piromyces (range 173-225 bp). Few genera and candidate genera displayed a fairly narrow range of ITS1 length, e.g. AL3 (141-148 bp), but this is potentially a reflection of the paucity of sequences belonging to these genera obtained in this study (Figure 1a).

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

Box and whisker plots showing the variability in intra-genus length (A-B) and sequence divergence cutoff (C-D) for the ITS1 (A, C) and D1/D2 LSU (B, D) regions. A cartoon of the rRNA locus is shown on top. Genera and candidate genera with at least 5 sequences encompassing the full region “ITS1-5.8S-ITS2-D1/D2 LSU” were used to construct this plot as detailed in the methods section. The candidate genera AL4, MN3, MN4, RH3, RH6, and SK3 had only a few sequence representatives (1-3) and so are not included in the plot.

On the other hand, a much lower level of length heterogeneity was identified in the D1/D2 LSU (Figure 1b), ranging in size between 740-767 bp (median 760 bp), and where 75% of the sequences ranged between 757-761 bp, with all genera consistently displaying a much narrower D1/D2-LSU length heterogeneity, ranging between 11 bp in RH4 and 26 bp in the genera Neocallimastix and Aklioshbomyces.

Intra-genus sequence divergence

The ITS1 region displayed intra-genus sequence divergence ranging from 0.4 to 21% (median 3.2%), with 75% of the pairwise divergence values ranging between 1.7-6%. Genera displaying the highest level of divergence were Caecomyces (1-18.9%, median 8.3%), Cyllamyces (0.6-19.6%, median 5.5%), and Neocallimastix (0.4-19.3%, median 5.5%) (Figure 1c). On the other hand, intra-genus sequence divergence of the D1/D2 LSU ranged between 0.1-9.2% (median 1.4%), with 75% of the pairwise divergence values ranging between 0.8-2.1%. Genera displaying highest level of divergence were Feramyces (0.1-7.8%), Joblinomyces (0.1-8.7%, Caecomyces (0.1-9%), and Piromyces (0.1-9.2%) (Figure 1d).

Within strain length variability

Within strain length heterogenicity examined in 19 strains with 2 or more sequenced clones ranged between 0-5 bp (Figure 2a) for ITS1 region and 0-1 bp for the LSU region (Figure 2b).

Figure 2.
  • Download figure
  • Open in new tab
Figure 2.

Box and whisker plots showing the variability in intra-strain length (A-B) and sequence divergence cutoff (C-D) for the ITS1 (A, C) and D1/D2 LSU (B, D) regions.

Within strain sequence divergence

Examining the 19 strains with more than two sequenced clones, the full ITS1 region showed intra-strain sequence divergence ranging from 0.1-10.01% (Figure 2c). Similar, and even higher levels of intra-strain ITS1 variability was previously reported e.g. up to 12.9% in Buwchfawromyces eastonii strain GE0915. On the other hand, within strain D1/D2 LSU rRNA region showed a much lower sequence divergence, ranging from 0.13-1.84% (Figure 2d).

Neocallimastigomycota diversity assessment using D1/D2 LSU as a phylogenetic marker

Phylogenetic diversity and Novelty

A total of 17,697 high-quality long-read amplicons were obtained. Phylogenetic analysis using the D1/D2 LSU amplicons assigned all sequences into 28 different genera/candidate genera (Figure 3a, Figure 4a, Figure S1) and 298 species level OTUs0.02. AGF genera identified in this study included members of the previously described genera Anaeromyces, Buwchfawromyces, Caecomyces, Cyllamyces, Liebetanzomyces, Neocallimastix, Orpinomyces, Pecoramyces, and Piromyces. In addition, sequences representing multiple novel genera were also identified (Figure 3a, Figure 4a, Figure S1), some of which have been subsequently isolated, named, and characterized in separate publications, e.g. Feramyces28, Aklioshbomyces, Agriosomyces, Ghazallomyces, and Khyollomyces50. Finally, six novel candidate genera were identified and designated RH1-RH6 (Figure 3a, Figure 4a, Figure S1). All of these six novel genera were encountered in extremely low abundance in a few samples (Figure 3a), with the notable exception of RH5, which was present in high relative abundance in multiple animals e.g. domesticated sheep (96.22%), blackbuck deer (52.41%), axis deer (20.71%), and an aoudad sheep sample (11.75%).

Figure 3.
  • Download figure
  • Open in new tab
Figure 3.

AGF genera distribution across the animal studied. (A) Percentage abundance of AGF genera in the animals studied. The tree is intended to show the relationship between the animals and is not drawn to scale. Host phylogeny (family), lifestyle, and gut type are shown for each animal. The X-axis shows the percentage abundance of AGF genera. (B) Rank abundance curves are displayed for each animal showing a distribution pattern in which a few genera (1-5) represent the majority (>10%) of the sequences obtained.

Figure 4.
  • Download figure
  • Open in new tab
Figure 4.

(A) Phylogenetic tree constructed using the D1/D2 LSU sequences of representatives of each of the 28 genera/candidate genera identified in this study. Sequences were aligned using the MAFFT aligner and maximum likelihood tree was constructed in FastTree61, 62. Bootstrap values are based on 100 replicates and are shown for branches with >50% bootstrap support. Genera with cultured representatives are shown in black, uncultured candidate genera identified in previous ITS1-based studies are shown in green, while the 6 novel genera identified in the current study are shown in red. The distribution of each of these genera/candidate genera in the animals studied is shown as a heatmap on the right. (B) AGF genera distribution patterns. The total number of animals harboring each of the genera identified in this study is shown on the Y-axis, with the different colored stacked bars reflecting the number of animals where the genus was the most abundant member, occurred with high (>5%) abundance, occurred with medium (1-5%) abundance, or occurred with low (<1%) abundance. AGF genera are classified into one of the five distribution patterns shown on top of the graph using empirical cutoffs for ubiquity (presence in at least 50% of the animals studied, shown as the dotted line across the bottom bar chart), as well as the fraction of animals where the genus abundance was above 1% (shown as the top bar chart).

Diversity estimates, and distribution patterns

The number of AGF genera encountered per sample varied widely from 5 (in Pere David’s deer, and Longhorn cattle) to 16 (in one Aoudad sheep sample) (Table 2, Figure 3a, Figure 4a). However, in each of these samples a distribution pattern was observed in which a few genera represent the absolute majority of the sequences obtained. Excluding genera present in less than 1% abundance would lower the number of genera encountered per animal to 1 (in white-tail deer and dwarf goat) −10 (domesticated goat). Usually, 1-5 taxa were present in >10% abundance per animal (Figure 3b).

View this table:
  • View inline
  • View popup
Table 2.

Animals sampled in this study, numbers of sequences obtained (N), number of observed OTUs, and various diversity indices both at the species equivalent (0.02) and the genus levels

Using empirical cutoffs for ubiquity (presence in at least 50% of the animals studied) and abundance (above 1%), we identify five different distribution patterns for AGF genera encountered in this study (Figure 4b); 1. Ubiquitous mostly abundant genera: These are the genera identified in at least 50% of the animals studied and where their relative abundances exceed 1% in at least 50% of their hosts: This group includes Piromyces, Feramyces, Khyollomyces, RH5, Neocallimastix, Cyllamyces, and Caecomyces. 2. Ubiquitous mostly rare genera: These are the genera identified in at least 50% of the animals studied and where their relative abundances were lower than 1% in at least 50% of their hosts. This group includes Orpinomyces, and Pecoramyces. 3. Less ubiquitous but mostly abundant genera: These are the genera identified in < 50% of the animals studied but where their relative abundances exceed 1% in at least 50% of their hosts. This group includes Ghazallomyces, RH4, MN4, Joblinomyces, SK4, Buwchfawromyces, AL3, RH1, and RH3. 4. Less ubiquitous mostly rare genera: These are the genera identified in < 50% of the animals and where their relative abundances were lower than 1% in at least 50% of their hosts. This group includes Liebetanzomyces, Anaeromyces, AL8, Aklioshbomyces, RH2, and Agriosomyces. 5. Less ubiquitous consistently rare genera: These are the genera identified in < 50% of the animals and where their relative abundances never exceeded 1% in any of their hosts. This group includes RH6, AL4, MN3, and SK3.

Multiple diversity estimates (number of observed genera, Chao and Ace richness estimates, Shannon diversity index, Simpson evenness, as well as diversity rankings) were computed for each sample (Table 2). The highest genus-level richness was observed in aoudad sheep, dwarf goat, oryx, domesticated cow, domesticated goat, miniature donkey, zebra, and blackbuck deer samples, while the highest genus-level diversity (based on diversity ranking and Shannon index) was observed in domesticated goat, alpaca, axis deer, blackbuck deer, mouflon ram, miniature donkey, oryx, and domesticated horse. On the other hand, the lowest genus-level richness was observed in longhorn cattle, Pere David’s deer, Boer goat, domesticated horse, domesticated sheep, and alpaca, while the lowest genus-level diversity was observed in Fallow deer, zebra, domesticated sheep, dwarf goat, and white-tail deer.

When correlated to animal host phylogeny or animal lifestyle (24 possible combinations), all diversity estimates showed low correlation coefficients (Cramer’s V statistic <0.49) at both the genus and the species equivalent levels (Table S1). Student t-tests were used to examine the significance of the difference in diversity estimates at the genus and species equivalent levels between animal host families (families Bovidae, Cervidae, and Equidae) as well as animal lifestyle (zoo-housed, wild, and domesticated). Only three of these showed a significant difference (Student t-test p-value <0.05): Family Bovidae had a significantly higher observed number of genera and significantly higher Chao estimate at the genus level, and zoo-housed animals had significantly lower Shannon diversity at the species equivalent level (Table S1).

Community structure

We used a combination of ordination methods and Student t tests to identify associations between AGF genera and host factors. Non-metric multidimensional scaling based on the genus-level Bray-Curtis indices (Figure 5a-b) identified a few patterns. The genera Aklioshbomyces, Ghazallomyces, Joblinomyces, Feramyces, Buwchfawromyces, and Pecoramyces seem to be more prevalent in some wild animals (e.g. black buck deer, mouflon, oryx, axis deer, and white tailed deer; filled squares in Figure 5a), while some zoo-housed animals (e.g. elk, dwarf goat, and miniature donkey; grey squares in Figure 5a) clustered together based on the abundance of Neocallimastix, Caecomyces, and Liebetanzomyces. Few domesticated animals (e.g. domesticated goat, longhorn, alpaca, and domesticated cow; open squares in Figure 5a) clustered together based on the abundance of Cyllamyces, AL8, MN3, MN4, RH1, RH3, RH4, and RH6. Animal host family had a slightly less apparent effect on AGF community structure (Figure 5b) with the exception of the importance of Aklioshbomyces and Ghazallomyces in family Cervidae, and AL3 and Khyollomyces in family Equidae.

Figure 5.
  • Download figure
  • Open in new tab
Figure 5.

Nonmetric multidimensional scaling based on pairwise Bray-Curtis dissimilarity indices at the genus level. Samples are shown as symbols and displayed in black text while AGF genera are shown as ‘+’ and displayed in red text. (A) Symbols reflect lifestyle with domesticated animals shown as white squares, zoo-housed animals shown as grey squares, and wild animals shown as black squares. (B) Symbols reflect animal host phylogeny with family Bovidae shown as squares, family Cervidae shown as circles, family Equidae shown as hexagons, and family Camelidae shown as a star. Abbreviations: Am Bison, American bison; Ax deer, Axis deer; B goat, Boer goat; Bb deer, Blackbuck deer; Dw Goat, Dwarf goat; Fa deer, Fallow deer; Min Don, Miniature donkey; PD deer, Pere David’s deer; WT deer, White-tail deer.

To test the significance of these observed patterns, Student t-tests were used to identify significant associations between specific AGF taxa and host phylogeny (families Bovidae, Equidae, Cervidae), or animal lifestyle (zoo-housed, domesticated, wild). From all possible associations (168 total; 28 genera x 3 host families and 3 lifestyles), significant differences were observed only in the following cases. The AGF genera AL3, Khyollomyces, and Piromyces were significantly more abundant in family Equidae (p-value=0.014, 0.018, and 0.034 respectively), while the genera Aklioshbomyces, Ghazallomyces, and Joblinomyces were significantly more abundant in family Cervidae (p-value=0.074, 0.072, and 0.075 respectively). On the other hand, the animal lifestyle had slightly more significant effect on AGF community structure as follows: The genus Neocallimastix was significantly more abundant in zoo-housed animals (p-value=0.007), the genera Aklioshbomyces, Buwchfawromyces, and Pecoramyces were significantly more abundant in wild animals (p-value=0.047, 0.028, and 0.014 respectively), and the genera Cyllamyces, AL8, RH1, RH4, and RH6 were significantly more abundant in domesticated animals (p-value=0.001, 0.001, 0.011, 0.018, and 0.054 respectively). Finally, for individual animals species with enough replication in our study, the genera Cyllamyces, AL8, and RH1 were significantly more abundant in Bos taurus (p-values=1.86E-11, 3E-5, and 2.27E-9, respectively), the genera Caecomyces and RH5 were significantly more abundant in Ovis aries (p-values=0.006, and 0.004 respectively), and the genera Feramyces and SK4 were significantly more abundant in Ammotragus lervia (p-values=0.002, and 0.0006, respectively). Further, some genera were only encountered in one animal, demonstrating a probable strong AGF genus-host preference. These genera include Ghazallomyces only encountered in axis deer, AL4 only encountered in domesticated sheep, MN3 only encountered in domesticated cow, and MN4 only encountered in domesticated goat.

Neocallimastigomycota isolation

A total of 216 AGF isolates were obtained from 21 animals (Table 3). Success in isolation and maintenance of that large number of isolates was enabled by the implementation of various techniques for isolation, and the development of a reliable storage procedure51. Isolates obtained belonged to 12 different genera (Table 3), six of which were exclusively isolated in this study, and characterized in separate publications (Akhlioshbomyces, Ghazallomyces, Capellomyces, Agriosomyces, Khoyollomyces (AL1), and Feramyces (AL6)28, 50. In general, 1-3 AGF genera were isolated per sample. Isolation efforts captured anywhere between 6.3% (1 of 16 genera) to 27.3% (3 of 11 genera) of AGF genera identified in a single sample using culture-independent D1/D2 LSU gene-based analysis. However, these values are highly affected by the fact that sequencing efforts are capable of identification of AGF genera present in extremely low levels of relative abundance. Indeed, excluding rare taxa (those present at <1% abundance), the culturability goes up to 10% (1 of 10 genera)-100% (2 of 2 genera).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3.

Number and sources of isolates obtained in thus study.

We sought to determine how community structure and isolation efforts correlate, and whether obtaining isolates belonging to a specific genus could be predicted from the community structure of the sample. We observed a strong Pearson correlation (r=0.79) between the abundance of a certain genus in a sample and the frequency of its isolation. On the other hand, the success of isolation of the most dominant member of the community was negatively affected by the sample evenness (Pearson correlation coefficient = −0.87). Indeed, our ability to isolate the novel genera Aklioshbmyces, Ghazallomycota, and Khyollomyces could be attributed to their presence in high relative abundance in samples from which they were recovered (Table 3), as opposed to their rarity/absence in other samples (Figure 3, 4). Within ubiquitous genera, we observed that the abundance-success of isolation correlation described above is stronger for monocentric taxa (Pearson correlation coefficients= 0.83, 0.96, 0.92, and 1 for Pecoramyces, Feramyces, Neocallimastix, and Agriosomyces, respectively), while such relationship was much weaker in polycentric taxa (Pearson correlation coefficients= 0.31, and 0.58 for Orpinomyces, and Anaeromyces, respectively). However, the polycentric nature of these genera (ability to propagate even in the absence of zoospore production, and the larger colony size on roll tubes) enabled their isolation even when they constituted a minor fraction of the total community.

Discussion

LSU as a phylogenetic marker for AGF diversity surveys

We highlight and quantify the advantages associated with the utilization of D1/D2 LSU as a phylomarker for the AGF when compared to the currently utilized ITS1 region (Figures 1, 2). We also report on overcoming the three main hurdles (lack of reference sequences from uncultured genera, correlating D1/D2 LSU data to currently available ITS1 datasets, and amplicon length precluding utilization of Illumina platform) associated with D1/D2 LSU use as a phylomarker. To address the lack of reference LSU sequence data, we undertook a multi-year isolation effort to provide a comprehensive D1/D2-LSU database from a wide range of AGF taxa, a necessary approach given the lack of LSU sequence data from multiple historic taxa, unavailability of AGF in culture collections, and difficulties in maintenance of this fastidious group of organisms. To correlate D1/D2 LSU data to currently available ITS1 datasets and overcome amplicon length constrains, we utilized a SMRT-PacBio sequencing approach to obtain sequences comprising the region spanning from the start of the ITS1 region to the end of the D1/D2-LSU region in the rRNA locus (1400-1500 bp). In the process, we not only increased the representation of D1/D2-LSU sequences from all cultured taxa, but also identified D1/D2-LSU sequences of yet-uncultured taxa previously defined only by their ITS1 sequences (e.g. AL3, AL4, AL8, MN3, MN4, SK3, and SK4), as well as defined 6 completely novel AGF candidate genera (RH1-RH6). Collectively, this dataset (17,697 sequences of environmental D1/D2 LSU annotated by their taxonomy (Dataset 3), plus 116 Sanger-generated clone sequences and genomic rRNA loci sequences (Table 1 with accession numbers) could pave the way for future D1/D2 LSU-based AGF diversity surveys. We anticipate that additional sampling and culture-independent studies using the whole region, as well as future isolation efforts will identify the corresponding D1/D2-LSU region for those few yet-uncultured ITS1-defined lineages that we failed to capture in our study.

Single molecule real-time (SMRT) PacBio sequencing technology enables long read sequencing by a single uninterrupted DNA polymerase molecule. The SMRT sequencing protocol involves ligating hairpin adaptors to the ends of double-stranded DNA (PCR products in the case of culture-independent studies), leading to the circularization of the DNA. This subsequently allows the sequencing polymerase to pass around the molecule multiple times. The re-sequencing by multiple passages increases sequence coverage thereby significantly reducing error rates from initial values of up to 15%, to levels lower than 1%. Culture-independent studies in bacteria, archaea, and fungi33, 52, 53, 54 have successfully applied the technology. We, here, provide the basis for its application to culture-independent studies in anaerobic gut fungi. We applied rigorous control to ensure the high quality of reads utilized to build the single molecule consensus read sequences (by using a minimum threshold of 5 full passes and 99.95% predicted accuracy), followed by pre-processing in Mothur to remove sequences with ambiguities or an average quality score below 25. Also, we anticipate that future AGF diversity studies employing PacBio sequencing of the D1/D2-LSU region (rather than the full ITS1-5.8S-ITS2-D1/D2 LSU region) would be further enabled by the shorter amplicon length (~700 as opposed to ~1300-1400 bp), as well as recent (e.g. Sequel II) and future anticipated improvements in SMRT sequencing technology.

Discovery and characterization of novel AGF lineages

D1/D2 LSU-based diversity assessment of 21 fecal samples identified multiple novel AGF candidate genera (Figure 3, 4), five of which were subsequently isolated and described in separate publications (Feramyces28, Aklioshbomyces, Agriosomyces, Ghazallomyces, and Khyollomyces50). These results clearly demonstrate that the scope of AGF diversity is much broader than implied from prior studies. This conclusion is in apparent disagreement with the recent work of Paul et al.17, where the authors utilized a rarefaction-based approach on publicly available ITS1 AGF sequence data to suggest that AGF sampling efforts have reached saturation. However, we argue that using a rarefaction curve approach on publicly available datasets only elucidates coverage within samples already in the database, and not the broader AGF diversity in nature. Many prior studies have used relatively low throughput sequencing technologies, and repeatedly sampled few domesticated animals, and such pattern would result in encountering highly similar populations between different studies. We attribute the discovery and characterization of a wide range of novel AGF taxa within our dataset to sampling previously unsampled animal hosts, and the use of high-throughput sequencing that enabled access to rare members of the AGF community. Multiple novel AGF genera were isolated from animals previously unsampled for AGF diversity, e.g. Aklioshbomyces from white-tailed deer where it represented 98.5% of the community, Ghazallomyces from axis deer where it represented 27.8% of the community, and Feramyces from an aoudad sheep sample where it represented 55.3% of the community. It is notable that many of these novel taxa were only encountered in wild herbivores. Whether this novelty is a reflection of a lifestyle selecting for specific taxa, or a reflection of simply lack of prior sampling of wild animals due to logistic difficulties remains to be seen. This clearly demonstrates that novel AGF taxa remain to be discovered by sampling hitherto unsampled/poorly sampled animal hosts.

Further, a significant fraction of novel AGF candidate genera identified were present in extremely low relative abundance. Such pattern suggests the presence of numerous novel AGF taxa that appear to predominantly exist in relatively low abundance possibly as dormant members of the AGF community in the herbivorous gut. The discovery and characterization of the rare members of AGF community could significantly expand the scope of AGF diversity in nature. The dynamics, rationale for occurrence, mechanisms of maintenance, putative role in ecosystems, and evolutionary history of rare members of the community are currently unclear. It has been suggested that a fraction of the rare biosphere could act as a seed bank of functional redundancy that aids in ecosystem response to periodic (e.g. occurring as part of growth of the animal host, or due to seasonal changes in feed types) or occasional (i.e. due to unexpected disturbances) changes in the gut in-situ conditions. Regardless, such pattern highlights the value of deeper sampling (to capture rare biosphere), as well as more extensive time-series, rather than snapshot, sampling to capture patterns of promotion and demotion of members of the AGF community within the lifespan of an animal.

The value of AGF isolation efforts

The strict anaerobic nature of AGF necessitates the implementation of special techniques for their isolation and maintenance55, 56. Further, while several storage methods based on cryopreservation have been proposed57, the decrease in temperature to the ultra-low values and the incidental O2 exposure during revival of the cryopreserved strains were shown before to be detrimental for some isolates. The lack of reliable long-term storage procedures often necessitates frequent subculturing of strains (every 3-4 days), which often leads to either the production of sporangia that do not differentiate to zoospores, or the outright failure to produce sporangia58.

Through a multi-year effort, we were successful in obtaining 216 isolates representing twelve AGF genera. We attribute our success to using multiple strategies (enrichment on multiple carbon sources, and paying special attention to picking colonies of different shapes and sizes, and to picking several colonies of the same shape, as representatives of different genera are known to produce colonies with very similar macroscopic features), but, more importantly, to using a wide range of samples (with varying host lifestyle, gut type, and animal phylogeny). The success of isolation of a certain genus was, in general, attributed to its abundance in the sample (Pearson correlation coefficient=0.79), especially for monocentric genera (e.g. Pecoramyces, Feramyces, Neocallimastix, and Agriosomyces), and was negatively correlated to the sample evenness (Pearson correlation coefficient= −0.87). It remains to be seen if this is true and reproducible for all samples and across laboratories. More efforts are certainly needed to develop targeted isolation strategies for specific taxa that we failed to obtain in pure cultures despite our best effort and despite their abundance in their respective sample (e.g. SK4 in one of the aoudad sheep samples, and RH5 in the domesticated sheep and the axis deer samples).

In conclusion, our results establish the utility of D1/D2 LSU and PacBio sequencing for AGF diversity surveys, and the culturability of a wide range of AGF taxa, and demonstrate that wild herbivores represent a yet-untapped reservoir of AGF diversity.

Experimental Procedures

Samples

Fecal Samples were obtained from six domesticated, six zoo-housed, and nine wild animals (Table 2). The host animals belonged to the families Bovidae (11), Cervidae (6), Equidae (3), and Camelidae (1). The dataset encompassed some replicates from few animal species sometimes with lifestyle variations within a single animal species: Bos taurus (n=2; domesticated cow, and domesticated longhorn cattle), Ovis aries (n=2; domesticated sheep and wild mouflon ram), Capra aegagrus (n=3; domesticated goat, wild Boer goat, and zoo-housed dwarf goat), and Ammotragus lervia (n=2; Aoudad Sheep) (Table 2). Samples from domesticated animals were obtained from Oklahoma State University and surrounding farms between September 2016 and May 2018. Samples from the Oklahoma City Zoo were obtained in April 2019. For samples from wild herbivores, we enlisted the help of hunters in four separate hunting expeditions in Sutton, Val Verde, and Coke counties, Texas (April 2017, July 2017, and April 2018), and Payne County, Oklahoma (October 2017). Appropriate hunting licenses were obtained and the animals were shot either on private land with the owner’s consent or on public land during the hunting season. All samples were stored on ice and promptly (within 20 minutes for domesticated samples, within 1 hour for zoo samples, and within 24 hours for samples obtained during hunting trips) transferred to the laboratory. Upon arrival, a portion of the sample was immediately used for setting enrichments for isolation efforts, while the rest was stored at −20°C for DNA extraction.

Development of D1/D2 LSU locus as a phylogenetic marker

A. Amplification of the ITS1-5.8S rRNA-ITS2-D1/D2 LSU from Neocallimastigomycota isolates

Biomass was harvested from 10 ml of 2-4-day old cultures and crushed in liquid nitrogen. DNA was extracted from the ground fungal biomass using DNeasy PowerPlant Pro Kit (Qiagen, Germantown, Maryland) according to the manufacturer’s instructions (Youssef et al. 2013). A PCR reaction targeting the region encompassing ITS1, 5.8S rRNA, ITS2, and D1/D2 region of the LSU rRNA’ (Figure 6) was conducted using the primers ITS5-NL423. The PCR protocol consisted of an initial denaturation for 5 min at 95 °C followed by 40 cycles of denaturation at 95 °C for 1 min, annealing at 55 °C for 1 min and elongation at 72 °C for 2 min, and a final extension of 72 °C for 20 min. PCR amplicons were purified using PureLink® PCR cleanup kit (Life Technologies, Carlsbad, California), followed by cloning into a TOPO-TA cloning vector according to the manufacturer’s instructions (Life Technologies, Carlsbad, California). Clones (n=1-12 per isolate) were Sanger sequenced at the Oklahoma State University DNA sequencing core facility.

Figure 6.
  • Download figure
  • Open in new tab
Figure 6.

Workflow diagram describing the methods employed in this study.

B. Amplification of the ITS1-5.8S-ITS2-D1/D2 LSU from environmental samples

Fecal material from different animals (0.25-0.5 g) were crushed in liquid nitrogen and total DNA was extracted from the ground sample using DNeasy PowerPlant Pro Kit (Qiagen, Germantown, Maryland) according to the manufacturer’s instructions (Youssef et al. 2013). Extracted DNA was then used as a template for ITS1-5.8S-ITS2-D1/D2 LSU PCR amplification using ITS5 forward primer and the AGF-specific reverse primer GG-NL424. Primers were barcoded to allow PacBio sequencing and multiplexing (Table S2). Amplicons were purified using PureLink® PCR cleanup kit (Life Technologies, Carlsbad, California), quantified using Qubit® (Life Technologies, Carlsbad, California), pooled, and sequenced at Washington State University core facility using one cell of the single molecular real time (SMRT) Pacific Biosciences (PacBio) RSII system.

C. Environmental PacBio-generated sequences quality control

We performed a two-tier quality control protocol on the generated sequences. First, the raw reads were processed according to PacBio published protocols to obtain single molecule consensus reads. Second, we used rigorous sequence quality control in Mothur59 to remove any sequences with low quality from subsequent analysis.

For Raw reads processing, the official PacBio pipeline (RS_Subreads.1) (http://files.pacb.com/software/smrtanalysis/2.2.0/doc/smrtportal/help/!SSL!/Webhelp/CS_Prot_RS_Subreads.htm) was utilized. Raw reads were filtered based on the minimum read length and minimum read quality. The passing reads were then processed with the PacBio RS_ReadsOfInsert protocol (http://files.pacb.com/software/smrtanalysis/2.2.0/doc/smrtportal/help/!SSL!/Webhelp/CS_Prot_RS_ReadsOfInsert.htm) for generating single-molecule consensus reads from the insert template. Consensus reads had a minimum of 5 full passes, 99.95% predicted accuracy, and 1000 bp insert length. The resulting consensus reads had a mean number of passes of 20, mean read length of insert of 1429 bp, and mean polymerase read quality of 0.99.

Sequence quality control procedures were subsequently conducted in Mothur59 to assess the quality of the generated consensus reads utilizing stringent protocols previously suggested for assessing bacterial, archaeal, and fungal diversity for similar sized amplicons33, 52, 53, 54. Reads were filtered in Mothur using trim.seqs to remove reads longer than 2000 bp, reads with average quality score below 25, reads with ambiguous bases, reads not containing the correct barcode sequence, reads with more than 2 bp difference in the primer sequence, and reads with homopolymer stretches longer than 12 bp. Reads with the primer sequence in the middle were identified by performing a standalone Blastn-short using the primer sequence as the query, and were subsequently removed using the remove.seqs command in Mothur.

A mock community (constituted of equal concentration of PCR products of 5 different strains (Aklioshbomyces papillarum strain WT2, Feramyces austinii isolate DS10, Liebetanzomyces sp. isolate Cel1A, Piromyces sp. isolate A1, and Piromyces sp. isolate Jen1) from our culture collection and for which we have obtained at least 5 Sanger clone sequences) was also sequenced. To establish whether the above approaches for overall read- and sequence-based quality control are adequate, we compared PacBio-generated mock sequences to the corresponding Sanger-generated clone sequences. The median percentage similarity between PacBio-generated sequences affiliated with a certain strain and the Sanger-generated clone sequences obtained for that strain (99.05±0.6 to 99.64±0.47) were not significantly different from the median percentage similarities between different clones of the same strain (98.91±0.6 to 99.72±0.47) (Student t-test p-value>0.1) attesting to the adequacy of the above quality control measures in removing low quality sequences.

D. A D1/D2 LSU reference database for cultured and yet-uncultured AGF taxa

A reference D1/D2-LSU sequence database for all Neocallimastigomycota cultured genera present in our culture collection was created via amplification, cloning, and sequencing of the ITS1-5.8S-ITS2-D1/D2 LSU allowing for a direct correlation and cross-referencing of both regions. To obtain D1/D2 LSU sequences representing yet-uncultured candidate genera previously defined by ITS1 sequence data3, 5, 13, 17, 46, the ITS1 region from the PacBio-generated ITS1-5.8S-ITS2-D1/D2 LSU environmental amplicons was extracted in Mothur using the pcr.seqs command with the sequence of the MNGM2 reverse primer and the flag rdiffs=2 to allow for 2 differences in primer sequence. The trimmed sequences corresponding to the ITS1 region were compared, using blastn, to a manually curated Neocallimastigomycota ITS1 database encompassing all known cultured genera, as well as yet-uncultured taxa previously identified in culture-independent studies3, 5, 13, 17, 46 (Figure 6). Sequences were classified as their first hit taxonomy if the percentage similarity to the first hit was >96% and the two sequences were aligned over >70% of the query sequence length. A taxonomy file was then created that contained the name of each sequence in the PacBio-generated environmental dataset and its corresponding taxonomy and was used for assigning taxonomy to the D1/D2 LSU sequence data.

E. Comparison of D1/D2-LSU versus ITS1 as phylogenetic markers

We used the dataset of full length PacBio-generated sequences described above, in addition to 116 Sanger-generated clone sequences from this study and previous studies23, 28, 29, 49, as well as genomic rRNA loci from two Neocallimastigomycota genomes (Pecoramyces ruminantium strain C1A, and Neocallimastix californiae strain G1)36, 43 in which the entire rrn operon sequence data is available to compare the ITS1 and D1/D2-LSU regions with respect to heterogeneity in length and intra-genus sequence divergence. For every sequence, the ITS1, and the D1/D2-LSU regions were bioinformatically extracted in Mothur using the pcr.seqs command (with the reverse primer MNGM2, and the forward primer NL1, for the ITS1, and the D1/D2-LSU regions, respectively) and allowing for two differences in the primer sequence. The trimmed sequences (both ITS1 and D1/D2-LSU) were then sorted into files based on their taxonomy such that for each genus/taxon two fasta files were created, an ITS1 and a D1/D2-LSU. These fasta files were then used to compare length heterogeneity, and intra-genus sequence divergence as follows. Sequences lengths in each fasta file were obtained using the summary.seqs command in Mothur. Intra-genus sequence divergence values were obtained by first creating a multiple sequence alignment using the MAFFT aligner60, followed by generating a sequence divergence distance matrix using the dist.seqs command in Mothur. Box plots for the distribution of length and sequence divergence were created in R.

AGF Diversity assessment using D1/D2 LSU locus

A. Phylogenetic placement

The majority of the D1/D2-LSU sequences bioinformatically extracted from environmental sequences were assigned to an AGF genus as described above. D1/D2-LSU sequences that could not be confidently assigned to an AGF genus were sequentially inserted into a reference LSU tree to assess novelty. Further, the associated ITS1 sequences (obtained from the same amplicon) were similarly inserted into a reference ITS1 tree for confirmation. Sequences were assigned to a novel candidate genus when both loci (LSU and ITS1) cluster as novel, independent genus-level clades with high (>70%) bootstrap support in both trees.

B. Genus and species level delineation

Genus level assignments were conducted via a combination of similarity search and phylogenetic placement as described above. We chose not to depend on sequence divergence cutoffs for OTU delineation at the genus level since some genera exhibit high sequence similarity between their D1/D2-LSU sequences (e.g. Liebetanzomyces, Capellomyces, and Anaeromyces D1/D2-LSU sequence divergence ranges between 1.8-2.5%), while other genera are highly divergent (e.g. Piromyces intra-genus sequence divergence cutoff of the D1/D2-LSU region ranges between 0-5.7%), and as such “a one size fits all” approach should not be applied. On the other hand, a similar approach for OTUs delineation at the species equivalent level is problematic due to uncertainties in circumscribing species boundaries, and inadequate numbers of species representatives in many genera. Therefore, for OTU delineation at the species equivalent level, we reverted to using a sequence divergence cutoff. Historically, cutoffs of 3%16 to 5%3 were used for ITS1-based species equivalent delineation. However, D1/D2-LSU sequence data are more conserved when compared to LSU data in the Neocallimastigomycota15, 26, 27, 28, 29, 30, 50, as well as other fungi19. To obtain an appropriate species equivalent cutoff, we used the 116 Sanger-generated clone sequences from this study and previous studies23, 28, 29, 49, as well as genomic rRNA loci from two Neocallimastigomycota genomes where the entire ribosomal operon sequence is available (Pecoramyces ruminantium strain C1A, and Neocallimastix californiae strain G1)36, 43. The ITS1 and D1/D2-LSU regions were bioinformatically extracted and sorted to separate fasta files. Sequences in each file were then aligned using MAFFT60 and the alignment was used to create a distance matrix for every possible pairwise comparison using dist.seqs command in Mothur. The obtained pairwise distances for the ITS1, and the D1/D2-LSU regions were then correlated to obtain values of D1/D2-LSU sequence divergence cutoffs corresponding to the 3-5% range in ITS1. This was equivalent to 2.0-2.2%, and hence, for this study, a cutoff of 2% was used for OTU generation at the species equivalent level using the D1/D2-LSU region.

C. Diversity and community structure assessments

Genus and species equivalent OTUs generated as described above were used to calculate alpha diversity indices (Chao and Ace richness estimates, Shannon diversity index, Simpson evenness index) for the different samples studied using the summary.seqs command in Mothur. A shared OTUs file created in Mothur using the make.shared command was used to calculate Bray-Curtis beta diversity indices between different samples (using the summary.shared command in Mothur). The shared OTUs file was also used as a starting point for ranking the samples based on their diversity using both an information-related diversity ordering method (Renyi generalized entropy), and an expected number of species-related diversity ordering method (Hulbert family of diversity indices) (Table 2). For community structure visualization, Bray Curtis indices at the genus level were also used to perform non-metric multidimensional scaling using the metaMDS function in the Vegan package in R. Also, the percentage abundance of different genera across the samples studied were used in principal components analysis using the prcomp function in the labdsv package in R. Ordination plots were generated from the two analyses (NMDS and PCA) using the ordiplot function.

D. Statistical analysis

Correlations of the diversity estimates to animal host factors including the animal lifestyle (domesticated, zoo-housed, wild), and the animal host families (Bovidae, Cervidea, Equidae, Camelidae) were calculated using χ2-contingency tables followed by measuring the degree of association using Cramer’s V statistics as detailed before3. In addition, to identify factors impacting AGF diversity, Student t-tests were used to identify significant differences in the above alpha diversity estimates based on animal lifestyle (zoo-housed, domesticated, wild), and host phylogeny (families Bovidae, Equidae, Cervidae). To test the effect of the above host factors on the AGF community structure, Student t-tests were used to identify significant associations between specific AGF taxa and animal lifestyle (zoo-housed, domesticated, wild) or host family (families Bovidae, Equidae, Cervidae).

Isolation efforts

Fecal samples (either freshly obtained, or stored at −20°C in sterile, air-tight plastic tubes) were used for isolation. Care was taken to avoid sample repeated freezing and thawing. Samples were first enriched by incubation for 24 h at 39°C in rumen-fluid-cellobiose (RFC) medium supplemented with antibiotics (50 μg/mL kanamycin, 50 μg/mL penicillin, 20 μg/mL streptomycin, and 50 μg/ mL chloramphenicol)27, 28, 29, 50, 51. Subsequently, enrichments were serially diluted by adding approximately 1 ml of enriched samples to 9 mL of RF medium supplemented with 1% cellulose or a mixture of 0.5% switchgrass and 0.5% cellobiose. Since fungal hyphae and zoospores are usually attached to the coarse particulates in the enrichment, serial dilutions were conducted using Pasteur pipettes rather than syringes and needles, as the narrow bore of the needle prevented the fecal clumps from being transferred. Serial dilutions up to a 10−5 dilution were incubated at 39°C for 24–48 h. Dilutions showing visible signs of growth (clumping or floating plant materials, a change in the color of cellulose, and production of gas bubbles) were then used for the preparation of roll tubes55, 56 on RFC agar medium. At the same time, and as a backup strategy in case the roll tubes failed to produce visible colonies, the dilution tubes themselves were subcultured and transferred to fresh medium with the same carbon source. Single colonies on roll tubes were then picked into liquid RFC medium, and at least three rounds of tube rolling and colony picking were conducted to ensure purity of the obtained colonies. To maximize the chances of obtaining isolates belonging to different genera, special attention was given, not only to picking colonies of different shapes and sizes, but also to picking several colonies of the same shape, as representatives of different genera could produce colonies with very similar macroscopic features. Isolates were maintained by biweekly subculturing into RFC medium. For long-term storage, cultures were stored on agar medium according to the procedure described by Calkins et al.51.

Data accession

Sanger-generated clone sequences from pure cultures were deposited in GenBank under accession numbers MT085665 - MT085741. SMRT-generated sequences were deposited at DDBJ/EMBL/GenBank under the Bioproject accession number PRJNA609702, Biosample accession numbers SAMN14258225, and Targeted Locus Study project accession KDVX00000000. The version described in this paper is the first version, KDVX01000000.

Competing interests

The authors declare no competing interest

Acknowledgements

This work has been funded by the NSF-DEB grant 1557102 to N.H.Y. and M.S.E.

References

  1. 1.↵
    Gruninger RJ, et al. Anaerobic Fungi (Phylum Neocallimastigomycota): Advances in understanding of their taxonomy, life cycle, ecology, role, and biotechnological potential. FEMS Microbiol Ecol 90, 1–17 (2014).
    OpenUrlCrossRefPubMed
  2. 2.↵
    Wang Y, Youssef NH, Couger MB, Hanafy RA, Elshahed MS, Stajich JE. Molecular Dating of the emergence of anaerobic rumen fungi and the impact of laterally acquired genes. mSystems 4, e00247–00219 (2019).
    OpenUrl
  3. 3.↵
    Liggenstoffer AS, Youssef NH, Couger MB, Elshahed MS. Phylogenetic diversity and community structure of anaerobic fungi (Phylum Neacallymastigales) in ruminant and non-ruminant herbivores. ISME J 4, 1225–1235 (2010).
    OpenUrlCrossRefPubMed
  4. 4.↵
    Edwards JE, et al. Dynamics of initial colonization of nonconserved perennial ryegrass by anaerobic fungi in the bovine rumen. FEMS Microbiol Ecol 66, 537–545 (2008).
    OpenUrlCrossRefPubMed
  5. 5.↵
    Kittelmann S, G. E. Naylor, J. P. Koolaard, Janssen PH. A proposed taxonomy of anaerobic fungi (class Neocallimastigomycetes) suitable for large-scale sequence-based community structure analysis. PloS one 7, e36866 (2012).
    OpenUrlCrossRefPubMed
  6. 6.↵
    Kittelmann S, et al. Simultaneous amplicon sequencing to explore co-occurrence patterns of bacterial, archaeal and eukaryotic microorganisms in rumen microbial communities. PloS one 8, e47879 (2013).
    OpenUrlCrossRefPubMed
  7. 7.↵
    Edwards JE, et al. PCR and omics based techniques to study the diversity, ecology and biology of anaerobic fungi: insights, challenges and opportunities. Front Microbiol 8, 1657 (2017).
    OpenUrlCrossRefPubMed
  8. 8.↵
    Schoch CL, et al. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc Natl Acad Sci USA 109, 6241–6246 (2012).
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    Nilsson RH, et al. The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications. Nucl Acids Res 47, D259–d264 (2019).
    OpenUrlCrossRefPubMed
  10. 10.↵
    Dore J, Stahl DA. Phylogeny of anaerobic rumen Chytridiomycetes inferred from small subunit ribosomal RNA sequence comparisons. Can J Bot 69, 1964–1971 (1991).
    OpenUrlCrossRef
  11. 11.↵
    Kiss L. Limits of nuclear ribosomal DNA internal transcribed spacer (ITS) sequences as species barcodes for Fungi. Proc Natl Acad Sci USA 109, E1811 (2012).
    OpenUrlFREE Full Text
  12. 12.↵
    James TY, et al. Reconstructing the early evolution of Fungi using a six-gene phylogeny. Nature 443, 818–822 (2006).
    OpenUrlCrossRefPubMedWeb of Science
  13. 13.↵
    Koetschan C, et al. Internal transcribed spacer 1 secondary structure analysis reveals a common core throughout the anaerobic fungi (Neocallimastigomycota). PloS one 9, e91928 (2014).
    OpenUrlCrossRefPubMed
  14. 14.↵
    Edwards JE, Hermes GDA, Kittelmann S, Nijsse B, Smidt H. Assessment of the accuracy of high-throughput sequencing of the its1 region of Neocallimastigomycota for community composition analysis. Front Micorobiol 10, 2370 (2019).
    OpenUrl
  15. 15.↵
    Callaghan TM, et al. Buwchfawromyces eastonii gen. nov., sp. nov.: a new anaerobic fungus (Neocallimastigomycota) isolated from buffalo faeces. MycoKeys 9, 11–28 (2015).
    OpenUrl
  16. 16.↵
    Mura E, et al. Anaerobic fungal communities differ along the horse digestive tract. Fung Biol 123, 240–246 (2019).
    OpenUrl
  17. 17.↵
    Paul SS, Bu D, Xu J, Hyde KD, Yu Z. A phylogenetic census of global diversity of gut anaerobic fungi and a new taxonomic framework. Fung Div 89, 253–266 (2018).
    OpenUrl
  18. 18.↵
    Kumar S, Indugu N, Vecchiarelli B, Pitta DW. Associative patterns among anaerobic fungi, methanogenic archaea, and bacterial communities in response to changes in diet and age in the rumen of dairy cows. Front Micorobiol 6, 781 (2015).
    OpenUrl
  19. 19.↵
    Guadet J, Julien J, Lafay JF, Brygoo Y. Phylogeny of some Fusarium species, as determined by large-subunit rRNA sequence comparison. Mol Biol Evol 6, 227–242 (1989).
    OpenUrlPubMedWeb of Science
  20. 20.↵
    Dagar SS, Kumar S, Mudgil P, Singh R, Puniya AK. D1/D2 domain of large-subunit ribosomal DNA for differentiation of Orpinomyces spp. Appl Environ Microbiol 77, (2011).
  21. 21.↵
    Hausner G, Inglis GD, Yanke LJ, Kawchuk LM, Mcallister TA. Analysis of restriction fragment length polymorphisms in the ribosomal DNA of a selection of anaerobic chytrids. Can J Bot 78, 917–927 (2000).
    OpenUrlCrossRef
  22. 22.↵
    Fliegerova K, Mrazek J, Voigt K. Differentiation of anaerobic polycentric fungi by rDNA PCR-RFLP. Folia Microbiol 51, 273–277 (2006).
    OpenUrl
  23. 23.↵
    Wang X, Liu X, Groenewald JZ. Phylogeny of anaerobic fungi (phylum Neocallimastigomycota), with contributions from yak in China. Antonie Van Leeuwenhoek 110, 87–103 (2017).
    OpenUrlCrossRef
  24. 24.↵
    Nagler M, Podmirseg SM, Griffith GW, Insam H, Ascher-Jenull J. The use of extracellular DNA as a proxy for specific microbial activity. Appl Microbiol Biotechnol 102, 2885–2898 (2018).
    OpenUrl
  25. 25.↵
    Liu K-L, Porras-Alfaro A, Kuske CR, Eichorst SA, Xie G. Accurate, Rapid taxonomic classification of fungal large-subunit rRNA genes. Appl Environ Microbiol 78, 1523–1533 (2012).
    OpenUrlAbstract/FREE Full Text
  26. 26.↵
    Dagar SS, et al. A new anaerobic fungus (Oontomyces anksri gen. nov., sp. nov.) from the digestive tract of the Indian camel (Camelus dromedarius). Fung Biol 119, 731–737 (2015).
    OpenUrl
  27. 27.↵
    Hanafy RA, Elshahed MS, Liggenstoffer AS, Griffith GW, Youssef NH. Pecoramyces ruminantium, gen. nov., sp. nov., an anaerobic gut fungus from the feces of cattle and sheep. Mycologia 109, 231–243 (2017).
    OpenUrlCrossRef
  28. 28.↵
    Hanafy RA, Elshahed MS, Youssef NH. Feramyces austinii, gen. nov., sp. nov., an anaerobic gut fungus from rumen and fecal samples of wild Barbary sheep and fallow deer. Mycologia 110, 513–525 (2018).
    OpenUrlCrossRef
  29. 29.↵
    Hanafy RA, Johnson B, Elshahed MS, Youssef NH. Anaeromyces contortus, sp. nov., a new anaerobic gut fungal species (Neocallimastigomycota) isolated from the feces of cow and goat. Mycologia 110, 502–512 (2018).
    OpenUrlCrossRef
  30. 30.↵
    Joshi A, Lanjekar VB, Dhakephalkar PK, Callaghan TM, Griffith GW, Dagar SS. Liebetanzomyces polymorphus gen. et sp. nov., a new anaerobic fungus (Neocallimastigomycota) isolated from the rumen of a goat. MycoKeys 40, (2018).
  31. 31.↵
    Dollhofer V, Callaghan TM, Dorn-In S, Bauer J, Lebuhn M. Development of three specific PCR-based tools to determine quantity, cellulolytic transcriptional activity and phylogeny of anaerobic fungi. J Microbiol Meth 127, 28–40 (2016).
    OpenUrl
  32. 32.↵
    Heeger F, et al. Long-read DNA metabarcoding of ribosomal RNA in the analysis of fungi from aquatic environments. Molec Ecol Resources 18, 1500–1514 (2018).
    OpenUrl
  33. 33.↵
    Tedersoo L, Tooming-Klunderud A, Anslan S. PacBio metabarcoding of Fungi and other eukaryotes: errors, biases and perspectives. New Phytol 217, 1370–1385 (2018).
    OpenUrl
  34. 34.↵
    Griffith GW, Ozkose E, Theodorou MK, Davies DR. Diversity of anaerobic fungal populations in cattle revealed by selective enrichment culture using different carbon sources. Fung Ecol 2, 87–97 (2009).
    OpenUrl
  35. 35.↵
    Solomon KV, et al. Early-branching gut fungi possess a large, comprehensive array of biomass-degrading enzymes. Science 351, 1192–1195 (2016).
    OpenUrlAbstract/FREE Full Text
  36. 36.↵
    Haitjema CH, et al. A parts list for fungal cellulosomes revealed by comparative genomics. Nat Microbiol 2, 17087 (2017).
    OpenUrl
  37. 37.↵
    Henske JK, Gilmore SP, Haitjema CH, Solomon KV, O’Malley MA. Biomass-degrading enzymes are catabolite repressed in anaerobic gut fungi. AIChE Journal, (2018).
  38. 38.↵
    Henske JK, et al. Metabolic characterization of anaerobic fungi provides a path forward for bioprocessing of crude lignocellulose. Biotechnol Bioeng 115, 874–884 (2018).
    OpenUrl
  39. 39.↵
    Seppälä S, Solomon KS, Gilmore SP, Henske JK, Rieth MD, O’Malley MA. Mapping the membrane proteome of anaerobic gut fungi using RNA-Seq. Biophys J 110, 58a–59a (2016).
    OpenUrl
  40. 40.↵
    Murphy CL, et al. Horizontal Gene Transfer forged the evolution of anaerobic gut fungi into a phylogenetically distinct gut-dwelling fungal lineage. Appl Environ Microbiol 85, :e00988–00919 (2019).
    OpenUrl
  41. 41.↵
    Calkins SS, et al. Development of an RNA interference (RNAi) gene knockdown protocol in the anaerobic gut fungus Pecoramyces ruminantium strain C1A. PeerJ 6, e4276 (2018).
    OpenUrl
  42. 42.↵
    Couger MB, Youssef NH, Struchtemeyer CG, Liggenstoffer AS, Elshahed MS. Transcriptomic analysis of lignocellulosic biomass degradation by the anaerobic fungal isolate Orpinomyces sp. strain C1A. Biotechnol Bioeng 8, 208 (2015).
    OpenUrl
  43. 43.↵
    Youssef NH, et al. Genome of the anaerobic fungus Orpinomyces sp. C1A reveals the unique evolutionary history of a remarkable plant biomass degrader. Appl Environ Microbiol 79, 4620–4634 (2013).
    OpenUrlAbstract/FREE Full Text
  44. 44.↵
    Ho YW, Barr DJS, Abdullah N, Jalaludin S, Kudo H. Piromyces spiralis, a new species of anaerobic fungus from the rumen of goat Mycotaxon 48, :59–68 (1993).
    OpenUrl
  45. 45.↵
    Fliegerová K, Mrázek J, Hoffmann K, Zábranská J, Voigt K. Diversity of anaerobic fungi within cow manure determined by ITS1 analysis. Folia Microbiol 55, 319–325 (2010).
    OpenUrl
  46. 46.↵
    Herrera J, Poudel R, Khidir HH. Molecular characterization of coprophilous fungal communities reveals sequences related to root-associated fungal endophytes. Microb Ecol 61, 239–244 (2011).
    OpenUrlCrossRefPubMed
  47. 47.↵
    Nicholson MJ, McSweeney CS, Mackie RI, Brookman JL, Theodorou MK. Diversity of anaerobic gut fungal populations analysed using ribosomal ITS1 sequences in faeces of wild and domesticated herbivores. Anaerobe 16, 66–73 (2010).
    OpenUrlCrossRefPubMed
  48. 48.↵
    Tuckwell DS, Nicholson MJ, McSweeney CS, Theodorou MK, Brookman JL. . The rapid assignment of ruminal fungi to presumptive genera using ITS1 and ITS2 RNA secondary structures to produce group-specific fingerprints. Microbiology 151, 1557–1567 (2005).
    OpenUrlCrossRefPubMedWeb of Science
  49. 49.↵
    Nicholson MJ, Theodorou MK, Brookman JL. Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome. Microbiology 151, 121–133 (2005).
    OpenUrlCrossRefPubMed
  50. 50.↵
    Hanafy RA, et al. Seven new Neocallimastigomycota genera from wild, zoo-housed, and domesticated herbivores greatly expand the taxonomic diversity of the phylum. Mycologia doi: 10.1080/00275514.2019.1696619. (2020).
    OpenUrlCrossRef
  51. 51.↵
    Calkins S, Elledge NC, Hanafy RA, Elshahed MS, Youssef N. A fast and reliable procedure for spore collection from anaerobic fungi: Application for RNA uptake and long-term storage of isolates. J Microbiol Methods 127, 206–213 (2016).
    OpenUrlCrossRef
  52. 52.↵
    Bahram M, Anslan S, Hildebrand F, Bork P, Tedersoo L. Newly designed 16S rRNA metabarcoding primers amplify diverse and novel archaeal taxa from the environment. Environ Microbiol Rep 11, 487–494 (2019).
    OpenUrl
  53. 53.↵
    Schloss PD, Jenior ML, Koumpouras CC, Westcott SL, Highlander SK. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. PeerJ 4, e1869 (2016).
    OpenUrlCrossRef
  54. 54.↵
    Schlaeppi K, et al. High-resolution community profiling of arbuscular mycorrhizal fungi. New Phytol 212, 780–791 (2016).
    OpenUrl
  55. 55.↵
    Bryant M. Commentary on the Hungate technique for culture of anaerobic bacteria. Am J Clin Nutr 25, 1324–1328 (1972).
    OpenUrlFREE Full Text
  56. 56.↵
    Hungate RE. A roll tube method for cultivation of strict anaerobes. Meth Microbiol 3, 117–132 (1969).
    OpenUrl
  57. 57.↵
    Solomon KV, Henske JK, Theodorou MK, O’Malley MA. Robust and effective methodologies for cryopreservation and DNA extraction from anaerobic gut fungi. Anaerobe 38, 39–46 (2016).
    OpenUrl
  58. 58.↵
    Ho YW, Barr DJS. Classification of anaerobic gut fungi from herbivores with emphasis on rumen fungi from Malaysia. Mycologia 87, 655–677 (1995).
    OpenUrlCrossRefWeb of Science
  59. 59.↵
    Schloss PD, et al. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol 75, 7537 (2009).
    OpenUrlAbstract/FREE Full Text
  60. 60.↵
    Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30, 772–780 (2013).
    OpenUrlCrossRefPubMedWeb of Science
  61. 61.↵
    Price MN, Dehal PS, Arkin AP. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol 26, 1641–1650 (2009).
    OpenUrlCrossRefPubMedWeb of Science
  62. 62.↵
    Price MN, Dehal PS, Arkin AP. FastTree 2--approximately maximum-likelihood trees for large alignments. PloS one 5, e9490 (2010).
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted March 25, 2020.
Download PDF

Supplementary Material

Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Assessing anaerobic gut fungal (Neocalliamstigomycota) diversity using PacBio D1/D2 LSU rRNA amplicon sequencing and multi-year isolation
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Assessing anaerobic gut fungal (Neocalliamstigomycota) diversity using PacBio D1/D2 LSU rRNA amplicon sequencing and multi-year isolation
Radwa A. Hanafy, Britny Johnson, Noha H. Youssef, Mostafa S. Elshahed
bioRxiv 2020.03.24.005967; doi: https://doi.org/10.1101/2020.03.24.005967
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Assessing anaerobic gut fungal (Neocalliamstigomycota) diversity using PacBio D1/D2 LSU rRNA amplicon sequencing and multi-year isolation
Radwa A. Hanafy, Britny Johnson, Noha H. Youssef, Mostafa S. Elshahed
bioRxiv 2020.03.24.005967; doi: https://doi.org/10.1101/2020.03.24.005967

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Microbiology
Subject Areas
All Articles
  • Animal Behavior and Cognition (4667)
  • Biochemistry (10332)
  • Bioengineering (7653)
  • Bioinformatics (26277)
  • Biophysics (13497)
  • Cancer Biology (10663)
  • Cell Biology (15388)
  • Clinical Trials (138)
  • Developmental Biology (8480)
  • Ecology (12800)
  • Epidemiology (2067)
  • Evolutionary Biology (16817)
  • Genetics (11378)
  • Genomics (15451)
  • Immunology (10591)
  • Microbiology (25139)
  • Molecular Biology (10186)
  • Neuroscience (54316)
  • Paleontology (399)
  • Pathology (1663)
  • Pharmacology and Toxicology (2889)
  • Physiology (4331)
  • Plant Biology (9223)
  • Scientific Communication and Education (1585)
  • Synthetic Biology (2551)
  • Systems Biology (6769)
  • Zoology (1459)