CaptureSeq: Capture-based enrichment of cpn60 gene fragments empowers pan-Domain profiling of microbial communities without universal PCR

Matthew G. Links; Tim J. Dumonceaux; Luke McCarthy; Sean M. Hemmingsen; Edward Topp; Alexia Comte; Jennifer R. Town

doi:10.1101/492116

ABSTRACT

Molecular profiling of complex microbial communities has become the basis for examining the relationship between the microbiome composition, structure and metabolic functions of those communities. Microbial community structure can be partially assessed with universal PCR targeting taxonomic or functional gene markers. Increasingly, shotgun metagenomic DNA sequencing is providing more quantitative insight into microbiomes. Unfortunately both amplicon-based and shotgun sequencing approaches have significant shortcomings that limit the ability to study microbiome dynamics. We present a novel, amplicon-free, hybridization-based method (CaptureSeq) for profiling complex microbial communities using probes based on the chaperonin-60 gene. This new method generates a quantitative, pan-Domain community profile with significantly less expenditure and sequencing effort than a shotgun metagenomic sequencing approach. Molecular microbial profiles were compared for antibiotic-amended soil samples using CaptureSeq, shotgun metagenomics, and amplicon-based techniques. The CaptureSeq method generated a microbial profile that provided a much greater depth and sensitivity than shotgun metagenomic sequencing while simultaneously mitigating the bias effects associated with amplicon-based methods. The resulting community profile provided quantitatively reliable information about all three Domains of life (Bacteria, Archaea, and Eukarya). The applications of CaptureSeq are globally impactful and will facilitate highly accurate studies of host-microbiome interactions for environmental, crop, animal and human health.

DNA sequencing data associated with this work has been deposited at NCBI under BioProject PRJNA406970 and SRA deposits SRX3181274-SRX3181276 and SRX3187583-SRX3187601.

INTRODUCTION

Life on Earth is classified into hierarchical taxonomic lineages that describe all living systems as having descended from a common ancestor along three evolutionary lines. Using ribosomal RNA-encoding gene sequences, Woese and Fox ¹ delineated these Domains, which are now known as Bacteria, Archaea, and Eukarya ². Most complex microbial communities exist as assemblages replete with representatives from each of these Domains, the total genomic complement of which is called a microbiome. Understanding microbial community dynamics requires tools to examine the composition of these complex ecosystems. Advancements in DNA sequencing technology have created new opportunities to simplify the profiling of microbial communities from a diverse range of environments. As new insights are gained into the diversity of microbiomes in soil, water, plant and animal-associated ecosystems, we are collectively realizing the powerful effects that microbiome composition and structure can have on how these communities function ³. To characterize the multifaceted relationships between microorganisms and their environment, it is critical to obtain a comprehensive microbial community profile that most accurately reflects its original composition and quantitative structure.

Microbiologists have increasingly embraced culture-independent methods of identification in recent decades ⁴. By far the most commonly employed culture-independent method is PCR-based amplification of informative gene sequences. In adapting the use of PCR for amplifying a conserved region of 16S rRNA, Weller and Ward provided the first example of microbial profiling ⁵. More recently, Paul Hebert’s proposed DNA barcoding criteria for Eukarya have established standards for what comprises a robust target for phylogenetic profiling ⁶. Alternative universal gene markers such as 16S ⁷, cpn60 ⁸, rpoB ⁹, mcrA ¹⁰ and ITS ¹¹ have been used for profiling microorganisms from bacterial, archaeal and eukaryotic Domains; however, no single amplification is able to profile microbes from all three Domains simultaneously. In order to obtain phylogenetic information for microorganisms across all three Domains of life, separate target amplification and processing protocols are required¹², increasing the cost and analytical complexity of accurately assessing dynamic changes in the community across Domains. Moreover, stochastic effects of primer interaction with a complex template, along with the difficulty in designing primers and amplification conditions that will equally target all members of a community¹³, result in an unavoidable bias in community representation both in terms of presence/absence and relative abundances^13–16.

In recent years, metagenomic approaches in which whole nucleic acid recovered from a sample is fragmented and sequenced using shotgun methods have become increasingly popular. This approach has a significant advantage over barcode-specific methods in that shotgun-sequencing data can overcome issues of bias and representation that are inherent in amplicon sequencing approaches, and provides the additional advantage of describing the metabolic potential of the microbial community ^17–19. Sequencing of all DNA present in an environmental sample can therefore be considered somewhat of a “gold standard” for taxonomic profiling. However, this approach is not without its own limitations. For example, it can be a wasteful enterprise in terms of the phylogenetic information recovered per sequencing cost. Shotgun sequencing is also not easily able to connect the functional potential observed in the sequencing data with the exact microbe within which that functionality resides. Additionally, DNA acquired from a community of microorganisms is inherently unbalanced; there are not equal numbers of each taxon, nor do all taxa have genomes that are of equal sizes. Thus shotgun sequencing can provide a view of microbial community composition that is biased by genome size and microbial abundances. Overcoming this bias requires significant amounts of sequencing; therefore, chasing the rarity of the least abundant microbes by shotgun metagenomic sequencing carries a high financial cost ^14,15,20,21. The abundances of microbes within characterized complex microbial communities range over many orders of magnitude. While shotgun sequencing efforts provide a reasonable estimate of abundance, there is a significant loss in dynamic range when compared to PCR-based profiling.

The chaperonin 60 gene ⁸ (type I chaperonin) and its Archaeal homologue thermosome complex ²² (type II chaperonin) have been previously recognized as highly discriminating targets across all three Domains of life ²³, meet standard International Barcode of Life criteria ²⁴ and enable de novo assembly of operational taxonomic units (OTU) ²⁵. While “universal” PCR primers are available ^8,26, they are not expected to capture the pan-Domain diversity of a complex microbial community through amplification. Moreover, cpn60 amplification provides OTU abundances that do not always correlate to the true abundance of the microorganism in the sample ²⁷. If these limitations can be overcome, there is significant opportunity to dramatically improve research assessing host-microbiome interactions in plant, human and animal settings.

Recent advances in hybridization-based DNA capture combined with high throughput sequencing (CaptureSeq), which have proven to be remarkably powerful means of enriching samples for DNA sequences of interest ^28–30, led us to consider the possibility of exploiting the unique features of cpn60 to provide a pan-Domain microbial community profile without the use of universal PCR amplifications. A custom array of biotinylated RNA capture baits was designed based on the entire taxonomic composition of the chaperonin database cpnDB (www.cpndb.ca) ⁸ and evaluated as a tool for enriching total genomic DNA simultaneously for type I and type II chaperonin target sequences. Samples were selected that encompassed taxonomic diversity across all three Domains of life. Soil samples comprised primarily of Bacteria, manure samples with increased Archaeal diversity and a terrestrial pond sample with a larger number of Eukarya were used to compare the CaptureSeq method to standard shotgun metagenomic and amplicon-based approaches. The results indicate that CaptureSeq provides the taxonomic reach associated with shotgun metagenomic sequencing combined with the sampling depth of amplicon-based sequencing, giving an essentially complete, balanced, quantitatively accurate view of complex microbial ecosystems with reduced sequencing effort.

RESULTS

CaptureSeq generates Pan-Domain microbial community profiles

Microbial profiles were generated by CaptureSeq using samples from very different environmental ecosystems including soil, manure and a non-aerated terrestrial pond. These profiles provided a taxonomic overview of Bacteria, Archaea and Eukarya simultaneously, and identified sequencing reads from 9,361 (soil), 9,306 (manure), and 6,568 (pond) distinct taxonomic clusters (Supplemental Dataset S1). Additionally, the CaptureSeq profile facilitated inter-Domain comparisons of read abundances between taxonomic groups, since the abundances could be expressed in relation to the total pan-Domain community as opposed to reflecting only the proportions within a single Domain (Figure 1).

Figure 1:

CaptureSeq was used to simultaneously profile Bacteria, Archaea, and Eukarya from an ecologically diverse range of samples including soil (n=6), manure (n=3), and a freshwater pond (n=1). The relative abundances of individual Phyla were expressed as a proportion of the entire pan-Domain microbial community.

The soil sample microbiomes were composed primarily of Bacteria, with Proteobacteria and Actinobacteria comprising 60% and 25% of the pan-Domain community respectively. Members of the phyla Acidobacteria and Gemmatimonadetes represented an additional 5% each of the microbiome. Total archaeal reads only accounted for 0.03-0.08% of the soil pan-Domain community; however, there were still 165 archaeal taxonomic clusters identified in the soil. Eukarya represented 0.18-0.21% of the soil microbiome, with Fungi and Metazoa the most abundant taxonomic groups. While the manure samples also contained a diverse array of Bacteria, they only represented 77-80% of the microbiome, compared to >99% for all of the soil samples. CaptureSeq libraries from the manure samples contained 19-22% archaeal reads, of which the vast majority were methanogens from the Phylum Euryarchaeota. The terrestrial pond contained a much greater proportion and diversity of Eukaryotes, representing 6.7% of the sequencing reads and 361 taxonomic clusters (Supplemental Dataset S1). De novo assembly of eukaryotic sequencing reads from the terrestrial pond sample generated 11 OTU most closely related to members of the Phylum Chlorophyta (green algae). Additionally, the assembly of OTU most similar to Anopheles sp. (mosquitoes), and three members of the Phylum Alveolata (protists), suggests that CaptureSeq was able to retrieve cpn60 DNA from higher level Eukarya. Compared to reference sequences in cpnDB, these de novo assembled OTU had nucleotide identities ranging from 59-84%, suggesting that the current probe array design and hybridization conditions were sufficiently permissive to allow capture of novel cpn60 sequences (true unknowns).

CaptureSeq provides a similar microbial community profile to shotgun metagenomic sequencing

The complex taxonomic diversity found in soil provided an opportunity to determine if CaptureSeq yields a microbial community profile that accurately reflects the composition of the community and facilitates insights into the response of the communities to perturbation. Therefore, replicate plots amended with antibiotics were compared to control (unamended) soil samples using CaptureSeq, shotgun metagenomics, or cpn60-based amplicon sequencing techniques. In this setting, the ability of CaptureSeq to achieve in-depth sampling that is a more accurate reflection of the community composition is critical to elucidate the effects of antimicrobial exposure on microbial ecosystem dynamics.

Both CaptureSeq and metagenomic techniques generated type I chaperonin sequences from all three Domains unencumbered by amplification and primer design biases. However, the number of chaperonin containing sequences represented only 0.08% of the total reads from the shotgun metagenomic library compared to an average of 16.7% (± 0.8%) for CaptureSeq and 94.8% (± 0.6%) for amplicon libraries (Supplemental Table S1). For a complex community such as soil, a greater sampling depth is required in order to make meaningful conclusions regarding microbial community composition and structure. Using a metagenomic approach requires orders of magnitude more sequencing effort to achieve a high level of community coverage and is not financially feasible for a large number of samples (Figure 2).
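As a rough worked check of the percentages quoted above, the enrichment of chaperonin reads relative to shotgun sequencing can be expressed as a ratio of on-target read fractions. The sketch below uses only the values stated in this paragraph; the function name is illustrative and is not part of any published pipeline.

```python
def fold_enrichment(pct_before: float, pct_after: float) -> float:
    """Ratio of on-target read percentages after vs. before enrichment."""
    if pct_before <= 0:
        raise ValueError("pct_before must be positive")
    return pct_after / pct_before


# Percent of reads containing chaperonin sequence, as quoted in the text.
SHOTGUN_PCT = 0.08    # shotgun metagenomic library
CAPTURE_PCT = 16.7    # CaptureSeq libraries (mean)
AMPLICON_PCT = 94.8   # amplicon libraries (mean)

print("CaptureSeq vs shotgun:", fold_enrichment(SHOTGUN_PCT, CAPTURE_PCT))
print("Amplicon vs shotgun:  ", fold_enrichment(SHOTGUN_PCT, AMPLICON_PCT))
```

With these inputs the CaptureSeq ratio comes to a little over 200-fold, and the amplicon ratio to more than 1,000-fold.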

Figure 2:

Good’s coverage estimate reflecting the average total sequencing effort for six soil samples each profiled using amplicon (red), CaptureSeq (blue), or shotgun metagenomic (green) approaches.
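For readers unfamiliar with the metric, Good's coverage is commonly estimated as 1 − (n₁ / N), where n₁ is the number of OTU observed exactly once and N is the total number of reads. The following is a minimal sketch of that standard formula; the function name and the example counts are hypothetical and are not taken from this study's data or software.

```python
from typing import Sequence


def goods_coverage(otu_counts: Sequence[int]) -> float:
    """Good's coverage estimate: 1 - (singletons / total reads)."""
    total_reads = sum(c for c in otu_counts if c > 0)
    if total_reads == 0:
        raise ValueError("at least one read is required")
    singletons = sum(1 for c in otu_counts if c == 1)
    return 1.0 - singletons / total_reads


# Hypothetical read counts per OTU for one library (illustration only).
example_counts = [10, 5, 1, 1, 3]
print(goods_coverage(example_counts))
```

A value near 1 indicates that few OTU were seen only once, i.e. that additional sequencing is unlikely to reveal many new OTU.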

Examination of OTU abundance patterns revealed that the CaptureSeq and shotgun metagenomic profiled samples displayed patterns of microbial abundances that were more similar to one another and distinct from the pattern shown by the amplicon datasets (Figure 3). Moreover, of the three methods analyzed, only CaptureSeq produced a hierarchical clustering pattern that separated the antibiotic-treated from the untreated soil samples (Figure 3). Similarly, when intra-technique beta diversity was assessed, only the CaptureSeq data provided measures that showed a separation of the soil samples by antibiotic treatment (Supplemental Figure S1). These results highlight the importance of profiling method on the ability to gain meaningful insights into microbiome structure and function.

Figure 3:

Proportional abundance of taxonomic clusters for type I chaperonins in soil samples profiled using amplicon, CaptureSeq, or shotgun metagenomic approaches. Samples were clustered based on Bray-Curtis distance, and reference clusters composing a minimum of 0.5% of the mapped sequencing reads in any one sample are shown.
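The Bray-Curtis distance used for clustering can be illustrated with a short sketch. It implements the standard definition, the sum of absolute differences divided by the sum of all abundances, on two abundance vectors that share the same cluster order. The function name and example values are hypothetical, and the clustering software actually used is not specified here.

```python
from typing import Sequence


def bray_curtis(a: Sequence[float], b: Sequence[float]) -> float:
    """Bray-Curtis dissimilarity between two non-negative abundance vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    if any(v < 0 for v in a) or any(v < 0 for v in b):
        raise ValueError("abundances must be non-negative")
    denominator = sum(a) + sum(b)
    if denominator == 0:
        raise ValueError("at least one vector must contain a non-zero value")
    numerator = sum(abs(x - y) for x, y in zip(a, b))
    return numerator / denominator


# Hypothetical proportional abundances for three taxonomic clusters.
sample_1 = [0.5, 0.3, 0.2]
sample_2 = [0.4, 0.4, 0.2]
print(bray_curtis(sample_1, sample_2))
```

The value ranges from 0 (identical profiles) to 1 (no shared clusters); a matrix of pairwise values is what a hierarchical clustering routine would take as input.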

Comparing alpha diversity metrics of the soil communities between the three profiling techniques suggested that both richness (Chao1) and diversity (Shannon H’) were higher when profiled using shotgun metagenomic sequencing compared to amplicon sequencing (Supplemental Figure S2). The CaptureSeq method provided alpha diversity metrics that were between those of the shotgun metagenomic method and amplicon sequencing (Supplemental Figure S2). Additionally, the alpha diversity metrics of the CaptureSeq method showed the least variability among the biological replicates of each treatment, even when libraries were down-sampled to very low levels (Supplemental Figure S2). Samples examined by cpn60 amplification and sequencing displayed the highest inter-sample variability compared to CaptureSeq and metagenomic sequencing.
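The two alpha diversity metrics named above can be sketched as follows. Shannon H' is computed with natural logarithms, and Chao1 uses the widely used bias-corrected form S_obs + F₁(F₁ − 1) / (2(F₂ + 1)), where F₁ and F₂ are the numbers of singleton and doubleton OTU. Which variant and log base were used in this study is not stated, so both choices are assumptions, as are the function names and example counts.

```python
import math
from typing import Sequence


def shannon_index(otu_counts: Sequence[int]) -> float:
    """Shannon diversity H' = -sum(p_i * ln(p_i)) over observed OTU."""
    total = sum(otu_counts)
    if total <= 0:
        raise ValueError("at least one read is required")
    h = 0.0
    for count in otu_counts:
        if count > 0:
            p = count / total
            h -= p * math.log(p)
    return h


def chao1_bias_corrected(otu_counts: Sequence[int]) -> float:
    """Bias-corrected Chao1 richness estimate (assumed variant)."""
    observed = sum(1 for c in otu_counts if c > 0)
    singletons = sum(1 for c in otu_counts if c == 1)
    doubletons = sum(1 for c in otu_counts if c == 2)
    return observed + singletons * (singletons - 1) / (2 * (doubletons + 1))


# Hypothetical read counts per OTU (illustration only).
counts = [10, 5, 1, 1, 2]
print(shannon_index(counts), chao1_bias_corrected(counts))
```

Because Chao1 depends on the rarest OTU, it is sensitive to sequencing depth, which is one reason diversity metrics are typically compared on down-sampled libraries.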

CaptureSeq permits de novo assembly of OTU from taxonomic clusters

To determine if de novo assembly of OTU representing individual organisms was reliable using CaptureSeq, we selected one target microorganism from each Domain for quantification using OTU-specific qPCR. For Bacteria, we quantified Microbacterium sp. C448, which was cultured from these soil samples and has previously been shown to degrade and metabolize the sulfonamide antibiotic added to the field plots³¹. While the presence of this target in the soil samples was confirmed using culture methods, it was under-represented in the amplicon and shotgun metagenomic libraries when compared to the CaptureSeq profiles. Only the CaptureSeq library provided a sufficient number of target sequencing reads for de novo assembly, generating a 1,066 bp OTU that was >99% identical to the cpn60 sequence obtained from the genome of this organism ³². We also assembled OTU targets from the Domains Eukarya (type I-Phytophthora infestans) and Archaea (type II-Methanoculleus sp.). Reads that mapped to the reference chaperonin sequences for these organisms were assembled de novo into OTU and were then quantified in each soil sample using ddPCR. Quantification of Microbacterium sp. C448 showed that the bacterium was present at a low level in all soil samples of between 10³ and 10⁴ gene copies per gram of soil, and that the levels were significantly higher in the antibiotic-treated soil samples (Table 1). The archaeal OTU was quantified at levels between 495 and 527 gene copies per gram of soil. The OTU corresponding to P. infestans was present at levels below the limit of detection of ddPCR for these samples, yet was detectable by CaptureSeq (Table 1). These results confirm the potential of the CaptureSeq method to almost completely sample complex microbial communities with a limit of detection beyond the dynamic range of even very sensitive quantification methods like ddPCR.

Table 1:

Abundances of selected OTU from each Domain, as determined by quantitative PCR.

CaptureSeq provides a quantitatively accurate view of bacterial abundance

A synthetic community of 20 microorganisms spiked into carrier DNA from a seed wash facilitated a quantitative examination of microbial community profiles generated using CaptureSeq. Quantification of cpn60 DNA from the synthetic community before and after hybridization using qPCR revealed an enrichment of 3-4 orders of magnitude for cpn60-containing DNA fragments compared to 16S rRNA-encoding genes (Supplemental Figure S3). For the 5 microorganisms that were quantified, the ∼10-fold reduction in gene copy number observed between the high, medium, and low spike levels was consistent with the starting composition of the synthetic community samples (Supplemental Figure S3). Furthermore, the number of cpn60 gene copies for the microorganisms added to the seed wash DNA extract was highly reproducible within each spike level across the 1000-fold difference analyzed (Supplemental Figure S3). Across the different spiking levels, there was a linear correlation between qPCR-determined input gene copies and the number of sequencing reads observed for each of the five targets using the CaptureSeq method, providing Pearson correlation coefficients (r²) ranging from 0.995-1.000. This compared to a range of 0.532-0.878 for libraries profiled by amplicon sequencing, with more apparent distortion at the higher spike levels when targets were the most abundant (Figure 4).

Figure 4:

The correlation between input cpn60 gene copies quantified by species-specific quantitative PCR and the number of mapped sequencing reads was determined for 5 bacteria from the synthetic community.
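The correlation reported for each target can be reproduced in outline with a plain Pearson calculation between input gene copies and mapped reads. The sketch below is an assumption about the calculation, not the authors' script, and the three data points are invented purely to show the call; they are not values from this study.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length series."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two series of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("series must not be constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical values for one target at low, medium and high spike levels.
input_copies = [1e6, 1e7, 1e8]   # qPCR-determined gene copies (made up)
mapped_reads = [95, 1010, 9900]  # sequencing reads (made up)

r = pearson_r(input_copies, mapped_reads)
print("r =", r, "r^2 =", r * r)
```

With only three spike levels per target, r² is dominated by the highest level, so compression of abundant targets (as described for the amplicon libraries) lowers it sharply.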

While all 20 bacteria from the synthetic community were identified using both amplicon and CaptureSeq profiling techniques, only the CaptureSeq method generated profiles that accurately reflected the relative amounts of DNA spiked into the seed wash background (Figure 5 and Supplemental Table S2). In the CaptureSeq libraries, the number of mapped sequencing reads for each member of the synthetic community was within one order of magnitude from the mean for each spike level. In the amplicon libraries however, the cpn60 sequences of Bifidobacterium infantis and Bifidobacterium bifidum, which feature a high G/C content, were over 10- and 100-fold lower than the mean for both the High and Medium spiked samples (Supplemental Figure S4). This improved representation of high G/C Actinobacteria by CaptureSeq was also apparent in the microbial community profiles generated for the soil samples. Compared to the CaptureSeq libraries, the cpn60 sequences of the 25 most under-represented taxonomic clusters in the amplicon libraries had very high G/C content (64-71%) and included several members of the genera Nocardioides, Marmoricola and Pseudonocardia (Supplemental Table S3).

Figure 5:

Sequencing read abundance for seed wash samples that were spiked with a synthetic community of 20 bacteria in 10-fold decreasing dilutions and profiled using UT amplification or CaptureSeq. The color scale represents the log₁₀ read abundance in the sequencing library.

De novo assembly of the mapped sequencing reads for each microorganism from the synthetic panel for both amplicon and CaptureSeq libraries generated OTU that were >99% identical to the known cpn60 sequences.

DISCUSSION

Targeted capture of cpn60 gene fragments resulted in an approximately 200-fold enrichment of the soil samples for the taxonomic marker of interest, from under 0.1% of reads in the shotgun metagenomic sequencing to over 15% of reads in the CaptureSeq datasets. This level of enrichment enabled very deep sampling of the soil microbial communities (similar to that attained using PCR-based enrichment) with far less sequencing data (i.e. a significant cost savings). This is of particular importance when the organisms of interest are very low in abundance, such as Microbacterium sp. C448 in this study. OTU were observed in the CaptureSeq datasets that were present at extremely low levels in the soil genomic DNA, near or below the detection limit for ddPCR. Based on the assay setup and dilution factors we used, the theoretical ddPCR detection limit was 3570 copies/g soil, assuming detection of 10 copies per assay³³. Although increased sequencing effort can result in more complete coverage of complex microbial communities using shotgun metagenomic sequencing^15,21, application of this method to investigate the taxonomic composition of a sample is not an efficient use of budgetary resources. In addition, CaptureSeq provided a balanced view of the relative abundances of microorganisms within the community. PCR-associated representational bias, which presents a skewed representation of microbial taxon abundance ³⁴, is a well-known phenomenon^35–37, and is likely the result of using end-point PCR product to generate the sequencing library as the exponential accumulation of amplicon serves to compress the dynamic range of relative DNA abundance in the end product of the reaction. CaptureSeq also resulted in an improvement of the representation of high G/C content microorganisms compared to amplification. 
Difficulty in amplification of high G/C content targets is a phenomenon that has been previously observed using both 16S and cpn60 taxonomic markers from mixed communities ^26,38. De novo assembly of taxonomic clusters from the CaptureSeq datasets into OTU for which probes were not explicitly designed, such as Microbacterium sp. C448, also suggests that off-target cpn60 sequence capture can expand the breadth of OTU observed in the dataset beyond the sequences represented in the probe array and can include sequences that have not been previously observed. While CaptureSeq may be biased by the probe sequences employed, it is clearly capable of detecting novel microbes, expanding the breadth of microorganisms that are included in the microbial community profile beyond microbes that have been previously identified.

The overall patterns of OTU abundances in each of the three methods showed that the amplicon-based method provided a pattern that was distinct from the patterns observed for both CaptureSeq and shotgun metagenomic sequencing, which were more similar to one another. While the three methods all provided discernably different overall community profiles, the difference observed in the relative abundances of microorganisms was likely the result of different biases inherent in each of the methods. The over-representation in the amplicon datasets of several of the microorganisms that were very rare in the metagenomic and CaptureSeq libraries was likely the result of amplification effects on the relative abundances of microorganisms ^16,39. PCR amplification also introduced a higher experimental error in various alpha diversity parameters (Chao1, Shannon, Simpson) among the biological replicates analyzed compared to CaptureSeq and shotgun metagenomic sequencing. This observation is consistent with previous studies using 16S rRNA amplicon profiles of soil communities ^16,40. Among the three methods, CaptureSeq displayed the lowest inter-sample variation for these diversity parameters. CaptureSeq therefore has the potential to improve insight into microbial community dynamics by reducing experimental variability, and thereby improving reproducibility, compared to both amplicon-based and shotgun metagenomic sequencing. The consistency in alpha diversity calculations is likely a reflection of the reduced biases inherent in the CaptureSeq protocol and facilitates making meaningful conclusions about community richness and diversity.

The cpn60 taxonomic marker enables de novo assembly of OTU ^23,25 providing greater discrimination between closely related microorganisms and facilitating OTU-specific assay design. The cpn60-based CaptureSeq approach generates assembled chaperonin sequences that may also include regions flanking the sequence amplified by the universal primers, as observed with the OTU over 1 kb in length generated for Microbacterium sp. C448 and Methanoculleus marisnigri in this study. This additional sequencing information can provide further taxonomic discrimination of many prokaryotes, especially if the assembled region includes the cpn10 co-chaperonin that is adjacent to cpn60 in many bacterial genomes ⁴¹. The OTU that were de novo assembled provided suitable targets for ddPCR, facilitating the enumeration of targeted microorganisms from each Domain, which had initially been identified by sequencing and assembly. Such an approach can be used to identify biological interactions between/among microorganisms that can explain their relative abundance patterns ²³.

Both CaptureSeq and shotgun metagenomic sequencing provided the means to identify OTU from all Domains simultaneously, facilitating the characterization of inter-Domain relationships among microorganisms. The ability to calculate the abundances of organisms as a proportion of the entire pan-Domain community facilitates the identification of inter-Domain relationships and syntrophies. This is of particular importance in many settings (e.g. manure or gut health) in identifying the syntrophic relationships between volatile fatty acid producing Bacteria and methanogenic Archaea ⁴². In soil, the complex relationship between saprophytic Fungi and Bacteria is critical to examining the role of the microbiome in nutrient cycling ⁴³. Similarly in the terrestrial pond, the bacterial and eukaryotic components of the microbial ecosystems can be directly compared numerically, which may allow insights into inter-Domain relationships that impact elemental cycles or other ecosystem services. This advantage is not offered using amplification of universal targets, although it does provide the benefit of very deep coverage of complex microbial communities. Shotgun metagenomic genome sequencing does not provide the community coverage of either the amplicon-based or CaptureSeq methods at a similar sequencing effort, suggesting that complex microbiomes will likely require additional phylogenetic data to make any informed examination of microbial diversity metrics. CaptureSeq enabled deep coverage of complex microbial communities, although the community representation is naturally biased by the hybridization probes used. However, we observed off-target hybridization, as evidenced by the appearance of cpn60 OTU in the CaptureSeq datasets. Optimizing the hybridization parameters may result in further improvements to the enrichment of taxonomic markers in complex templates, increasing the efficiency of this approach to microbial community profiling. 
Shotgun metagenomics can reasonably be considered the least biased means of determining the taxonomic composition of an environmental sample, and may be a suitable choice when sufficient sequencing resources are available. However the abiding popularity of amplicon-based profiling is at least partially a result of the high degree of enrichment of taxonomically informative sequence reads that it generates. CaptureSeq provides an alternative that avoids the amplification biases associated with PCR while retaining the sequencing efficiency of amplicon-based profiling.

Molecular microbial community profiling is one of the foundational steps in exploring microbiome structure-function relationships in an experimental system ^44–46. To generate and evaluate scientific hypotheses it is critical to generate a microbiome profile that reflects the natural state as closely as possible with sufficient sensitivity to evaluate both abundant and rare microorganisms. The cpn60-based method described herein permits taxonomically broad and deep microbial community profiling of complex microbiomes. Thus CaptureSeq has the potential to impact life sciences research wherever microbes are thought to be important, including human health and nutrition ⁴⁷, agriculture ⁴⁸, biotechnology ⁴⁹, and environmental sciences ⁵⁰. Several methodologies are available for microbial community profiling, including 16S and ITS amplification and sequencing, as well as profiling using 16S rRNA-based capture probes ³⁰. While all microbial community profiling techniques have inherent limitations and biases, compared to shotgun metagenomic and universal target amplification, CaptureSeq is a suitable alternative that provides quantitative, pan-Domain analysis of complex communities.

MATERIALS AND METHODS

Soil sample preparation

Soil samples were obtained from a long-term study initiated in 1999 evaluating the effect of annual antibiotic exposure on soil microbial communities, described in Cleary et al. ⁵¹. Soil samples evaluated in the present study were obtained in 2013 following 15 sequential annual applications of a mixture of sulfamethazine, chlortetracycline and tylosin, each added at 10 mg kg^-1 soil. Soil was sampled 30 days after the spring application of antibiotics. The plots were planted with soybeans (Glycine max, v. Harosoy) immediately after incorporation of the antibiotics. One triplicate group of plots had experienced no antibiotic treatment, and the other triplicate set had received yearly antibiotic treatments since 1999 as described ⁵¹. Genomic DNA was extracted from 3.5 g of each soil sample using the PowerMax Soil DNA isolation kit (Mo-Bio Laboratories, Carlsbad, CA) with a 5 mL elution volume. DNA extracts were quantified using a Qubit fluorimeter (Thermo Fisher Scientific, Waltham, MA, USA) and stored at −80°C until processing and analysis.

Terrestrial pond sample preparation

A water sample was obtained from a pond located on a Saskatchewan farm (51.99°N, 106.46°W) on May 13, 2016. Biological material was recovered from 2 L of water by centrifugation at 20,000 × g for 20 minutes. Total DNA was extracted using a PowerWater DNA extraction kit (Mo-Bio Laboratories, Carlsbad, CA) and quantified as described above.

Seed wash carrier DNA preparation

Genomic DNA to act as carrier DNA for spiking 10-fold decreasing amounts of a synthetic community was generated by washing wheat seeds as previously described ²³, and is known to lack all of the microorganisms comprising the synthetic community panel ²³.

Synthetic community sample preparation

Amplicons corresponding to the cpn60 UT of 20 bacteria associated with the human vaginal tract ²⁵ were cloned into the pGEM-T Easy plasmid (Promega, WI, USA) and purified using the Qiagen Miniprep kit (Qiagen, CA, USA). The synthetic community was formed by combining equimolar concentrations of plasmids containing the cpn60 UT for all 20 microorganisms ²⁵. Dilutions of this mixture (corresponding to 0.4, 0.04, and 0.004 ng plasmid DNA, or approximately 10⁸, 10⁷, and 10⁶ copies of each plasmid) were spiked into a background of 10 ng/μl of wheat seed carrier DNA. Spiked genomic DNA samples prepared in this way were sequenced using cpn60 universal target amplification and CaptureSeq as described below.
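The correspondence between plasmid mass and copy number quoted above can be checked with the usual mass-to-copies conversion. The sketch below assumes an average mass of 650 g/mol per base pair of double-stranded DNA and a total construct length of about 3.6 kb (vector plus cpn60 UT insert). Neither figure is given in the text, so both are labeled assumptions in the code.

```python
AVOGADRO = 6.02214076e23          # molecules per mole
AVG_MASS_PER_BP = 650.0           # g/mol per bp of dsDNA (common approximation)
ASSUMED_PLASMID_LENGTH_BP = 3600  # assumption: vector + insert, not from the text


def plasmid_copies(mass_ng: float, length_bp: int) -> float:
    """Approximate number of plasmid molecules in a given mass of DNA."""
    if mass_ng < 0 or length_bp <= 0:
        raise ValueError("mass must be >= 0 and length must be > 0")
    grams = mass_ng * 1e-9
    moles = grams / (length_bp * AVG_MASS_PER_BP)
    return moles * AVOGADRO


# Spike levels stated in the text (ng plasmid DNA).
for mass_ng in (0.4, 0.04, 0.004):
    copies = plasmid_copies(mass_ng, ASSUMED_PLASMID_LENGTH_BP)
    print(f"{mass_ng} ng -> {copies:.2e} copies")
```

Under these assumptions 0.4 ng corresponds to roughly 10⁸ copies, in line with the values stated, and each 10-fold dilution reduces the copy number by the same factor.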

The efficacy of the CaptureSeq hybridization was assessed prior to sequencing using quantitative PCR (qPCR) targeting plasmids added to the seed wash background. qPCR primers and amplification conditions were as described previously ⁵². Total bacteria were enumerated using qPCR targeting the 16S ribosomal RNA-encoding gene as described previously ⁵³.

Amplicon-based sequencing

The cpn60 UT was amplified from synthetic community-spiked DNA or soil genomic DNA samples using 40 cycles of PCR with the type I chaperonin universal primer cocktail containing a 1:3 ratio of H279/H280:H1612/H1613 ²⁶. Cycling conditions were 1× 95°C for 5 min; 40× (95°C for 30 sec, 42-60°C for 30 sec, 72°C for 30 sec); and 1× 72°C for 2 min. Replicate reactions from each annealing temperature for each sample were pooled and gel purified using the Blue Pippin Prep system (Sage Science, MA, USA) with a 2% agarose cassette, and concentrated using Amicon 30K 0.5 ml spin columns (EMD Millipore, MA, USA). Amplicons from all samples were prepared for sequencing using the NEBNext Illumina library preparation kit (New England Biolabs, MA, USA), and sequenced with 400 forward cycles of v2 MiSeq chemistry.

CaptureSeq array design

Capture probes were designed based on all type I and type II chaperonin sequences in the public domain (i.e. cpnDB; www.cpndb.ca) ⁸. A total of 15,733 probes were designed to be complementary to the type I and type II chaperonin sequences. Probes were generated by extracting 120 bp sequences from the reference database with a 60 bp step, so that each probe overlaps the next by 50% in a tiling-like fashion. The custom oligos were bound to magnetic beads in equimolar concentration as a custom MYbaits array by MYcroarray (Ann Arbor, MI, USA).
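The tiling scheme can be illustrated with a short sketch. This is a minimal example, not the design pipeline used for the array: the function name is hypothetical, the sequence is a toy string, and the handling of trailing bases shorter than one probe, strand orientation, and removal of redundant probes are not described in the text and are left out.

```python
def tile_probes(sequence: str, probe_len: int = 120, step: int = 60) -> list[str]:
    """Return windows of probe_len bases starting every `step` bases.

    Trailing bases that cannot fill a complete window are ignored
    (an assumption; the source does not say how sequence ends were handled).
    """
    last_start = len(sequence) - probe_len
    return [sequence[i:i + probe_len] for i in range(0, last_start + 1, step)]


# Toy 300-base sequence (hypothetical, for illustration only).
toy_reference = "ACGT" * 75
probes = tile_probes(toy_reference)

print(len(probes))                        # windows start at 0, 60, 120 and 180
print(probes[0][60:] == probes[1][:60])   # consecutive probes share 60 bases
```

With a 60 bp step and 120 bp probes, every internal base of a reference sequence is covered by two probes.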

Shotgun metagenomic sequencing and CaptureSeq preparation

Genomic DNA from each of the soil samples was diluted to 2.5 ng/μl and split into two aliquots of 100 μl each for shearing using a water bath sonicator as described ⁵⁴. Shotgun metagenomic sequencing libraries were prepared directly from one aliquot of each sheared genomic DNA sample using the NEBNext Illumina library preparation kit according to the manufacturer’s directions (New England Biolabs, MA, USA). Samples were then sequenced with 2×250 bp cycles of v2 MiSeq chemistry (Illumina, CA, USA).

To generate the CaptureSeq libraries, the second aliquots of sheared genomic DNA samples were subjected to end repair and index addition using NEBNext as above, then hybridized to the capture probe array as described ⁵⁴. The chaperonin-enriched products were then sequenced with 2×250 bp cycles of v2 MiSeq chemistry (Illumina, CA, USA).

Sequencing analysis

To compare the number of output sequencing reads for the different spiking levels, sequencing reads from the synthetic community-spiked samples were down-sampled to the smallest library size for each profiling technique (30,091 for amplicon and 506,247 for CaptureSeq) and mapped to a reference set of cpn60 UT sequences for the 20 microorganisms in the panel by local paired alignment using bowtie2 (v. 2.2.3) ⁵⁵.
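Down-sampling to the smallest library size can be sketched as follows. This is an illustrative example under stated assumptions: the text does not name the tool used for subsampling, the function names and read identifiers are hypothetical, and read pairs are assumed to be drawn at random without replacement.

```python
import random


def downsample(read_ids, depth, seed=0):
    """Randomly keep `depth` read pairs without replacement."""
    read_ids = list(read_ids)
    if depth > len(read_ids):
        raise ValueError("depth exceeds library size")
    return random.Random(seed).sample(read_ids, depth)


def downsample_to_smallest(libraries, seed=0):
    """Down-sample every library to the size of the smallest one."""
    depth = min(len(reads) for reads in libraries.values())
    return {name: downsample(reads, depth, seed) for name, reads in libraries.items()}


# Hypothetical libraries identified by read-pair names.
libraries = {
    "spike_high": [f"pair_{i}" for i in range(1000)],
    "spike_low": [f"pair_{i}" for i in range(400)],
}
equalized = downsample_to_smallest(libraries)
print({name: len(reads) for name, reads in equalized.items()})
```

Fixing the random seed makes the subsample reproducible, which is convenient when the same libraries are compared across several analyses.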

A reference database of all publicly available chaperonin sequences was generated by selecting a list of seven chaperonin protein sequences representing each taxonomic group: fungi, bacteria, archaea, plant mitochondria, plant chloroplast, and animal mitochondria. These sequences were used as queries for a BLAST search of GenBank using blastp with default parameters. Matching protein sequences were manually vetted to generate a list of 30,141 protein identifiers. These protein identifiers were then used to retrieve the corresponding 30,120 nucleotide sequences available in GenBank according to the procedure described in Supplemental Information. The accession numbers of those nucleotide sequences are provided in Supplemental Dataset S2. The breadth of taxa retrieved by this method was similar to the taxonomic breadth represented in the 16S and ITS reference datasets (Supplemental Dataset S3). Sequencing reads from all soil samples were grouped into taxonomic clusters by paired local alignment to this reference set of chaperonin genes using bowtie2. The sequencing libraries were down-sampled to the size of the smallest shotgun metagenomic library (2,777 mapped paired reads), and the relative abundances of the resulting taxonomic clusters were used as the basis for assessing the alpha and beta diversity metrics of the three profiling methods for equivalent sampling effort.

De novo OTU assembly and quantification

Read pairs from target taxonomic clusters were assembled de novo into cpn60 OTU using Trinity (v. 2.4.0) with a k-mer size of 31. OTU-specific primer and hydrolysis probe sets were designed using Primer3 ⁵⁶ or Beacon Designer (v.7) (Premier Biosoft, Palo Alto, CA, USA) as described previously ⁵⁷. Annealing temperatures were optimized for each reaction using gradient PCR with ddPCR Supermix for Probes (Bio-Rad, Mississauga, ON, Canada) using 900 nM of each primer and 250 nM of hydrolysis probe in a 20 μl reaction volume. Primer/probe sequences and optimized amplification conditions are shown in Supplemental Table S1. Template DNA was digested prior to amplification using EcoRI at 37°C for 60 minutes. A final volume of 2-5 μl was used as template for droplet digital PCR (ddPCR). Emulsions were formed using a QX100 droplet generator (Bio-Rad, Hercules, CA, USA), and amplifications were carried out using a C1000 Touch thermocycler (Bio-Rad). Reactions were analyzed using a QX100 droplet reader (Bio-Rad) and quantified using QuantaSoft (v.1.6.6) (Bio-Rad). Results were converted to copy number/g soil extracted by accounting for sample preparation and dilution. For the prepared CaptureSeq libraries, results were converted to copy number/μl by considering dilution factors.
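The conversion from a ddPCR reading to copy number per gram of soil can be sketched as a chain of volume corrections. The example below is a minimal illustration, not the authors' calculation: it assumes the instrument reports copies per μl of reaction, treats any dilution introduced by the restriction digest as a single user-supplied factor, and uses the 20 μl reaction volume, 5 mL elution volume and 3.5 g soil mass given in the text. The function name and the input values are hypothetical.

```python
def copies_per_gram_soil(
    conc_copies_per_ul_reaction: float,
    template_ul: float,
    reaction_ul: float = 20.0,
    dilution_factor: float = 1.0,   # assumed fold-dilution of the extract before loading
    elution_ul: float = 5000.0,     # 5 mL elution volume
    soil_g: float = 3.5,
) -> float:
    """Scale a ddPCR concentration back to copies per gram of extracted soil."""
    copies_in_reaction = conc_copies_per_ul_reaction * reaction_ul
    copies_per_ul_template = copies_in_reaction / template_ul
    copies_per_ul_extract = copies_per_ul_template * dilution_factor
    return copies_per_ul_extract * elution_ul / soil_g


# Hypothetical reading: 100 copies/ul of reaction from 2 ul of undiluted template.
print(f"{copies_per_gram_soil(100.0, template_ul=2.0):.3e} copies per g soil")
```

For prepared libraries, the same chain stops at `copies_per_ul_extract`, which gives copy number per μl.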

Alpha diversity analysis

To compare the richness and diversity metrics between the three profiling techniques, mapped sequencing reads were down-sampled to depths ranging from 250 to 2,750 reads to simulate a uniform sampling effort across profiling techniques. Metrics were averaged across 100 bootstrapped datasets using the multiple_rarefactions.py and alpha_diversity.py scripts from QIIME (v. 1.8.0) ⁵⁸.

Where comparisons across estimates of community coverage required the total sequencing effort to be taken into account, read thresholds were transformed to reflect the total sequencing effort for each sample.
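The repeated-subsampling procedure can be sketched in plain Python. This is an illustration of the general technique rather than a reimplementation of the QIIME scripts: reads are drawn without replacement, observed richness and a base-2 Shannon index stand in for the metrics, and the depth step of 500 and the toy abundance table are assumptions.

```python
import math
import random
from collections import Counter


def rarefy(counts, depth, rng):
    """Draw `depth` reads without replacement from a {cluster: count} table."""
    pool = [cluster for cluster, n in counts.items() for _ in range(n)]
    if depth > len(pool):
        raise ValueError("depth exceeds the number of mapped reads")
    return Counter(rng.sample(pool, depth))


def shannon(counts):
    """Shannon index in bits (log base 2)."""
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values() if n > 0)


def mean_alpha(counts, depth, n_iter=100, seed=0):
    """Average observed richness and Shannon index over repeated subsamples."""
    rng = random.Random(seed)
    richness, diversity = [], []
    for _ in range(n_iter):
        sub = rarefy(counts, depth, rng)
        richness.append(len(sub))
        diversity.append(shannon(sub))
    return sum(richness) / n_iter, sum(diversity) / n_iter


# Hypothetical toy table: four equally abundant taxonomic clusters.
toy = {"cluster_a": 1000, "cluster_b": 1000, "cluster_c": 1000, "cluster_d": 1000}
for depth in range(250, 2751, 500):
    print(depth, mean_alpha(toy, depth))
```

Plotting the averaged metrics against depth gives a rarefaction curve for each profiling technique at equivalent sampling effort.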

Beta diversity analysis

To compare the community similarity between different sequencing methods, mapped sequencing reads were down-sampled to the size of the smallest metagenomic library sample (2,777 mapped reads). For intra-technique comparisons, mapped sequencing reads were down-sampled to the smallest library size within each profiling method: 2,777 reads for metagenomic, 127,642 for CaptureSeq, and 27,388 for amplicon libraries. Principal coordinate analysis (PCoA) of inter- and intra-technique Bray-Curtis distances was performed using the vegan package (v. 2.4.2) in R (v. 3.2.4).
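The distance underlying the ordination can be illustrated with a small sketch. It covers only the Bray-Curtis step, computed as the sum of absolute count differences divided by the total count across both samples; the ordination itself is not shown. The analysis in the text was run with vegan in R, so this Python version, its function names and its toy counts are purely illustrative.

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two {cluster: count} tables."""
    clusters = set(a) | set(b)
    difference = sum(abs(a.get(c, 0) - b.get(c, 0)) for c in clusters)
    total = sum(a.values()) + sum(b.values())
    return difference / total if total else 0.0


def pairwise_bray_curtis(samples):
    """Return {(name_i, name_j): dissimilarity} for every ordered pair of samples."""
    names = sorted(samples)
    return {(i, j): bray_curtis(samples[i], samples[j]) for i in names for j in names}


# Hypothetical down-sampled count tables for three samples.
samples = {
    "amplicon": {"cluster_a": 6, "cluster_b": 4},
    "capture": {"cluster_a": 4, "cluster_b": 6},
    "shotgun": {"cluster_c": 10},
}
for pair, distance in pairwise_bray_curtis(samples).items():
    print(pair, round(distance, 3))
```

Because the measure depends on raw counts, it is meaningful only when the compared libraries have been down-sampled to the same depth, as described above.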

Authors’ contributions

ET, SH, TD and AC performed collection, processing and sequencing of all samples. ML, LM and JT performed bioinformatics analysis of sequencing data. All authors contributed to writing the manuscript.

Competing interests

The author(s) declare no competing financial or non-financial interests.

Acknowledgements

This work was funded through Agriculture and Agri-Food Canada A-base project 1562: Optimizing soil health and protecting environmental quality through judicious manure management, and innovative cover cropping.

References

[1] Woese, C. R. & Fox, G. E. Phylogenetic structure of the prokaryotic domain: The primary kingdoms. Proc. Natl. Acad. Sci. U.S.A. 74, 5088–5090, doi:10.1073/pnas.74.11.5088 (1977).
[2] Woese, C. R., Kandler, O. & Wheelis, M. L. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc. Natl. Acad. Sci. U.S.A. 87, 4576–4579, doi:10.1073/pnas.87.12.4576 (1990).
[3] Tikhonovich, I. A. & Provorov, N. A. Microbiology is the basis of sustainable agriculture: An opinion. Ann. Appl. Biol. 159, 155–168, doi:10.1111/j.1744-7348.2011.00489.x (2011).
[4] Staley, J. T. & Konopka, A. Measurement of in situ activities of nonphotosynthetic microorganisms in aquatic and terrestrial habitats. Annu Rev Microbiol 39, 321–346, doi:10.1146/annurev.mi.39.100185.001541 (1985).
[5] Weller, R. & Ward, D. M. Selective recovery of 16S rRNA sequences from natural microbial communities in the form of cDNA. Appl. Environ. Microbiol. 55, 1818–1822 (1989).
[6] Hebert, P. D. N., Cywinska, A., Ball, S. L. & deWaard, J. R. Biological identifications through DNA barcodes. Proc R Soc Lond [Biol] 270, 313–321, doi:10.1098/rspb.2002.2218 (2003).
[7] Singer, E. et al. High-resolution phylogenetic microbial community profiling. ISME J 10, 2020–2032, doi:10.1038/ismej.2015.249 (2016).
[8] Hill, J. E., Penny, S. L., Crowell, K. G., Goh, S. H. & Hemmingsen, S. M. cpnDB: A chaperonin sequence database. Genome Res. 14, 1669–1675 (2004).
[9] Adékambi, T., Drancourt, M. & Raoult, D. The rpoB gene as a tool for clinical microbiologists. Trends Microbiol. 17, 37–45 (2009).
[10] Barret, M. et al. Identification of Methanoculleus spp. as active methanogens during anoxic incubations of swine manure storage tank samples. Appl. Environ. Microbiol. 79, 424–433 (2013).
[11] Schoch, C. L. et al. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc. Natl. Acad. Sci. U.S.A. 109, 6241–6246, doi:10.1073/pnas.1117018109 (2012).
[12] Barret, M. et al. Emergence shapes the structure of the seed-microbiota. Appl. Environ. Microbiol. 81, 1257–1266 (2015).
[13] Walker, A. W. et al. 16S rRNA gene-based profiling of the human infant gut microbiota is strongly influenced by sample processing and PCR primer choice. Microbiome 3, 26, doi:10.1186/s40168-015-0087-4 (2015).
[14] Guo, J., Cole, J. R., Zhang, Q., Brown, C. T. & Tiedje, J. M. Microbial community analysis with ribosomal gene fragments from shotgun metagenomes. Appl. Environ. Microbiol. 82, 157–166, doi:10.1128/aem.02772-15 (2016).
[15] Lynch, M. D. J. & Neufeld, J. D. Ecology and exploration of the rare biosphere. Nat Rev Micro 13, 217–229, doi:10.1038/nrmicro3400 (2015).
[16] Poretsky, R., Rodriguez-R, L. M., Luo, C., Tsementzi, D. & Konstantinidis, K. T. Strengths and limitations of 16S rRNA gene amplicon sequencing in revealing temporal microbial community dynamics. PLOS ONE 9, e93827, doi:10.1371/journal.pone.0093827 (2014).
[17] Hess, M. et al. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science 331, 463–467 (2011).
[18] Raymond, F. et al. The initial state of the human gut microbiome determines its reshaping by antibiotics. ISME J 10, 707–720, doi:10.1038/ismej.2015.148 (2016).
[19] Handley, K. M. et al. Biostimulation induces syntrophic interactions that impact C, S and N cycling in a sediment microbial community. ISME J. 7, 800–816, doi:10.1038/ismej.2012.148 (2013).
[20] Fierer, N. et al. Cross-biome metagenomic analyses of soil microbial communities and their functional attributes. Proc. Natl. Acad. Sci. U.S.A. 109, 21390–21395, doi:10.1073/pnas.1215210110 (2012).
[21] Luo, C. et al. Soil microbial community responses to a decade of warming as revealed by comparative metagenomics. Appl. Environ. Microbiol. 80, 1777–1786, doi:10.1128/aem.03712-13 (2014).
[22] Chaban, B. & Hill, J. E. A ‘universal’ type II chaperonin PCR detection system for the investigation of Archaea in complex microbial communities. ISME J. 6, 430–439 (2012).
[23] Links, M. G. et al. Simultaneous profiling of seed-associated bacteria and fungi reveals antagonistic interactions between microorganisms within a shared epiphytic microbiome on Triticum and Brassica seeds. New Phytol. 202, 542–553, doi:10.1111/nph.12693 (2014).
[24] Links, M. G., Dumonceaux, T. J., Hemmingsen, S. M. & Hill, J. E. The chaperonin-60 universal target is a barcode for bacteria that enables de novo assembly of metagenomic sequence data. PLoS One 7, e49755, doi:10.1371/journal.pone.0049755 (2012).
[25] Links, M. G., Chaban, B., Hemmingsen, S., Muirhead, K. & Hill, J. mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences. Microbiome 1, 23 (2013).
[26] Hill, J. E., Town, J. R. & Hemmingsen, S. M. Improved template representation in cpn60 polymerase chain reaction (PCR) product libraries generated from complex templates by application of a specific mixture of PCR primers. Environ. Microbiol. 8, 741–746, doi:10.1111/j.1462-2920.2005.00944.x (2006).
[27] Dumonceaux, T. J., Hill, J. E., Hemmingsen, S. M. & Van Kessel, A. G. Characterization of intestinal microbiota and response to dietary virginiamycin supplementation in the broiler chicken. Appl. Environ. Microbiol. 72, 2815–2823 (2006).
[28] Schuenemann, V. J. et al. Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death. Proc. Natl. Acad. Sci. U.S.A. 108, E746–E752, doi:10.1073/pnas.1105107108 (2011).
[29] Wagner, D. M. et al. Yersinia pestis and the Plague of Justinian 541-543 AD: a genomic analysis. Lancet Infect Dis (2014).
[30] Gasc, C. & Peyret, P. Hybridization capture reveals microbial diversity missed using current profiling methods. Microbiome 6, 61, doi:10.1186/s40168-018-0442-3 (2018).
[31] Topp, E. et al. Accelerated biodegradation of veterinary antibiotics in agricultural soil following long-term exposure, and isolation of a sulfamethazine-degrading Microbacterium sp. J. Environ. Qual. 42, 173–178, doi:10.2134/jeq2012.0162 (2013).
[32] Martin-Laurent, F., Marti, R., Waglechner, N., Wright, G. D. & Topp, E. Draft Genome Sequence of the Sulfonamide Antibiotic-Degrading Microbacterium sp. Strain C448. Genome Announcements 2, e01113–01113, doi:10.1128/genomeA.01113-13 (2014).
[33] Bustin, S. A. et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem 55, 611–622 (2009).
[34] Props, R. et al. Absolute quantification of microbial taxon abundances. ISME J, doi:10.1038/ismej.2016.117 (2016).
[35] Johnson, L. A., Chaban, B., Harding, J. C. & Hill, J. E. Optimizing a PCR protocol for cpn60-based microbiome profiling of samples variously contaminated with host genomic DNA. BMC research notes 8, 253, doi:10.1186/s13104-015-1170-4 (2015).
[36] Green, S. J., Venkatramanan, R. & Naqib, A. Deconstructing the polymerase chain reaction: Understanding and correcting bias associated with primer degeneracies and primer-template mismatches. PLoS ONE 10, doi:10.1371/journal.pone.0128122 (2015).
[37] Lee, C. K. et al. Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing. PLoS ONE 7, doi:10.1371/journal.pone.0044224 (2012).
[38] Pinto, A. J. & Raskin, L. PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets. PLOS ONE 7, e43093, doi:10.1371/journal.pone.0043093 (2012).
[39] Logares, R. et al. Metagenomic 16S rDNA Illumina tags are a powerful alternative to amplicon sequencing to explore diversity and structure of microbial communities. Environ. Microbiol. 16, 2659–2671, doi:10.1111/1462-2920.12250 (2014).
[40] Ranjan, R., Rani, A., Metwally, A., McGee, H. S. & Perkins, D. L. Analysis of the microbiome: Advantages of whole genome shotgun versus 16S amplicon sequencing. Biochem. Biophys. Res. Commun. 469, 967–977, doi:10.1016/j.bbrc.2015.12.083 (2016).
[41] Chaban, B., Links, M. & Hill, J. A molecular enrichment strategy based on cpn60 for detection of Epsilon-Proteobacteria in the dog fecal microbiome. Microb. Ecol. 63, 348–357, doi:10.1007/s00248-011-9931-7 (2012).
[42] Demirel, B. & Scherer, P. The roles of acetotrophic and hydrogenotrophic methanogens during anaerobic conversion of biomass to methane: a review. Rev. Environ. Sci. Biotechnol. 7, 173–190 (2008).
[43] de Menezes, A. B., Richardson, A. E. & Thrall, P. H. Linking fungal–bacterial co-occurrences to soil ecosystem function. Curr. Opin. Microbiol. 37, 135–141, doi:10.1016/j.mib.2017.06.006 (2017).
[44] Carballa, M., Regueiro, L. & Lema, J. M. Microbial management of anaerobic digestion: exploiting the microbiome-functionality nexus. Curr. Opin. Biotechnol. 33, 103–111, doi:10.1016/j.copbio.2015.01.008 (2015).
[45] Gopal, M. & Gupta, A. Microbiome selection could spur next-generation plant breeding strategies. Frontiers in microbiology 7, doi:10.3389/fmicb.2016.01971 (2016).
[46] Muegge, B. D. et al. Diet drives convergence in gut microbiome functions across mammalian phylogeny and within humans. Science 332, 970–974, doi:10.1126/science.1198719 (2011).
[47] Kau, A. L., Ahern, P. P., Griffin, N. W., Goodman, A. L. & Gordon, J. I. Human nutrition, the gut microbiome and the immune system. Nature 474, 327–336, doi:10.1038/nature10213 (2011).
[48] Busby, P. E. et al. Research priorities for harnessing plant microbiomes in sustainable agriculture. PLOS Biology 15, e2001793, doi:10.1371/journal.pbio.2001793 (2017).
[49] Koch, C., Müller, S., Harms, H. & Harnisch, F. Microbiomes in bioenergy production: From analysis to management. Curr. Opin. Biotechnol. 27, 65–72, doi:10.1016/j.copbio.2013.11.006 (2014).
[50] Fierer, N. Embracing the unknown: disentangling the complexities of the soil microbiome. Nature reviews. Microbiology (2017).
[51] Cleary, D. W. et al. Long-term antibiotic exposure in soil is associated with changes in microbial community structure and prevalence of class 1 integrons. FEMS Microbiol Ecol 92, doi:10.1093/femsec/fiw159 (2016).
[52] Dumonceaux, T. J. et al. Multiplex detection of bacteria associated with normal microbiota and with bacterial vaginosis in vaginal swabs by use of oligonucleotide-coupled fluorescent microspheres. J Clin Microbiol 47, 4067–4077, doi:10.1128/jcm.00112-09 (2009).
[53] Lee, D. H., Zo, Y. G. & Kim, S. J. Nonradioactive method to study genetic profiles of natural bacterial communities by PCR-single-strand-conformation polymorphism. Appl. Environ. Microbiol. 62, 3112–3120 (1996).
[54] Dumonceaux, T. J., Links, M. G., Town, J. R., Hill, J. E. & Hemmingsen, S. M. Targeted capture of cpn60 gene fragments for PCR-independent microbial community profiling. Protoc exch (2017).
[55] Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10, R25 (2009).
[56] Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods in molecular biology (Clifton, N.J.) 132, 365–386 (2000).
[57] Pérez-López, E., Hammond, C., Olivier, C. Y. & Dumonceaux, T. J. in Diagnostic Bacteriology Vol. 1616 Methods in Molecular Biology (ed K.A. Bishop-Lilly) (Humana Press, 2017).
[58] Caporaso, J. G. et al. QIIME allows analysis of high-throughput community sequencing data. Nature Meth. 7, 335–336 (2010).


Posted December 13, 2018.


[1] ↵
Woese, C. R. & Fox, G. E. Phylogenetic structure of the prokaryotic domain: The primary kingdoms. Proc. Natl. Acad. Sci. U.S.A. 74, 5088–5090, doi:10.1073/pnas.74.11.5088 (1977).
OpenUrl Abstract/FREE Full Text

[2] ↵
Woese, C. R., Kandler, O. & Wheelis, M. L. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc. Natl. Acad. Sci. U.S.A. 87, 4576–4579, doi:10.1073/pnas.87.12.4576 (1990).
OpenUrl Abstract/FREE Full Text

[3] ↵
Tikhonovich, I. A. & Provorov, N. A. Microbiology is the basis of sustainable agriculture: An opinion. Ann. Appl. Biol. 159, 155–168, doi:10.1111/j.1744-7348.2011.00489.x (2011).
OpenUrl CrossRef

[4] ↵
J T Staley, a. & Konopka, A. Measurement of in situ activities of nonphotosynthetic microorganisms in aquatic and terrestrial habitats. Annu Rev Microbiol 39, 321–346, doi:10.1146/annurev.mi.39.100185.001541 (1985).
OpenUrl CrossRef PubMed Web of Science

[5] ↵
Weller, R. & Ward, D. M. Selective recovery of 16S rRNA sequences from natural microbial communities in the form of cDNA. Appl. Environ. Microbiol. 55, 1818–1822 (1989).
OpenUrl Abstract/FREE Full Text

[6] ↵
Hebert, P. D. N., Cywinska, A., Ball, S. L. & deWaard, J. R. Biological identifications through DNA barcodes. Proc R Soc Lond [Biol] 270, 313–321, doi:10.1098/rspb.2002.2218 (2003).
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Singer, E. et al. High-resolution phylogenetic microbial community profiling. ISME J 10, 2020–2032, doi:10.1038/ismej.2015.249 (2016).
OpenUrl CrossRef

[8] ↵
Hill, J. E., Penny, S. L., Crowell, K. G., Goh, S. H. & Hemmingsen, S. M. cpnDB: A chaperonin sequence database. Genome Res. 14, 1669–1675 (2004).
OpenUrl Abstract/FREE Full Text

[9] ↵
Adékambi, T., Drancourt, M. & Raoult, D. The rpoB gene as a tool for clinical microbiologists. Trends Microbiol. 17, 37–45 (2009).
OpenUrl CrossRef PubMed Web of Science

[10] ↵
Barret, M. et al. Identification of Methanoculleus spp. as active methanogens during anoxic incubations of swine manure storage tank samples. Appl. Environ. Microbiol. 79, 424–433 (2013).
OpenUrl Abstract/FREE Full Text

[11] ↵
Schoch, C. L. et al. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc. Natl. Acad. Sci. U.S.A. 109, 6241–6246, doi:10.1073/pnas.1117018109 (2012).
OpenUrl Abstract/FREE Full Text

[12] ↵
Barret, M. et al. Emergence shapes the structure of the seed-microbiota. Appl. Environ. Microbiol. 81, 1257–1266 (2015).
OpenUrl Abstract/FREE Full Text

[13] ↵
Walker, A. W. et al. 16S rRNA gene-based profiling of the human infant gut microbiota is strongly influenced by sample processing and PCR primer choice. Microbiome 3, 26, doi:10.1186/s40168-015-0087-4 (2015).
OpenUrl CrossRef PubMed

[14] ↵
Guo, J., Cole, J. R., Zhang, Q., Brown, C. T. & Tiedje, J. M. Microbial community analysis with ribosomal gene fragments from shotgun metagenomes. Appl. Environ. Microbiol. 82, 157–166, doi:10.1128/aem.02772-15 (2016).
OpenUrl Abstract/FREE Full Text

[15] ↵
Lynch, M. D. J. & Neufeld, J. D. Ecology and exploration of the rare biosphere. Nat Rev Micro 13, 217–229, doi:10.1038/nrmicro3400 (2015).
OpenUrl CrossRef PubMed

[16] ↵
Poretsky, R., Rodriguez-R, L. M., Luo, C., Tsementzi, D. & Konstantinidis, K. T. Strengths and limitations of 16S rRNA gene amplicon sequencing in revealing temporal microbial community dynamics. PLOS ONE 9, e93827, doi:10.1371/journal.pone.0093827 (2014).
OpenUrl CrossRef PubMed

[17] ↵
Hess, M. et al. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science 331, 463–467 (2011).
OpenUrl Abstract/FREE Full Text

[18] Raymond, F. et al. The initial state of the human gut microbiome determines its reshaping by antibiotics. ISME J 10, 707–720, doi:10.1038/ismej.2015.148 (2016).
OpenUrl CrossRef PubMed

[19] ↵
Handley, K. M. et al. Biostimulation induces syntrophic interactions that impact C, S and N cycling in a sediment microbial community. ISME J. 7, 800–816, doi:10.1038/ismej.2012.148 (2013).
OpenUrl CrossRef PubMed

[20] ↵
Fierer, N. et al. Cross-biome metagenomic analyses of soil microbial communities and their functional attributes. Proc. Natl. Acad. Sci. U.S.A. 109, 21390–21395, doi:10.1073/pnas.1215210110 (2012).
OpenUrl Abstract/FREE Full Text

[21] ↵
Luo, C. et al. Soil microbial community responses to a decade of warming as revealed by comparative metagenomics. Appl. Environ. Microbiol. 80, 1777–1786, doi:10.1128/aem.03712-13 (2014).
OpenUrl Abstract/FREE Full Text

[22] ↵
Chaban, B. & Hill, J. E. A ‘universal’ type II chaperonin PCR detection system for the investigation of Archaea in complex microbial communities. ISME J. 6, 430–439 (2012).
OpenUrl CrossRef PubMed

[23] ↵
Links, M. G. et al. Simultaneous profiling of seed-associated bacteria and fungi reveals antagonistic interactions between microorganisms within a shared epiphytic microbiome on Triticum and Brassica seeds. New Phytol. 202, 542–553, doi:10.1111/nph.12693 (2014).
OpenUrl CrossRef PubMed Web of Science

[24] ↵
Links, M. G., Dumonceaux, T. J., Hemmingsen, S. M. & Hill, J. E. The chaperonin-60 universal target is a barcode for bacteria that enables de novo assembly of metagenomic sequence data. PLoS One 7, e49755, doi:10.1371/journal.pone.0049755 (2012).
OpenUrl CrossRef PubMed

[25] ↵
Links, M. G., Chaban, B., Hemmingsen, S., Muirhead, K. & Hill, J. mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences. Microbiome 1, 23 (2013).
OpenUrl CrossRef

[26] ↵
Hill, J. E., Town, J. R. & Hemmingsen, S. M. Improved template representation in cpn60 polymerase chain reaction (PCR) product libraries generated from complex templates by application of a specific mixture of PCR primers. Environ. Microbiol. 8, 741–746, doi:10.1111/j.1462-2920.2005.00944.x (2006).
OpenUrl CrossRef PubMed

[27] ↵
Dumonceaux, T. J., Hill, J. E., Hemmingsen, S. M. & Van Kessel, A. G. Characterization of intestinal microbiota and response to dietary virginiamycin supplementation in the broiler chicken. Appl. Environ. Microbiol. 72, 2815–2823 (2006).
OpenUrl Abstract/FREE Full Text

[28] ↵
Schuenemann, V. J. et al. Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death. Proc. Natl. Acad. Sci. U.S.A. 108, E746–E752, doi:10.1073/pnas.1105107108 (2011).
OpenUrl Abstract/FREE Full Text

[29] Wagner, D. M. et al. Yersinia pestis and the Plague of Justinian 541-543 AD: a genomic analysis. Lancet Infect Dis (2014).

[30] ↵
Gasc, C. & Peyret, P. Hybridization capture reveals microbial diversity missed using current profiling methods. Microbiome 6, 61, doi:10.1186/s40168-018-0442-3 (2018).
OpenUrl CrossRef

[31] ↵
Topp, E. et al. Accelerated biodegradation of veterinary antibiotics in agricultural soil following long-term exposure, and isolation of a sulfamethazine-degrading Microbacterium sp. J. Environ. Qual. 42, 173–178, doi:10.2134/jeq2012.0162 (2013).
OpenUrl CrossRef PubMed

[32] ↵
Martin-Laurent, F., Marti, R., Waglechner, N., Wright, G. D. & Topp, E. Draft Genome Sequence of the Sulfonamide Antibiotic-Degrading Microbacterium sp. Strain C448. Genome Announcements 2, e01113–01113, doi:10.1128/genomeA.01113-13 (2014).
OpenUrl CrossRef

[33] ↵
Bustin, S. A. et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem 55, 611–622 (2009).
OpenUrl Abstract/FREE Full Text

[34] ↵
Props, R. et al. Absolute quantification of microbial taxon abundances. ISME J, doi:10.1038/ismej.2016.117 (2016).
OpenUrl CrossRef

[35] ↵
Johnson, L. A., Chaban, B., Harding, J. C. & Hill, J. E. Optimizing a PCR protocol for cpn60-based microbiome profiling of samples variously contaminated with host genomic DNA. BMC research notes 8, 253, doi:10.1186/s13104-015-1170-4 (2015).
OpenUrl CrossRef

[36] Green, S. J., Venkatramanan, R. & Naqib, A. Deconstructing the polymerase chain reaction: Understanding and correcting bias associated with primer degeneracies and primer-template mismatches. PLoS ONE 10, doi:10.1371/journal.pone.0128122 (2015).
OpenUrl CrossRef

[37] ↵
Lee, C. K. et al. Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing. PLoS ONE 7, doi:10.1371/journal.pone.0044224 (2012).
OpenUrl CrossRef PubMed

[38] ↵
Pinto, A. J. & Raskin, L. PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets. PLOS ONE 7, e43093, doi:10.1371/journal.pone.0043093 (2012).
OpenUrl CrossRef PubMed

[39] ↵
Logares, R. et al. Metagenomic 16S rDNA Illumina tags are a powerful alternative to amplicon sequencing to explore diversity and structure of microbial communities. Environ. Microbiol. 16, 2659–2671, doi:10.1111/1462-2920.12250 (2014).
OpenUrl CrossRef PubMed

[40] ↵
Ranjan, R., Rani, A., Metwally, A., McGee, H. S. & Perkins, D. L. Analysis of the microbiome: Advantages of whole genome shotgun versus 16S amplicon sequencing. Biochem. Biophys. Res. Commun. 469, 967–977, doi:http://dx.doi.org/10.1016/j.bbrc.2015.12.083 (2016).
OpenUrl CrossRef PubMed

[41] ↵
Chaban, B., Links, M. & Hill, J. A molecular enrichment strategy based on cpn60 for detection of Epsilon-Proteobacteria in the dog fecal microbiome. Microb. Ecol. 63, 348–357, doi:10.1007/s00248-011-9931-7 (2012).
OpenUrl CrossRef PubMed

[42] ↵
Demirel, B. & Scherer, P. The roles of acetotrophic and hydrogenotrophic methanogens during anaerobic conversion of biomass to methane: a review. Rev. Environ. Sci. Biotechnol. 7, 173–190 (2008).
OpenUrl CrossRef

[43] ↵
de Menezes, A. B., Richardson, A. E. & Thrall, P. H. Linking fungal–bacterial co-occurrences to soil ecosystem function. Curr. Opin. Microbiol. 37, 135–141, doi:http://dx.doi.org/10.1016/j.mib.2017.06.006 (2017).
OpenUrl

[44] ↵
Carballa, M., Regueiro, L. & Lema, J. M. Microbial management of anaerobic digestion: exploiting the microbiome-functionality nexus. Curr. Opin. Biotechnol. 33, 103–111, doi:http://dx.doi.org/10.1016/j.copbio.2015.01.008 (2015).
OpenUrl CrossRef PubMed

[45] Gopal, M. & Gupta, A. Microbiome selection could spur next-generation plant breeding strategies. Frontiers in microbiology 7, doi:10.3389/fmicb.2016.01971 (2016).
OpenUrl CrossRef

[46] ↵
Muegge, B. D. et al. Diet drives convergence in gut microbiome functions across mammalian phylogeny and within humans. Science 332, 970–974, doi:10.1126/science.1198719 (2011).
OpenUrl Abstract/FREE Full Text

[47] ↵
Kau, A. L., Ahern, P. P., Griffin, N. W., Goodman, A. L. & Gordon, J. I. Human nutrition, the gut microbiome and the immune system. Nature 474, 327–336, doi:10.1038/nature10213 (2011).
OpenUrl CrossRef PubMed Web of Science

[48] Busby, P. E. et al. Research priorities for harnessing plant microbiomes in sustainable agriculture. PLOS Biology 15, e2001793, doi:10.1371/journal.pbio.2001793 (2017).

[49] Koch, C., Müller, S., Harms, H. & Harnisch, F. Microbiomes in bioenergy production: From analysis to management. Curr. Opin. Biotechnol. 27, 65–72, doi:10.1016/j.copbio.2013.11.006 (2014).

[50] Fierer, N. Embracing the unknown: disentangling the complexities of the soil microbiome. Nature reviews. Microbiology (2017).

[51] Cleary, D. W. et al. Long-term antibiotic exposure in soil is associated with changes in microbial community structure and prevalence of class 1 integrons. FEMS Microbiol Ecol 92, doi:10.1093/femsec/fiw159 (2016).

[52] Dumonceaux, T. J. et al. Multiplex detection of bacteria associated with normal microbiota and with bacterial vaginosis in vaginal swabs by use of oligonucleotide-coupled fluorescent microspheres. J Clin Microbiol 47, 4067–4077, doi:10.1128/jcm.00112-09 (2009).

[53] Lee, D. H., Zo, Y. G. & Kim, S. J. Nonradioactive method to study genetic profiles of natural bacterial communities by PCR-single-strand-conformation polymorphism. Appl. Environ. Microbiol. 62, 3112–3120 (1996).

[54] Dumonceaux, T. J., Links, M. G., Town, J. R., Hill, J. E. & Hemmingsen, S. M. Targeted capture of cpn60 gene fragments for PCR-independent microbial community profiling. Protoc exch (2017).

[55] Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10, R25 (2009).

[56] Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods in molecular biology (Clifton, N.J.) 132, 365–386 (2000).

[57] Pérez-López, E., Hammond, C., Olivier, C. Y. & Dumonceaux, T. J. in Diagnostic Bacteriology Vol. 1616 Methods in Molecular Biology (ed K.A. Bishop-Lilly) (Humana Press, 2017).

[58] K.A. Bishop-Lilly

[59] Caporaso, J. G. et al. QIIME allows analysis of high-throughput community sequencing data. Nature Meth. 7, 335–336 (2010).