Abstract
Microbial community analysis can be biased by multiple technical factors, such as storage conditions, DNA extraction, or amplification conditions. In a high-throughput laboratory that relies on samples obtained from thousands of different subjects, knowledge of the extent of subject-introduced sampling and storage variation on the outcome of the inferred microbiome, as well as the effect of laboratory-introduced variation caused by reagent batches, equipment, or operator on the consistency of these processes within the laboratory is paramount. Here, we analyzed the effect of sampling from different parts of the same stool specimen or on different consecutive days, as well as short-term storage of samples at different temperatures on microbiome profiles obtained by 16S rRNA gene amplification. Each of these factors had relatively little effect on the microbial composition. In addition, replicate amplification of 44 stool samples showed reproducible results. Finally, 363 independent replicate extractions and amplifications of a single human homogenized stool (HS) specimen showed reproducible results (average Lin’s correlation = 0.95), with little variation introduced by HS batch, operator, extraction equipment, or DNA sequencer. In all cases, variations between replicates were significantly smaller than those between individual samples; subject identity always was the largest determinant. We propose that homogenized stool specimens could be used as quality control to routinely monitor the laboratory process and to validate new methods.