Abstract
Differential-abundance (DA) analyses enable microbiome researchers to assess how microbial species vary in relative or absolute abundance with specific host or environmental conditions, such as health status or pH. These analyses typically use sequencing-based community measurements that are taxonomically biased to measure some species more efficiently than others. Understanding the effects that taxonomic bias has on the results of a DA analysis is essential for achieving reliable and translatable findings; yet currently, these effects are unknown. Here, we characterized these effects for DA analyses of both relative and absolute abundances, using a combination of mathematical theory and data analysis of real and simulated case studies. We found that, for analyses based on species proportions, taxonomic bias can cause significant errors in DA results if the average measurement efficiency of the community is associated with the condition of interest. These errors can be avoided by using more robust DA methods (based on species ratios) or quantified and corrected using appropriate controls. Wide adoption of our recommendations can improve the reproducibility, interpretability, and translatability of microbiome DA studies.
This manuscript was rendered from commit 3bde34f of https://github.com/mikemc/differential-abundance-theory. Supporting data analyses can be found in the accompanying computational research notebook. Please post comments or questions on GitHub. The manuscript is licensed under a CC BY 4.0 License. See the GitHub Releases or Zenodo record for earlier versions.
Competing Interest Statement
The authors have declared no competing interest.