RT Journal Article SR Electronic T1 GuaCAMOLE: GC-bias aware estimation improves the accuracy of metagenomic species abundances JF bioRxiv FD Cold Spring Harbor Laboratory SP 2024.09.20.614100 DO 10.1101/2024.09.20.614100 A1 Holcik, Laurenz A1 von Haseseler, Arndt A1 Pflug, Florian G. YR 2024 UL http://biorxiv.org/content/early/2024/09/23/2024.09.20.614100.abstract AB GuaCAMOLE is a novel computational method which detects and removes GC bias from metagenomic sequencing data. Metagenomic sequencing measures the species composition of microbial communities, and has revealed the crucial role of microbiomes in the etiology of a range of diseases such as colorectal cancer. Quantitative comparisons of microbial communities are, however, affected by GC-content dependent biases. GuaCAMOLE works regardless of the specific amount or direction of GC-bias present in the data and requires only a single sample. The algorithm reports unbiased abundances and quantifies the amount of bias present in terms of GC-depdendent sequencing efficiencies. Experimental mock community data confirms both estimates to be accurate across a wide range of experimental protocols. In gut microbiomes of colorectal cancer patients we observe a clear bias against GC-poor species in the abundances reported by existing methods. GuaCAMOLE successfully removes this bias and corrects the abundance of clinically relevant GC-poor species such as F. nucleatum (28% GC) by up to a factor of two. GuaCAMOLE thus contributes to a better quantitative understanding of microbial communities by improving the accuracy and comparability of species abundances across experimental setups.Competing Interest StatementThe authors have declared no competing interest.