RT Journal Article
SR Electronic
T1 Ultrafast and accurate sequence alignment and clustering of viral genomes
JF bioRxiv
FD Cold Spring Harbor Laboratory
SP 2024.06.27.601020
DO 10.1101/2024.06.27.601020
A1 Zielezinski, Andrzej
A1 Gudyś, Adam
A1 Barylski, Jakub
A1 Siminski, Krzysztof
A1 Rozwalak, Piotr
A1 Dutilh, Bas E.
A1 Deorowicz, Sebastian
YR 2024
UL http://biorxiv.org/content/early/2024/07/02/2024.06.27.601020.abstract
AB Viromics produces millions of viral genomes and fragments annually, overwhelming traditional sequence comparison methods. We introduce Vclust, a novel approach that determines average nucleotide identity by Lempel-Ziv parsing and clusters viral genomes with thresholds endorsed by authoritative viral genomics and taxonomy consortia. Vclust demonstrates superior accuracy and efficiency compared to existing tools, clustering millions of virus genomes in a few hours on a mid-range workstation. Competing Interest Statement: The authors have declared no competing interest.