RT Journal Article SR Electronic T1 OGRE: Overlap Graph-based metagenomic Read clustEring JF bioRxiv FD Cold Spring Harbor Laboratory SP 511014 DO 10.1101/511014 A1 Marleen Balvert A1 Tina Hauptfeld A1 Alexander Schönhuth A1 Bas E. Dutilh YR 2019 UL http://biorxiv.org/content/early/2019/01/03/511014.abstract AB The microbes that live in an environment can be identified from the genomic material that is present, also referred to as the metagenome. Using Next Generation Sequencing techniques this genomic material can be obtained from the environment, resulting in a large set of sequencing reads. A proper assembly of these reads into contigs or even full genomes allows one to identify the microbial species and strains that live in the environment. Assembling a metagenome is a challenging task and can benefit from clustering the reads into species-specific bins prior to assembly. In this paper we propose OGRE, an Overlap-Graph based Read clustEring procedure for metagenomic read data. OGRE is the only method that can successfully cluster reads in species-specific bins for large metagenomic datasets without running into computation time-or memory issues.