TY - JOUR T1 - deSAMBA: fast and accurate classification of metagenomics long reads with sparse approximate matches JF - bioRxiv DO - 10.1101/736777 SP - 736777 AU - Gaoyang Li AU - Bo Liu AU - Yadong Wang Y1 - 2019/01/01 UR - http://biorxiv.org/content/early/2019/08/18/736777.abstract N2 - Summary Long read sequencing technologies are promising to metagenomics studies. However, there is still lack of read classification tools to fast and accurately identify the taxonomies of noisy long reads, which is a bottleneck to the use of long read sequencing. Herein, we propose deSAMBA, a tailored long read classification approach that uses a novel sparse approximate match block (SAMB)-based pseudo alignment algorithm. Benchmarks on real datasets demonstrate that deSAMBA enables to simultaneously achieve fast speed and good classification yields, which outperforms state-of-the-art tools and has many potentials to cutting-edge metagenomics studies.Availability and Implementation https://github.com/hitbc/deSAMBA.Supplementary information: ER -