PT - JOURNAL ARTICLE AU - Xuan Lv AU - Zhiguang Chen AU - Yutong Lu AU - Yuedong Yang TI - An End-to-end Oxford Nanopore Basecaller Using Convolution-augmented Transformer AID - 10.1101/2020.11.09.374165 DP - 2020 Jan 01 TA - bioRxiv PG - 2020.11.09.374165 4099 - http://biorxiv.org/content/early/2020/11/10/2020.11.09.374165.short 4100 - http://biorxiv.org/content/early/2020/11/10/2020.11.09.374165.full AB - Oxford Nanopore sequencing is fastly becoming an active field in genomics, and it’s critical to basecall nucleotide sequences from the complex electrical signals. Many efforts have been devoted to developing new basecalling tools over the years. However, the basecalled reads still suffer from a high error rate and slow speed. Here, we developed an open-source basecalling method, CATCaller, by simultaneously capturing global context through Attention and modeling local dependencies through dynamic convolution. The method was shown to consistently outper-form the ONT default basecaller Albacore, Guppy, and a recently developed attention-based method SACall in read accuracy. More importantly, our method is fast through a heterogeneously computational model to integrate both CPUs and GPUs. When compared to SACall, the method is nearly 4 times faster on a single GPU, and is highly scalable in parallelization with a further speedup of 3.3 on a four-GPU node.Competing Interest StatementThe authors have declared no competing interest.