ABSTRACT
Next-generation sequencing (NGS) methods are transforming our capacity to detect pathogens and perform disease diagnosis. Although sequencing advances have enabled accessible and point-of-care NGS, data analysis pipelines have yet to provide robust tools for precise and certain diagnosis, particularly in cases of low sequencing coverage. Lack of standardized metrics and harmonized detection thresholds confound the problem further, impeding the adoption and implementation of these solutions in real-world applications. In this work, we tackle these issues and propose biologically-informed viral genome assembly coverage as a method to improve diagnostic certainty. We use the identification of viral replicases, an essential function of viral life cycles, to define genome coverage thresholds in which biological functions can be described. We validate the analysis pipeline, Viroscope, using field samples, synthetic and published datasets and demonstrate that it provides sensitive and specific viral detection. Furthermore, we developed Viroscope.io a web-service to provide on-demand NGS data viral diagnosis to facilitate adoption and implementation by phytosanitary agencies to enable precise viral diagnosis.
Competing Interest Statement
BP, SLV, TN, IF, CN are employed by Meristem SpA. BP, VM, JCJ, FG are employed by Multiplex SpA. This study received funding from Meristem SpA. The funder had the following involvement with the study: Meristem SpA is a startup company developing gene editing tools for fruit crops, Multiplex SpA is a molecular diagnostics spin-off startup company developing novel methods for virus detection in plants. Multiplex has filed an invention patent in Chile covering the method described herein. All authors declare no other competing interests.
Footnotes
↵ǂ Joint first authors