PT - JOURNAL ARTICLE AU - Charlie Beirnaert AU - Pieter Meysman AU - Trung Nghia Vu AU - Nina Hermans AU - Sandra Apers AU - Luc Pieters AU - Adrian Covaci AU - Kris Laukens TI - Speaq 2.0: A Wavelet-Based Workflow For High-Throughput 1D NMR Spectra Processing And Quantification AID - 10.1101/138503 DP - 2017 Jan 01 TA - bioRxiv PG - 138503 4099 - http://biorxiv.org/content/early/2017/05/16/138503.short 4100 - http://biorxiv.org/content/early/2017/05/16/138503.full AB - Nuclear Magnetic Resonance (NMR) spectroscopy is, together with liquid chromatography-mass spectrometry (LC-MS), the most established platform to perform metabolomics. In contrast to LC-MS however, NMR data is predominantly being processed with commercial software. This has the effect that its data processing remains tedious and dependent on user interventions. As a follow-up to speaq, a previously released workflow for NMR spectral alignment and quantitation, we present speaq 2.0. This completely revised framework to automatically analyze 1D NMR spectra uses wavelets to efficiently summarize the raw spectra with minimal information loss or user interaction. The tool offers a fast and easy workflow that starts with the common approach of peak-picking, followed by grouping. This yields a matrix consisting of features, samples and peak values that can be conveniently processed either by using included multivariate statistical functions or by using many other recently developed methods for NMR data analysis. speaq 2.0 facilitates robust and high-throughput metabolomics based on 1D NMR but is also compatible with other NMR frameworks or complementary LC-MS workflows. The methods are benchmarked using two publicly available datasets. speaq 2.0 is distributed through the existing speaq R package to provide a complete solution for NMR data processing. The package and the code for the presented case studies are freely available on CRAN (https://cran.r-project.org/package=speaq) and GitHub (https://github.com/beirnaert/speaq).Author summary We present speaq 2.0: a user friendly workflow for processing NMR spectra quickly and easily. By limiting the need for user interaction and allowing the construction of workflows by combining R functions, metabolomics data analysis becomes fully reproducible and shareable. Such advances are critical for the future of the metabolomics field as it needs to move towards a fully open-science approach. This is no trivial goal as many researchers are still using black-box commercial software that often requires manually doing several steps, thus hampering reproducibility. To encourage the shift towards open source, we deliberately made our method usable for anyone with the most basic of R experience, something that is easily acquired. speaq 2.0 allows a stand-alone analysis from spectra to statistical analysis. In addition, the package can be combined with existing tools to improve performance, as it provides a superior peak picking method compared to the standard binning approach.