Abstract
Motivation Metabolomics is an increasingly common part of health research and there is need for pre-analytical data processing. Researchers typically need to characterize the data and to exclude errors within the context of the intended analysis. While some pre-processing steps are common, there is currently a lack of standardization and reporting transparency for these procedures.
Results Here we introduce metaboprep, a standardized data processing workflow to extract and characterize high quality metabolomics data sets. The package extracts data from pre-formed worksheets, provides summary statistics and enables the user to select samples and metabolites for their analysis based on a set of quality metrics. A report summarizing quality metrics and the influence of available batch variables on the data is generated for the purpose of open disclosure. Where possible, we provide users flexibility in defining their own selection thresholds.
Availability and implementation metaboprep is an open-source R package available at https://github.com/MRCIEU/metaboprep
Contact d.a.hughes{at}bristol.ac.uk or laura.corbin{at}bristol.ac.uk
Competing Interest Statement
The authors have declared no competing interest.