Empowering bioinformatics communities with Nextflow and nf-core
Abstract
Standardised analysis pipelines are an important part of FAIR bioinformatics research. Over the last decade, there has been a notable shift from point-and-click pipeline solutions such as Galaxy towards command-line solutions such as Nextflow and Snakemake. We report on recent developments in the nf-core and Nextflow frameworks that have led to widespread adoption across many scientific communities. We describe how adopting nf-core standards enables faster development, improved interoperability, and collaboration with the >8,000 members of the nf-core community. The recent development of Nextflow Domain-Specific Language 2 (DSL2) allows pipeline components to be shared and combined across projects. The nf-core community has harnessed this with a library of modules and subworkflows that can be integrated into any Nextflow pipeline, enabling research communities to progressively transition to nf-core best practices. We present a case study of nf-core adoption by six European research consortia, grouped under the EuroFAANG umbrella and dedicated to farmed animal genomics. We believe that the process outlined in this report can inspire many large consortia to seek harmonisation of their data analysis procedures.
Competing Interest Statement
Some co-authors, as indicated by their affiliation, are employed at “Seqera Labs S.L.”