immuneML: an ecosystem for machine learning analysis of adaptive immune receptor repertoires

Abstract
Adaptive immune receptor repertoires (AIRR) are key targets for biomedical research as they record past and ongoing adaptive immune responses. The capacity of machine learning (ML) to identify complex discriminative sequence patterns renders it an ideal approach for AIRR-based diagnostic and therapeutic discovery. To date, widespread adoption of AIRR ML has been inhibited by a lack of reproducibility, transparency, and interoperability. immuneML (immuneml.uio.no) addresses these concerns by implementing each step of the AIRR ML process in an extensible, open-source software ecosystem that is based on fully specified and shareable workflows. To facilitate widespread user adoption, immuneML is available as a command-line tool and through an intuitive Galaxy web interface, and extensive documentation of workflows is provided. We demonstrate the broad applicability of immuneML by (i) reproducing a large-scale study on immune state prediction, (ii) developing, integrating, and applying a novel method for antigen specificity prediction, and (iii) showcasing streamlined interpretability-focused benchmarking of AIRR ML.
Competing Interest Statement
VG declares advisory board positions in aiNET GmbH and Enpicom B.V.
Footnotes
↵☯ Joint supervision
Subject Area
- Biochemistry (9637)
- Bioengineering (7127)
- Bioinformatics (24958)
- Biophysics (12676)
- Cancer Biology (10002)
- Cell Biology (14406)
- Clinical Trials (138)
- Developmental Biology (7991)
- Ecology (12152)
- Epidemiology (2067)
- Evolutionary Biology (16029)
- Genetics (10956)
- Genomics (14784)
- Immunology (9911)
- Microbiology (23750)
- Molecular Biology (9516)
- Neuroscience (51101)
- Paleontology (370)
- Pathology (1545)
- Pharmacology and Toxicology (2694)
- Physiology (4038)
- Plant Biology (8700)
- Synthetic Biology (2406)
- Systems Biology (6461)
- Zoology (1350)