The tidyomics ecosystem: Enhancing omic data analyses
Abstract
The growth of omic data presents evolving challenges in data manipulation, analysis, and integration. Addressing these challenges, Bioconductor1 provides an extensive community-driven biological data analysis platform. Meanwhile, tidy R programming2 offers a revolutionary standard for data organisation and manipulation. Here, we present the tidyomics software ecosystem, bridging Bioconductor to the tidy R paradigm. This ecosystem aims to streamline omic analysis, ease learning, and encourage cross-disciplinary collaborations. We demonstrate the effectiveness of tidyomics by analysing 7.5 million peripheral blood mononuclear cells from the Human Cell Atlas3, spanning six data frameworks and ten analysis tools.
Competing Interest Statement
R.G. has received consulting income from Takeda and Sanofi, and declares ownership in Ozette Technologies. M.K. is an employee of and declares ownership in Achilles Therapeutics. The remaining authors declare no competing interests.
Footnotes
↵† A list of authors and their affiliations appears at the end of the paper.
We revised the manuscript with new figures.
Subject Area
- Biochemistry (13369)
- Bioengineering (10172)
- Bioinformatics (32534)
- Biophysics (16740)
- Cancer Biology (13825)
- Cell Biology (19651)
- Clinical Trials (138)
- Developmental Biology (10623)
- Ecology (15718)
- Epidemiology (2067)
- Evolutionary Biology (20024)
- Genetics (13223)
- Genomics (18350)
- Immunology (13460)
- Microbiology (31512)
- Molecular Biology (13131)
- Neuroscience (68662)
- Paleontology (509)
- Pathology (2128)
- Pharmacology and Toxicology (3669)
- Physiology (5723)
- Plant Biology (11784)
- Synthetic Biology (3305)
- Systems Biology (8029)
- Zoology (1813)