Skip to main content
bioRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search
New Results

A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps

View ORCID ProfileChenying Zhao, Dorota Jarecka, Sydney Covitz, Yibei Chen, Simon B. Eickhoff, Damien A. Fair, Alexandre R. Franco, Yaroslav O. Halchenko, Timothy J. Hendrickson, Felix Hoffstaedter, Audrey Houghton, Gregory Kiar, Austin Macdonald, Kahini Mehta, Michael P. Milham, Taylor Salo, Michael Hanke, Satrajit S. Ghosh, Matthew Cieslak, View ORCID ProfileTheodore D. Satterthwaite
doi: https://doi.org/10.1101/2023.08.16.552472
Chenying Zhao
aLifespan Informatics and Neuroimaging Center (PennLINC), Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
bPenn/CHOP Lifespan Brain Institute, Perelman School of Medicine, Children’s Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
cDepartment of Bioengineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
dDepartment of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Chenying Zhao
Dorota Jarecka
eMcGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sydney Covitz
aLifespan Informatics and Neuroimaging Center (PennLINC), Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
bPenn/CHOP Lifespan Brain Institute, Perelman School of Medicine, Children’s Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
dDepartment of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yibei Chen
eMcGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Simon B. Eickhoff
fInstitute of Neuroscience and Medicine, Brain & Behaviour (INM-7), Research Center Jülich, Jülich, Germany
gInstitute of Systems Neuroscience, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Damien A. Fair
hMasonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
iInstitute of Child Development, College of Education and Human Development, University of Minnesota, Minneapolis, MN, USA
jDepartment of Pediatrics, University of Minnesota Medical School, University of Minnesota, Minneapolis, MN, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandre R. Franco
lChild Mind Institute, New York, NY, USA
mCenter for Biomedical Imaging and Neuromodulation, Nathan Kline Institute for Psychiatric Research, Orangeburg, NY, USA
nDepartment of Psychiatry, NYU Grossman School of Medicine, New York, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yaroslav O. Halchenko
oDepartment of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Timothy J. Hendrickson
hMasonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
kMinnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Felix Hoffstaedter
fInstitute of Neuroscience and Medicine, Brain & Behaviour (INM-7), Research Center Jülich, Jülich, Germany
gInstitute of Systems Neuroscience, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Audrey Houghton
hMasonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Gregory Kiar
lChild Mind Institute, New York, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Austin Macdonald
oDepartment of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kahini Mehta
aLifespan Informatics and Neuroimaging Center (PennLINC), Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
bPenn/CHOP Lifespan Brain Institute, Perelman School of Medicine, Children’s Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
dDepartment of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael P. Milham
lChild Mind Institute, New York, NY, USA
mCenter for Biomedical Imaging and Neuromodulation, Nathan Kline Institute for Psychiatric Research, Orangeburg, NY, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Taylor Salo
aLifespan Informatics and Neuroimaging Center (PennLINC), Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
bPenn/CHOP Lifespan Brain Institute, Perelman School of Medicine, Children’s Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
dDepartment of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Hanke
fInstitute of Neuroscience and Medicine, Brain & Behaviour (INM-7), Research Center Jülich, Jülich, Germany
gInstitute of Systems Neuroscience, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Satrajit S. Ghosh
eMcGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
pDepartment of Otolaryngology, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew Cieslak
aLifespan Informatics and Neuroimaging Center (PennLINC), Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
bPenn/CHOP Lifespan Brain Institute, Perelman School of Medicine, Children’s Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
dDepartment of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Theodore D. Satterthwaite
aLifespan Informatics and Neuroimaging Center (PennLINC), Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
bPenn/CHOP Lifespan Brain Institute, Perelman School of Medicine, Children’s Hospital of Philadelphia Research Institute, Philadelphia, PA, USA
dDepartment of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
qCenter for Biomedical Image Computation and Analytics, University of Pennsylvania, Philadelphia, PA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Theodore D. Satterthwaite
  • For correspondence: sattertt@pennmedicine.upenn.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Neuroimaging research faces a crisis of reproducibility. With massive sample sizes and greater data complexity, this problem becomes more acute. Software that operates on imaging data defined using the Brain Imaging Data Structure (BIDS) – BIDS Apps – have provided a substantial advance. However, even using BIDS Apps, a full audit trail of data processing is a necessary prerequisite for fully reproducible research. Obtaining a faithful record of the audit trail is challenging – especially for large datasets. Recently, the FAIRly big framework was introduced as a way to facilitate reproducible processing of large-scale data by leveraging DataLad – a version control system for data management. However, the current implementation of this framework was more of a proof of concept, and could not be immediately reused by other investigators for different use cases. Here we introduce the BIDS App Bootstrap (BABS), a user-friendly and generalizable Python package for reproducible image processing at scale. BABS facilitates the reproducible application of BIDS Apps to large-scale datasets. Leveraging DataLad and the FAIRly big framework, BABS tracks the full audit trail of data processing in a scalable way by automatically preparing all scripts necessary for data processing and version tracking on high performance computing (HPC) systems. Currently, BABS supports jobs submissions and audits on Sun Grid Engine (SGE) and Slurm HPCs with a parsimonious set of programs. To demonstrate its scalability, we applied BABS to data from the Healthy Brain Network (HBN; n=2,565). Taken together, BABS allows reproducible and scalable image processing and is broadly extensible via an open-source development model.

Competing Interest Statement

The authors have declared no competing interest.

Footnotes

  • http://pennlinc-babs.readthedocs.io/

  • https://github.com/PennLINC/babs

Copyright 
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted August 18, 2023.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about bioRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps
(Your Name) has forwarded a page to you from bioRxiv
(Your Name) thought you would like to see this page from the bioRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps
Chenying Zhao, Dorota Jarecka, Sydney Covitz, Yibei Chen, Simon B. Eickhoff, Damien A. Fair, Alexandre R. Franco, Yaroslav O. Halchenko, Timothy J. Hendrickson, Felix Hoffstaedter, Audrey Houghton, Gregory Kiar, Austin Macdonald, Kahini Mehta, Michael P. Milham, Taylor Salo, Michael Hanke, Satrajit S. Ghosh, Matthew Cieslak, Theodore D. Satterthwaite
bioRxiv 2023.08.16.552472; doi: https://doi.org/10.1101/2023.08.16.552472
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A reproducible and generalizable software workflow for analysis of large-scale neuroimaging data collections using BIDS Apps
Chenying Zhao, Dorota Jarecka, Sydney Covitz, Yibei Chen, Simon B. Eickhoff, Damien A. Fair, Alexandre R. Franco, Yaroslav O. Halchenko, Timothy J. Hendrickson, Felix Hoffstaedter, Audrey Houghton, Gregory Kiar, Austin Macdonald, Kahini Mehta, Michael P. Milham, Taylor Salo, Michael Hanke, Satrajit S. Ghosh, Matthew Cieslak, Theodore D. Satterthwaite
bioRxiv 2023.08.16.552472; doi: https://doi.org/10.1101/2023.08.16.552472

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Neuroscience
Subject Areas
All Articles
  • Animal Behavior and Cognition (4657)
  • Biochemistry (10309)
  • Bioengineering (7629)
  • Bioinformatics (26217)
  • Biophysics (13462)
  • Cancer Biology (10637)
  • Cell Biology (15354)
  • Clinical Trials (138)
  • Developmental Biology (8461)
  • Ecology (12766)
  • Epidemiology (2067)
  • Evolutionary Biology (16781)
  • Genetics (11368)
  • Genomics (15416)
  • Immunology (10562)
  • Microbiology (25064)
  • Molecular Biology (10165)
  • Neuroscience (54203)
  • Paleontology (398)
  • Pathology (1658)
  • Pharmacology and Toxicology (2878)
  • Physiology (4319)
  • Plant Biology (9206)
  • Scientific Communication and Education (1582)
  • Synthetic Biology (2543)
  • Systems Biology (6759)
  • Zoology (1454)