Abstract
A joint and integrated analysis of multi-site diffusion MRI (dMRI) datasets can dramatically increase the statistical power of neuroimaging studies and enable comparative studies pertaining to several brain disorders. However, dMRI data sets acquired on multiple scanners cannot be naively pooled for joint analysis due to scanner specific nonlinear effects as well as differences in acquisition parameters. Consequently, for joint analysis, the dMRI data has to be harmonized, which involves removing scanner-specific differences from the raw dMRI signal. In this work, we present a dMRI harmonization method that, when applied to multi-site data, is capable of removing scanner-specific effects, while accounting for minor differences in acquisition parameters such as b-value, spatial resolution and number of gradient directions in the dMRI data (typical for multi-site clinical research scans). We validate our algorithm on dMRI data acquired from two sites: Philadelphia Neurodevelopmental Cohort (PNC) with 800 healthy adolescents (ages 8 to 22 years) and Brigham and Women’s Hospital (BWH) with 70 healthy subjects (ages 14 to 54 years). In particular, we show that gender differences and maturation in different age groups are preserved after harmonization, as measured using effect sizes (small, medium and large), irrespective of the test sample size. Further, because we use matched control subjects from different scanners to estimate scanner-specific effects, we tested how many subjects are needed from each site to achieve best harmonization results. Our results indicate that at-least 16 to 18 well-matched healthy controls from each site are needed to reliably capture scanner related differences. The proposed method can thus be used for retrospective harmonization of raw dMRI data across sites despite differences in acquisition parameters, while preserving inter-subject anatomical variability.