Abstract
Increasing interest in the structural and functional organization of the human brain in health and disease encourages the acquisition of big datasets consisting of multiple neuroimaging modalities accompanied by additional information obtained from health records, cognitive tests, biomarkers and genotypes. Diffusion weighted magnetic resonance imaging data enables a range of promising imaging phenotypes probing structural connections as well as macroanatomical and microstructural properties of the brain. The reliability and biological sensitivity and specificity of diffusion data depend on processing pipeline. A state-of-the-art framework for data processing facilitates crossstudy harmonisation and reduces pipeline-related variability. Using diffusion data from the UK Biobank we provide a comprehensive evaluation of different processing steps that have been suggested to reduce imaging artefacts and improve reliability of diffusion metrics. We consider a general pipeline comprising 7 post-processing blocks: noise correction; Gibbs ringing correction; evaluation of field distortions; susceptibility, eddy-current and motion-induced distortion corrections; bias field correction; spatial smoothing; and final diffusion metric estimations. Based on this evaluation, we suggest an optimised diffusion pipeline for processing of diffusion weighted imaging data.
Introduction
Increasing interest in the role of individual differences in human brain architecture in health and disease has stimulated the neuroscience community to initiate a number of large brain data projects. Due to the attractive combination of increasing availability, low costs, its non-invasive nature and high sensitivity magnetic resonance imaging (MRI) including T1/T2-weighted images, functional MRI with tasks and resting state, perfusion and diffusion weighted imaging has become the preferred and standard brain imaging modality in these large efforts, including the UK Biobank (UKB) (Miller et al., 2016). Brain imaging data are often accompanied by clinical and biological information such as psychological tests, blood samples, genetic data and other parameters depending on original project targets. Combining all obtained information into a common statistical framework benefits from universally adapted post-processing pipelines for harmonised data quality assessment and manipulation.
Diffusion MRI is based on the effect of the Brownian motion of water molecules in biological tissue (Basser et al., 1994) and allows one to probe and visualise brain organisation at the micrometer scale (Johansen-Berg and Behrens, 2014). Current impetuous growth of theoretical and experimental diffusion MRI approaches (Novikov et al., 2018) has offered various diffusion models and sequences in order to effectively describe the signal decay due to water diffusion. Advanced diffusion measurements are technically challenging and optimal data quality places high demands on practical implementation and protocol, including hardware gradient system and coil. Due to limited time and technical constraints researchers designing imaging studies face various trade-offs, influencing, e.g. signal-to-noise ratio (SNR) and options related to the specific pulse sequences such as mono-or bipolar diffusion gradients etc.
Before diffusion metric estimations and statistical analysis there are many different approaches of quality control (QC) and corrections applied to the data in order to verify diffusion data integrity (Alfaro-Almagro et al., 2018; Farzinfar et al., 2013; Esteban et al., 2017; Hasan, 2007; Oguz et al., 2014). Ideally, the QC methods should rapidly identify and correct typical artefacts such as head motion, discarded volumes, and low SNR, which may be particularly present at high diffusion weightings, also known as b-values. Despite recent major developments and improvements (Alfaro-Almagro et al., 2018; Ciu et al., 2013; Miller et al., 2016; Roalf et al., 2016), automated procedures for QC and artefact reduction largely represent unresolved challenges in the imaging community.
Various post-processing steps have been suggested in order to correct common sources of noise and distortions, including thermal noise evaluation (Veraart et al., 2016a; Veraart et al., 2016b), Gibbs ringing correction (Kellner et al., 2016; Veraart et al., 2016c), susceptibility distortion correction (Andersson and Sotiropoulos, 2016a), motion correction (Andersson and Sotiropoulos, 2016a; Andersson et al., 2016b), correction of physiological noise and outliers (Maximov et al., 2011; Maximov et al., 2015; Sairanen et al., 2018; Walker et al., 2011), and eddy current induced geometrical distortions (Taylor et al., 2016). However, although the application of even part of the post-processing steps such as noise correction has been demonstrated to improve sensitivity and provide additional information about absolute diffusion metrics (Kochunov et al., 2018), systematic evaluations of the effects of the different steps on the diffusion metrics are scarce.
Several minimal post-processing pipelines have been recommended in order to prepare structural, functional and diffusion MRI data (Alfaro-Almagro et al., 2018; Ciu et al., 2013; Glasser et al., 2013; Sotiropoulos et al., 2013). Albeit fairly comprehensive, none of the recommended protocols include all steps listed above. For example, the UKB diffusion pipeline first employs fieldmap generation using the anterior-posterior (AP) and posterior-anterior (PA) images of original diffusion data. In turn, the selection of most reliable AP-PA images is performed by an estimation of relative correlations over all AP-PA images in order to find the most accurate reference image.
Thus, the UKB data exhibit only one diffusion-specific pipeline step based on eddy (Andersson et al., 2016a; Andersson et al., 2016b; Andersson et al., 2017), correcting the eddy currents and head motion, susceptibility artefacts and identification and replacement of outlier slices.
With the aim to identify the most efficient and adequate pipeline for diffusion data analysis we tested the effects of various diffusion post-processing steps on different diffusion scalar metrics, based on diffusion tensor imaging (Basser et al., 1994), diffusion kurtosis imaging (Jensen et al., 2005), and white matter tract integrity (Fieremans et al., 2011) using UKB data. In order to assess to which degree the chosen pipeline influences across-subject analysis and corresponding biological interpretations we compared estimated age-curves (Grinberg et al., 2017; Tamnes et al., 2017; Westlye et al., 2010; Westlye et al., 2012) of the diffusion metrics between pipelines using tract-based spatial statistic (Smith et al., 2006, 2007).
Methods and Materials
Subjects and data
Table 1 summarises the demographics of the 218 UKB subjects included in the present work. We computed diffusion scalar metrics using four different pipelines, i.e. the total number of datasets included in the analysis is 872. An accurate overview of the UKB data acquisition, protocol parameters, and image validation can be found elsewhere (Alfaro-Almagro et al., 2018; Miller et al., 2016). The original UKB post-processing pipeline is described in details online (http://biobank.ctsu.ox.ac.uk/crystal/docs/brain_mri.pdf).
Demographic data of the used UK Biobank sample.
The pipeline used in the present work is shown in Figure 1. We divided the post-processing flow into 7 general blocks. Additional block i (marked by blue frame in Fig.1) consists of extra steps allowing one to substitute or extend used algorithms. An advantage of the discussed pipeline is freely accessible open source code for all processing steps. Below we shortly describe each step in the suggested order.
Schematic representation of a general pipeline. Numbers in the upper left corner correspond to the step order. The step 7 is an estimation of final diffusion metrics depending on used diffusion model. The step i presents a possible variability in the pipeline but omitted in the present work, for example, a frequency drift correction, application of different spatial filters (isotropic vs anisotropic), difference in algorithmic utility implementations (ANTs vs FSL), permutations in the step orders (step 5 vs step 4).
1. Noise correction
The noise in diffusion data is spatially dependent in the case of multi-channel receive coils (Aja-Fernandez et al., 2014; Andre et al., 2014; Maximov et al., 2012). Principle component analysis of Marchenko-Pastur noise-only distribution provides an accurate and fast method of noise evaluation (Veraart et al., 2016a; Veraart et al., 2016b), thereby enabling signal-to-noise ratio enhancements by the Rician noise correction. In the presented work we used the original Veraart’s Matlab code (The Mathworks, Natick, Massachussets, USA): https://github.com/NYU-DiffusionMRI/mppca_denoise. The noise correction methods are regularly improved and in the future it might be substituted by a more efficient approach (see an evaluation of methods, for example, in reviews of Aja-Fernandez et al., 2014; Manjon et al., 2015).
2. Gibbs-ringing correction
Various artefacts appearing in the raw data due to table vibration (Gallichan et al., 2010), radiofrequency based distortions, incorrect magnetic field gradient calibration (McRobbie et al., 2006) can significantly degrade the diffusion data. One of the most frequent artefact is known as the Gibbs ringing artifact. This appears due to a A>space truncation along finite image sampling and can be suppressed by post hoc methods (Kellner et al., 2016; Perrone et al., 2015; Veraart et al., 2016c). Here we used the approach developed by Kellner and colleagues (Kellner et al., 2016) and the original Matlab code: https://bitbucket.org/reisert/unring.
3. EPI distortions
Diffusion data acquisition is based on echo-planar imaging (EPI), which is susceptible to multiple distortions originating from a magnetic field inhomogeneity. A few approaches have been developed to correct field inhomogeneities: a simple and robust method based on field mapping; a method based on evaluation of point spread function; and reversed gradient approach (Wu et al., 2008). FSL (Smith et al., 2004) offers an excellent utility for the EPI geometric distortion correction (topup, https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/topup, Andersson et al., 2003). Topup requires data with opposite phase-encoding directions for the non-diffusion weighted images (so called b0 images), for example, anterior-posterior and posterior-anterior pair or left-right and right-left pair.
4. Motion, eddy current and susceptibility distortion correction
Topup and eddy works together for correcting distortions appeared due to eddy currents, head motion and susceptibility originated artefacts. The GPU accelerated version of eddy (eddy_cuda) allows one to significantly speed up the computations as well as providing additional options such as in slice alignments, improved outlier detection and multi-band dataset estimations (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/eddy/UsersGuide) (Andersson et al., 2016a; Andersson et al., 2016b; Andersson et al., 2017).
5. Field non-uniformity
MR images possess a low frequency intensity shift appearing as intensity inhomogeneity over the image. Several studies have evaluated its influence on the intra- and inter-subject reproducibility of T1-weighted structural MRI data (Banerjee and Maji, 2015; Ganzetti et al., 2016). However, less has been published regarding effects of non-uniformity correction on diffusion data. In order to avoid bias based on the field non-uniformity we applied a bias field correction for b0 image. Then, the estimated field map was applied to all diffusion images in order to decrease the field inhomogeneity. We used the N4BiasFieldCorrection utility from ANTs (Tustison et al., 2010). An applied order of the bias field correction step is discussed below.
6. Spatial smoothing
After all previous steps, the diffusion data, in theory, are ready for diffusion scalar metric evaluation. In order to increase SNR, which may be particularly beneficial for the numerical stability of advanced diffusion models (Maximov et al., 2017; Vellmer et al., 2018), we applied spatial smoothing of the raw diffusion data. For simplicity, we used isotropic smoothing with a Gaussian kernel 1 mm3 implemented in the FSL function fslmaths.
7. Metric estimation
UKB diffusion data acquisition was done using a multi-shell protocol with b = 1000 and 2000 s/mm2 in addition to b=0. The dataset allows one to obtain typical diffusion metrics such as conventional diffusion tensor imaging (cDTI) (Basser et al., 1994), namely, fractional anisotropy (FA), mean, axial, and radial diffusivity (MD, AD, and RD, respectively); diffusion kurtosis imaging (DKI) (Jensen et al., 2005) with FA, MD, AD, RD, mean, axial, and radial kurtosis (MK, AK, RK, respectively); white matter tract integrity (WMTI) (Fieremans et al., 2011) metrics with axonal water fraction (AWF), extra-axonal axial and radial diffusivities (AE and RE), and tortuosity (Tort). These metrics are based on a cumulant expansion of the diffusion propagation function, i.e. strictly speaking they do not represent a comprehensive diffusion biophysical model (Novikov et al., 2018). Nevertheless, these maps are very popular and easy to obtain in clinical studies. For simplicity, we consider here only DTI, DKI and WMTI metrics. For DKI, we used an approach proposed by Veraart et al., (2013) and the original Matlab code (https://github.com/NYU-DiffusionMRI/Diffusion-Kurtosis-Imaging). DTI was estimated using DTIFIT in FSL, by means of a linear weighted least squares option in command line for the shell b = 1000 s/mm2. We assume that the original UKB DTI metrics were estimated with the same option, although it was not mentioned in the description (Miller et al., 2016).
i. Additional options
Some of the steps can be substituted by other approaches or implementations. For example, nonuniformity field corrections used in functional MRI and brain tissue segmentation may increase the accuracy of the motion correction (Ganzetti et al., 2016). The applied isotropic spatial filtering even with a quite small Gaussian kernel introduces blurring of tissue borders and increase partial voluming. A classical anisotropic diffusion filter based on the Perona-Malik algorithm (Perona and Malik, 1987) may provide an alternative with less blurring (Vellmer et al., 2018; Van Hecke et al., 2010). Therefore, we suggest to carefully consider the influence of different degradation factors on the diffusion image quality and to choose a reliable and robust tool for the correction step tailored to the study (see, for example, considerations related to neonatal neuroimaging: Bastiani et al., 2019).
Temporal SNR
In order to quantify the difference between the pipelines on a conventional data quality metric we estimated temporal signal-to-noise ratios (tSNR, Roalf et al., 2016) for each pipeline, which allows one to present a single numeric metric characterising the whole brain diffusion weighted dataset and to perform comparative estimations of data quality (Tønnesen et al., 2018). In order to accommodate the estimation to multi-shell data we separated the datasets into two independent b-shells with b = 1000 and 2000 s/mm2, respectively. In each case, tSNR was evaluated for S4, S5, S7 and UKB pipelines (see below).
Statistical analysis
In order to perform a statistical comparison between pipelines we used TBSS (Smith et al., 2006). Initially, all volumes were aligned to the FMRI58_FA template, supplied by FSL, using non-linear transformation realised by FNIRT (Andersson et al., 2007). Next, a mean FA image of all subjects was obtained and thinned in order to create mean FA skeleton. Afterwards, all FA images were thresholded at FA > 0.2. The maximal FA values were projected onto the skeleton in order to minimise confounding effects due to partial voluming.
Voxel-wise associations between age and the diffusion metrics were tested using general linear models (GLMs), including sex as covariate. The statistical analysis was performed using permutation-based inference implemented in randomise (Winkler et al., 2014) with 5000 permutations. Threshold-free cluster enhancement (TFCE, Smith and Nichols, 2009) was used. Statistical p value maps were thresholded at p < 0.05 corrected for multiple comparisons across space. In addition to voxel-wise statistics, we used diffusion metrics averaged over the skeletons for visualisation and plotting of differences between pipelines. The pipeline difference was visualised by correlation plots with estimated Pearson correlation coefficients (corrplot function of Matlab, The Mathworks, Natick, MA USA).
In order to assess to which degree the chosen pipeline can influence the interpretation of the results we compared the estimated age-related slopes for the diffusion metrics between S7 and UKB pipelines. Averaged diffusion metrics were used for plotting the age-curves. Linear regressions were performed using ordinary least squares fitting extracting regression slopes and intercepts. The Pearson correlation coefficient r and the coefficient of determination r2 between age and diffusion metrics were estimated using the corr function of Matlab.
Results
Voxel-wise comparisons
Figure 2 shows the scatter plots for the DTI metrics (FA, MD, AD, and RD) obtained from the DKI signal fitting and four pipelines. Left column of correlation plots corresponds to the mean values of diffusion metrics evaluated over the skeleton, the right column represents the standard deviations. Briefly, the results revealed high correlations of the diffusion metrics between all pipelines, however, the S7 pipeline exhibited significantly (Levene’s test with p < 0.001) reduced standard deviation (histogram peak values are at MD – 0.18; AD – 0.29; RD – 0.15) compared to the original UKB pipeline (histogram peak values are at MD – 0.21; AD – 0.35; RD – 0.23). Figure 3 shows the correlation plots for the DKI scalar metrics (MK, AK, and RK). The kurtosis scalar metrics were highly correlated between pipelines. The standard deviations are also lower in the S4, S5, and S7 pipelines compared to UKB. Figure 4 shows the correlation plots for the WMTI scalar metrics (AWF, AE, RE, and Tort). Since the estimation of WMTI metrics were based on the DKI values, the WMTI diffusion metrics exhibited quite high correlations for all pipelines similar to Fig. 3. The standard deviation of S7 pipeline was lower compared to all other pipelines (histogram peak values are at AWF – 0.07; AE – 0.40; RE – 0.18; Tort – 0.65). Figure 5 shows the scatter plots for the conventional single-shell (b = 1000 s/mm2) DTI metrics, suggesting similar relationships between the pipelines as for DKI, with lower standard deviation in the S7 pipeline compared to the original UKB pipeline.
Correlation plots for diffusion metrics based on DKI fitting obtained for four different pipelines (see Fig. 1): a) up to Step 5; b) up to Step 4; c) up to Step 7; d) original UK Biobank pipeline. Diffusion metrics were averaged over estimated subject skeletons in the case of each pipeline in accordance with the TBSS preparation pipeline.
Correlation plots for diffusion metrics based on DKI fitting obtained for four different pipelines: a) up to Step 5; b) up to Step 4; c) up to Step 7; d) original UKB pipeline. Diffusion metrics were averaged over estimated subject skeletons in the case of each pipeline in accordance with the TBSS preparation pipeline.
Correlation plots for diffusion metrics based on DKI fitting obtained for four different pipelines: a) up to Step 5; b) up to Step 4; c) up to Step 7; d) original UKB pipeline. Diffusion metrics were averaged over estimated subject skeletons in the case of each pipeline in accordance with the TBSS preparation pipeline.
Figure 6 shows the results from the voxel-wise comparison between the original UKB pipeline and S7. Both DKI/WMTI (Fig. 5a) and cDTI (Fig. 5b) revealed significant differences (p < 0.05) between pipelines, where the S7 pipeline metrics revealed both higher and lower values compared to UKB pipeline. The results of the analysis based on S5 vs S4 and S5 vs S7 are presented in Figure 7.
Correlation plots for diffusion metrics based on conventional DTI fitting obtained for four different pipelines: a) up to Step 5; b) up to Step 4; c) up to Step 7; d) original UKB pipeline. Diffusion metrics are averaged over estimated subject skeletons in the case of each pipeline in accordance with TBSS preparation pipeline.
Results of TBSS analysis between the pipelines: original UKB and the proposed here (S7). a) TBSS analysis for diffusion metrics based on DKI and WMTI approaches (2 ?-shells); b) TBSS analysis for diffusion metrics based on DTI fitting (1 b-shell). All images are in standard MNI space and correspond to the coordinates: x = 26; y = −8; z = 24. Any coordinate changes are marked. The red-yellow colour means that metrics from S6 pipeline are significantly higher than from UKB (p < 0.05); the blue-light-blue colour means an opposite situation.
Results of TBSS analysis between a) S5 and S7; b) S5 and original UKB pipeline. All images are in standard MNI space and correspond to the coordinates: x = 26; y = −8; z = 24. Any coordinate changes are marked. The red-yellow colour means that the first pipeline is significantly higher than the second one (p < 0.05); the blue-light-blue colour means an opposite situation.
tSNR
For b = 1000 s/mm2 mean estimated tSNR (std) = 3.83 (0.25), 3.83 (0.25), 3.94 (0.29) and 1.81 (0.10) for S4, S5, S7, and UKB, respectively. For b = 2000 s/mm2 mean estimated tSNR (std) = 1.92 (0.12), 1.92 (0.12), 1.93 (0.14), and 1.73 (0.10). These results indicate 2.2 times (for b = 1000 s/mm2) and 1.1 time (for b = 2000 s/mm2) higher tSNR in the S7 pipeline compared to the UKB pipeline.
Age-related differences across pipelines
Figure 8 shows the estimated age-curves for the various diffusion metrics and Table 2 shows the summary stats from the regression models. The cDTI metrics exhibited expected age-related differences with lower FA and higher MD, AD, and RD with higher age. MK, AK, and RK showed age-related reductions, i.e. reduced non-Gaussianity of the water diffusion with increased age. Metrics based on WMTI (AWF, AE, and RE) demonstrated reduction of the axonal water fraction and extension of the extra-axonal water diffusivity with increased age. Though the main tendency in all age-curves was similar for both pipelines, we found that the Pearson correlations and slopes were different for FA, MK, RK, and AWF, and significantly for AE indicating stronger associations with age in S7 pipeline.
Linear age correlations of diffusion metrics obtained from two pipelines (S7 and original UKB). Colour rectangles present an 95% interval of confidence. The red colour corresponds to UKB values, the blue colour corresponds to the S7 pipeline values.
Estimated regression slopes/intercepts and Pearson correlations coefficient r/r2 for diffusion metrics in age-curves in Fig. 8. The red colour emphasises the significant differences (p < 0.003) in the estimated parameters obtained after applied Fisher’s transformation.
Figure 9 shows the results from general linear model (GLM) testing linear age associations with the different diffusion metrics across the skeleton. The column “Diff” for the same diffusion metrics in Figure 9 visualizes the voxel-wise difference in the correlations. Voxels in red and blue showed significant age association only in S7 and UKB pipelines, respectively. The voxels where both pipelines detected significant age correlations are not shown. The relationship between the number of voxels marked as pipeline specific (red or blue) to the total number of voxels (yellow-red for each pipeline S7 and UKB) is summarised in Table 3. For example, for DTI metrics such as MD and AD the S7 pipeline identified strong age correlations in splenium as well as in the occipital white matter. Similar behaviour is observed for WMTI metrics with strong associations detected by S7 pipeline. In turn, DKI metrics exhibited more regions with strong correlations detected by UKB pipeline.
Results of General Linear Model (GLM) tests of diffusion metricses vs age across the skeleton. The linear fitting model is based on two pipelines S7 and UKB. “Diff” columns visualise the spatial difference between the GLM results: the red colour marked the regions with significant difference (p < 0.001) detected along S7 pipeline but not in UKB based tests; the blue colour marked the voxels with significant age correlation detected along UKB pipeline but not in S7. The mean skeleton is visualised by the green colour.
The number of voxels depending on the pipeline with significant age-correlation specific for the given pipeline vs a total number of voxels with significant age correlation (p < 0.001). See also Fig. 9.
Discussion
A growing interest in utilising advanced diffusion weighted imaging to study the human brain motivated us to test the effects of various data processing pipelines on different diffusion metrics. Differences in data post-processing steps are likely to influence reliability and results. Thus, a harmonised diffusion pipeline may prove valuable for increasing sensitivity, reliability, and generalisability across studies. We suggest a general framework with the following post-processing steps: 1) noise correction, 2) Gibbs ringing correction, 3) field mapping, 4) susceptibility, eddy current and motion distortion corrections, 5) B1 field correction, 6) spatial smoothing; and 7) final metrics estimation. Our comparison between four diffusion pipelines demonstrated that the general pipeline suggested here yield a substantially higher tSNR compared to the original UKB pipeline, and also influence the estimated age curves for MK, RK, AE, and AWF, with potentially important implications for the interpretations.
Overall, the diffusion metrics derived using the different pipelines demonstrated high correlations and similar standard deviations. In some cases, S7 resulted in lower standard deviations of the diffusion metrics than others, e.g. for WMTI. S7 also exhibited high correlations with S5 for the conventional DTI metrics. Results from the UKB pipeline showed relatively high correlations with S4 and S5 for the DTI metrics, and slightly lower for DKI and WMTI. For cDTI the UBK pipeline showed stronger correlations with S4 than to S5. The correlations between the S7 and UKB pipelines were lower than for other pipeline pairs. Overall, the results support that all blocks of the proposed S7 pipeline might lead to relevant improvements in the estimations of absolute diffusion metrics.
TBSS revealed a significant difference between the S7 and UKB pipelines for all diffusion metrics. Interestingly, the differences between pipelines did not reflect global shifts of the diffusion parameters across the skeleton, but rather spatially variable differences across several metrics, including DTI, DKI, and WMTI. The observed differences between S4 and S5 pipelines suggest significant effects of bias field corrections across a large part of the skeleton, nevertheless, the difference between the S4 and S5 pipelines looks similar to the value shift in contrast to the case of S7 and UKB comparison. The comparison between S5 and UKB pipelines revealed similar results as those seen when comparing the S7 and UKB pipelines (Fig 7b), suggesting that spatial smoothing in the S7 pipeline is a reasonable improvement which did not remove important information from the dataset. The findings of regionally specific increases or decreases in diffusion metrics for different pipelines might partly be explained by an effect of the noise correction step on physiological noise around the large arteries or strong susceptibility artefacts close to air cavities in the brain. Such artefacts might introduce spatially variable distortions, which could lead to spurious findings in clinical studies. This could explain the previously demonstrated higher sensitivity to group differences after noise correction (Kochunov et al., 2018). Moreover, our comparison between pipelines demonstrated that noise and Gibbs ringing corrections influenced tSNR both in the case of conventional diffusion data (b = 1000 s/mm2) and at higher diffusion weightings (b = 2000 s/mm2). In contrast, the spatial smoothing step (S7) did not introduce a strong shift in tSNR in our analysis (S4, S5), however, has been shown to influence further diffusion metric estimations by reducing the number of “bad” voxels (Veraart et al., 2013).
In order to assess possible practical consequences of the different noise correction steps we compared the estimated age slopes in DKI and WMTI metrics between pipelines. Age-related differences are abundant in the relevant age span (Grinberg et al., 2017; Westlye et al., 2010). Apart from MD and RD, all included metrics showed a large main effect of pipeline and, although similar signs, different estimated age-slopes between the S7 and UKB pipelines. In particular, the S7 pipeline revealed stronger age-correlations compared to UKB for WMTI metrics. Further, the voxel-wise comparisons revealed a higher number of voxels showing significant age associations in the S7 compared to the UKB pipeline for DTI and WMTI metrics. On the contrary, the UKB pipeline demonstrated a higher number of significant voxels in the case of DKI estimations. Although subtle, pipeline related global and spatially varying differences in diffusion metrics will have consequences for subsequent analyses, for example, for machine-learning based age prediction or diagnostic classification or prediction of clinical traits (Alnaes et al., 2018; Doan et al., 2017; Kuhn at al., 2018; Richards et al., 2018).
In conclusion, our analysis demonstrated that diffusion metric estimations are sensitive to specific pipeline and might benefit from the proposed sequential advanced post-processing steps. Our proposed pipeline is also an example of a general approach for harmonisation of postprocessing steps across diffusion MRI studies for increased sensitivity and generalisability, which may be particular important when applying complex diffusion models to multishell diffusion data.
Conflict of interests
Authors declared no conflict of interests.
Acknowledgement
This work was funded by the Research Council of Norway (249795) and the South-Eastern Norway Regional Health Authority (2014097, 2015073, 2016083).