Abstract
Multi-compartment T2-mapping has gained particular relevance for the study of myelin water in brain. As a facilitator of rapid saltatory axonal signal transmission, myelin is a cornerstone indicator of white matter development and function. Regularized non-negative least squares fitting of multi-echo T2 data has been widely employed for the computation of the myelin water fraction (MWF) and the obtained MWF maps have been histopathologically validated. MWF measurements depend upon the quality of the data acquisition, B1+-homogeneity and a range of fitting parameters. In this special issue article, we discuss the relevance of these factors for the accurate computation of multi-compartment T2 and MWF maps. We generated multi-echo spin-echo T2 decay curves following the approach of CarrPurcell-Meiboom-Gill for various myelin concentrations and myelin T2 scenarios by simulating the evolution of the magnetization vector between echoes based on the Bloch equations. We demonstrated that noise and imperfect refocusing flip angles yield systematic underestimations in MWF and intra-/extracellular water geometric mean (gm) T2. MWF estimates were more stable than myelin water gmT2 time across different settings of the T2 analysis. We observed that the lower limit of the T2 distribution grid should be slightly shorter than TE1. Both TE1 and the acquisition echo spacing also have to be sufficiently short to capture the rapidly decaying myelin water T2 signal. Among all parameters of interest, the estimated MWF and intra-/extracellular water gmT2 differed by approximately 0.13-4 percentage points and 3-4 ms, respectively, from the true values, with larger deviations observed in the presence of greater B1+-inhomogeneities and at lower signal-to-noise ratio. Tailoring acquisition strategies may allow to better characterize the T2 distribution, including the myelin water, in vivo.
Since the inception of magnetic resonance imaging (MRI), researchers have strived to use MR signal relaxation techniques for the characterization of tissues and tissue pathologies. T2-weighting by means of spin-echoes, first described by Erwin Hahn [1,2], captures the effect of random field fluctuations on the transverse MR signal. Quantitative T2 relaxation measurements are commonly performed using the Carr-Purcell-Meiboom-Gill (CPMG) acquisition scheme [3,4]. Rapid acquisitions or 3D encoding utilize a “train” of spin-echoes, a technique pioneered by Jürgen Hennig [5]. Hennig also devised the extended phase graph (EPG), used to describe the formation of spurious echoes in the T2 decay due to imperfect refocusing flip angles (FA) in the presence of static field inhomogeneities [6,7]. By ensuring that subsequent refocusing pulses have the same phase, orthogonal to that of the excitation pulse, CPMG mitigates errors due to imperfect refocusing and achieves higher signal through constructive interference of refocusing pathways. T2-mapping has been primarily applied to characterize brain tissue [8,9], but also outside the brain for imaging of cartilage [10] and intervertebral discs [11]. Notably, microstructural tissue compartmentalization can contribute more than one exponential component to the T2 decay [12–15].
The short T2 component in the brain and its relationship with myelin Multi-component analysis of multi-echo T2 data through construction of continuous T2 distributions was described by Whittall and MacKay [16]. They employed non-negative least squares (NNLS) [17] to solve the inverse problem that is posed by an unknown number of T2 components contributing to the T2 decay. In the early nineties, Menon and Allen [18] described two T2 components in feline brain ex vivo and assigned the shorter component to water in myelin. Thereafter, a similar T2 component was observed in guinea pig brain ex vivo [13]. MacKay and Whittall et al. were the first to report a short T2 component in human brain in vivo at 1.5 T [19]. This short T2 signal co-localized with brain white matter (WM) and its signal proportion was in line with the amounts of “myelin water” (MW) reported in vitro [20]. These observations strongly suggested that the short T2 signal originates from water molecules trapped within the tightly wrapped myelin bilayers. At 30 Å [21], i.e. only about ten times the size of water molecules, the space occupied by water within the bilayers is much narrower than intra- and inter-axonal distances. The therefore restricted motion leads to faster spin-spin dephasing of aqueous protons in MW than in the intra- and extracellular space. The MR signal from non-aqueous protons of the phospholipid bilayers decays to zero in less than 100 μs; hence it is not visible with conventional MR systems [22]. The relationship of the short T2 component and myelin was illustrated in cases of known myelin pathology, where lesions devoid of myelin showed little short T2 signal [23,24]. Histopathological studies later validated the association of the short T2 component with myelin lipid concentrations [25,26]. The term myelin water fraction (MWF) was coined for the short T2 fraction of the total water signal. However, water protons other than those in myelin may contribute to rapid T2 decay. Biological iron, calcium and other tissue constituents can cause additional T2 shortening of water signal in the intra- and extracellular space [27], increasing the apparent MWF [28]. Since WM iron is predominantly found in myelin forming oligodendrocytes and contributes to lipid synthesis [29], the sensitivity of the short T2 component to both myelin and iron should be considered. Mapping myelin in vivo permits us to study brain development [30–32], trauma [33] or aging [34,35], as well as neurodegenerative diseases that have a direct link to myelination, such as multiple sclerosis (MS) [23,36]. However, despite being proposed 25 years ago, spin-echo myelin water imaging (MWI) has to-date found limited clinical application. This is in part due to the long acquisition times and an intricate signal analysis.
MWI using non-negative least squares (NNLS) fitting to multi-exponential T2 data The NNLS analysis and the computation of the MWF depend on a number of parameters. In vivo literature has primarily reported on multi-component T2-mapping and corresponding analysis parameters in human brain at 1.5 T and 3 T. As MWI expands to other field strengths and applications for which multi-component T2 times have not yet been established, adjustment of the T2 analysis and application of NNLS may be required to obtain reliable MWF measurements. For example, if T2 relaxation times are shorter, the T2 boundaries that distinguish the different water compartments need to be redefined. Otherwise, signal from a water pool with a T2 longer than that of MW may be erroneously counted toward the short water pool fraction as it crosses the pre-established T2 cut-off [37]. In brain tissues, the intermediate water compartment is attributed to intra- and extracellular water (IEW). This intermediate T2 pool can typically not be further divided using the resolution of T2 time at 3 T. However, a few studies have found two T2 times in the intermediate T2 range, ascribing the longer component to extracellular water [38–40].
To provide greater insight into the data analysis, we discuss in this article the various parameters involved in the computation of the NNLS solution [17] in accurately determining different water pool fractions and their T2 properties. A number of groups have assessed the properties of NNLS for multi-component T2 analysis, particularly in the first decade of MWI [16,41,42–44]. We intend to update those findings and provide a guideline for the application of NNLS with EPG to multi-component T2 analysis in different settings. The main challenge for testing analysis parameters in vivo relates to the lack of a known ground truth. The MWF and even more so the T2 of myelin water (MW) are not well established. Differences in acquisition timing may further influence the range of T2 time that is accessible for analysis of T2 decay data [41]. In this work, we used Bloch simulations to generate signal decay curves from different multi-echo T2 acquisition strategies for synthetic voxels that contained realistic myelin geometries. First, the EPG algorithm was employed to estimate the unknown refocusing FAs [7,45]. Thereafter, the generated T2 decay curves were analyzed with varying NNLS parameters and results were compared to the known ground truth. We addressed the influence of noise and regularization on the computation, in scenarios of low to high MWFs and different MW T2 times.
Experimental
Numerical simulation of T2 decay data
Bloch simulations were performed on a 256 × 256 grid over a 2D voxel (Figure 1). The layout was based on a publicly available electron microscopy image (https://upload.wikimedia.org/wikipedia/commons/c/c1/Myelinated_neuron.jpg, accessed Sept 12th 2018). The voxel contained three myelinated axons, two of which had thinner myelin sheaths than the third, amounting to 21% MWF (Fig. 1A). To assess whether lower MW contents can also be correctly estimated, the voxel layout was adjusted to yield approximately 11.6% and 5.1% MWF (Fig. 1B and C). These models are henceforth denoted as models (M) 1, 2, 3, corresponding to 21, 11.6 and 5.1% MWF.
We defined four subspaces: Extra-axonal water, intra-axonal water, water in between the myelin bilayers and the non-aqueous myelin bilayers. Literature T2 and T1 relaxation times [46–48] were assigned to spins in different compartments by random sampling from a Gaussian distribution around the mean compartmental T1 and T2. Local spin resonance frequencies were allocated by forward field computation of the magnetic susceptibility distribution [49] (χ = 0 ppm for all water, χ= −0.02 ppm for the myelin lipid bilayers, assuming 80% myelin lipids and 20% proteins [50,51]). Resonance frequency offsets (f) [52–54], relative to the extra-axonal water frequency, were then assigned to the different water pools considering the computed field perturbations. Intra- and extracellular water were assumed to have the same T1 and T2 (T1 = 1200 200 ms [47,48], T2 = 70 10 ms [46]), but different f (−3.052 and 0.053 Hz, respectively). MW was presumed to exhibit a rapid signal decay in both T1 and T2 (T1 = 600 100 ms [47,48], T2= varied from 3 – 20 5 ms [46], f = 7 Hz [53,54]). Very short, (“MR invisible”) relaxation rates and high offset frequencies were modelled for the highly anisotropic non-aqueous myelin bilayers (T1 = 120 20 ms, T2 = 0.1 0.05 ms, f =−20 Hz). Note that the proton signal from myelin phospholipid bilayers does not decay exponentially, but is characterized by a Super-Lorentzian line shape [55,56]. Literature on T1 of the aqueous and non-aqueous parts of myelin is scarce and in vivo measurements are known to be affected by magnetization exchange [57]. Hence, T1 values represent approximate values only.
A linear frequency variation was added to the frequency map as encoding gradients impose much larger frequency offsets than induced locally [7]. Gaussian noise was added to the T1 and T2 maps and independently to the real and imaginary components of the complex field map (signal-to-noise ratio (SNR) = 250).
Multi-echo spin-echo data were generated analogous to the in vivo MR protocol of the gradient-echo and spin-echo acquisition currently performed for MWI at 3 T [58]: 32 echoes, TE1/ΔTE=10/10 ms and TR=1200 ms. The magnetization vector evolution was described with rotation matrices for each isochromat, allowing for spin excitation and dephasing in the rotating frame of reference. The Carr-Purcell-Meiboom-Gill spin-echo experiments consisted of a 90° excitation pulse along ŷ and subsequent 180° refocusing pulses along, with the time between the excitation and first refocusing pulse being TE1/2. At time of echo formation, the total magnetization of the set of isochromats was computed and the magnitude of the complex signal, i.e. the transverse component of the magnetization vector, was taken. For each of the three models (Fig. 1), we varied the MW T2 through 3, 5, 8, 10, 13, 15, 18 and 20 ms. Six refocusing FAs between 130°−180° were considered and the effect of shorter echo spacing was tested by also computing decay curves of 48 echoes at 8 ms TE/ΔTE and 64 echoes at 6 ms TE/ΔTE [59]. Note that the 48 and 64 echo acquisitions included a few later TE times than the 32 echo acquisition. To accommodate the longer readouts, TR was chosen longer than typically used at 3 T for 32 echoes.
Computation of T2 distributions and analysis parameters of interest
The most fundamental step in correctly estimating T2 and subsequently the MWF from multi-echo MRI is the determination of the actual refocusing FA to account for the presence of stimulated echoes [45]. To this end, hypothetical T2 decay curves are computed by EPG for a number of T2 times (T2,i) and at a few different FAs, typically eight, equally spaced between 100°−180°. EPG identifies three magnetization phase configurations and traces state transitions by relaxation and transition matrices. The states, Fn, and Zn, represent transverse and longitudinal dephasing and rephasing spins. Following the application of a radiofrequency pulse α with an initial phase of zero, state transitions occur by [45]
Between refocusing pulses, spins may transition from Fn to and reversely, noting that the sum of the magnetization in all states has to remain constant. Echoes are formed when transitions to F1, with echo amplitudes corresponding to the magnitude of . When periodicity conditions are met, i.e. TEn = n·ΔTE, the number of configurations that need to be computed grows linearly with the number of echoes. If echoes with different phases occur, additional states need to be traced. An average, single exponential tissue T1 (1 s for WM at 3 T [9]) and sequence parameters are also considered by EPG, including echo train length (ETL), Δ TE and the attempted refocusing pulse angle, customarily 180° for spin-echo sequences. Note that computing voxel specific T1 times does not improve MWF estimation [45]. For each FA, NNLS determines the corresponding T2 distribution. A T2 decay curve is then computed by back projection from the T2 distribution and compared to the measured data. The sum of squares of the residuals, i.e. χ2, is calculated for each FA.
By cubic spline interpolation of all eight χ2, the FA corresponding to the minimum χ2, i.e. the actual FA, is established.
With the determined FA, EPG is used to recompute T2 decay curves for each T2i time in the grid. The magnetization computed at each echo will depend on the echo amplitudes computed by EPG, the assumed proton density (PD), the longitudinal magnetization M0 and the excitation FA [45].
A represents the already computed deviation from the pre-scribed FA in each voxel. NNLS subsequently estimates the T2 distribution by comparing the theoretical decay curves to the measured data. The data remains unaltered when using EPG. Only the generated decay curves are adjusted to reflect imperfect signal refocusing and T1 effects in the T2 decay, thereby matching better to the acquired data. Due to discretization, resulting T2 distributions feature only delta-peaks at few T2 times, which may vary for different noise realizations. The sum S(T2) of the delta-functions for a given T2 with amplitudes ai determined at times T2i can be expressed as:
The amplitudes ai are larger or equal zero as T2i times are either present with a given amplitude or absent from the T2 distribution. This constraint turns Eq.4 into a non-linear problem. For the solution to resemble the true T2 distribution, the T2 grid needs to contain a sufficient number and range of T2i [16]. The T2 grid is established by logarithmic spacing between the lowest (T2,min) and highest T2, based on the number of allowed T2 times (nT2) and the expected width of the T2 distribution. More nT2 times may enhance confidence in the determination of the different components in the T2 distribution, but will prolong processing time. Most researchers utilize 40 nT2 times, encompassing T2 times between 15 ms and 2 s [46,60,61]. At these parameters, 30 min to several hours are required to compute a whole brain MWF map, depending on available hardware. We tested NNLS with nT2 of 20, 40, 80 and 120 T2 times, while varying the lower bound of the T2 grid, T2,min, between 3 – 15 ms in 1 ms intervals.
Assuming that , the signal Sj of echo j is defined by a system of linear equations
The matrix coefficients Aji represent the exponentials, the decay signal computed by EPG. In principle, the amplitudes ai can be calculated by taking the pseudo-inverse of Aji. However, exact inverse solutions are contaminated by noisy data and no constraints, such as non-negativity of all ai, can be imposed. By contrast, NNLS allows for a misfit between the simulated and measured decay data yj, for instance due to noise, using the χ2 statistic (Eq.2). When minimizing all ai are implicitly constrained to be non-negative by NNLS operating as an active set algorithm. The active set contains the regression coefficients, i.e. signal amplitudes, that are negative or zero when estimated without constraints. If the true active set were known, the solution would equal the unconstrained solution of all non-negative coefficients [62]. Starting with a zero vector where all coefficients are non-negative, NNLS approaches a prediction of the active set by iterative combination of all T2,i with different amplitudes, fulfilling the non-negativity constraint.
Regularization is employed to produce continuous distributions of T2 relaxation times, which are expected to conform to reality. Thereby, adjacent peaks are merged to form a few distinct water pools. To constrain T2 distributions, an l2-norm regularization term is incorporated into Eq.6, which minimizes the system’s “energy” to produce smooth, continuous T2 distributions [42]. By increasing the strength of the regularization, the central IEW peak becomes broader, while the short fraction size reduces. Other regularization, e.g. by the first or second order derivatives of the spectrum, may also be used [16,63].
Without regularization, reflects the misfit in each data point by about one standard deviation [16]. With regularization, will be adjusted incrementally to increase χ2, since it is typically too low, fitting data too accurately [16,42]. The ratio of χ2 over has commonly been set to 1.02 [60]. The misfit depends on the acquisition type and noise level.
To test the robustness of NNLS with respect to noise and the influence of regularization, both the decay curve SNR and the ratio were varied. White Gaussian noise was independently added to each of the computed decay curves (Matlab’s awgn function). Thereby, the power of the input signal was assessed before noise was added, yielding prescribed decay curve SNR levels of 30, 50, 100, 150, 200, 250, 300, 500 and 1000. The noisy data was in some cases compared to decay curves without added noise, i.e. no specific SNR level was generated. The regularization parameter was varied to yield of 1.02, 1.05, 1.1, 1.15, 1.25, i.e. allowing misfits of 2 – 25%.
The estimated geometric mean (gm) T2 of the IEW and MW peak and the obtained MWF were compared to the ground truth to assess the accuracy of the NNLS solutions with EPG.
In vivo data
To relate the simulations to in vivo imaging, MWI data from one female healthy volunteer (age 23 years) was collected. The volunteer gave written informed consent. One 32 echo GraSE T2 data set was acquired at a 3T Philips Achieva (Best, The Netherlands) with an eight-channel head coil, using TE/ΔTE/TR=10/10/1000ms [57]. The data were processed with the regularized NNLS, as described for the simulated data, using T2,min = 15 ms, nT2 = 40 and , typical for in vivo multi-component T2 analysis [57,58].
Results
Extended Phase Graph – Flip Angle estimation
The reliability of the FA estimation by EPG depends predominantly on SNR. Figure 2 compares the FA estimates for the different acquisition types and decay curve noise levels. Despite greater variation from the true FA at lower SNR, estimated and true FAs still correlated linearly at all SNR levels (p < 0.023, r > 0.873). In the 32-echo simulation, the average slope was 0.836 ± 0.140 for all SNR levels. For 48-echoes and 64-echoes, FA estimates improved and the average slopes of the regression lines were 0.996 ± 0.060 and 1.022 ± 0.066, respectively. Note that EPG estimates B1+ inhomogeneities relative to the assumed refocusing angle of 180°, so that larger FAs, e.g. 190°, are displayed as 170°.
Table 1 summarizes the SNR-dependent root-mean squared error (RMSE) of the FA estimates across different acquisition types and the three MWF models (M1, M2, M3). The three acquisition types are referred to by their ETL: 32e, 48e and 64e indicating acquisitions of 32 echoes with 10 ms ΔTE; 48 echoes with 8 ms ΔTE and 64 echoes with 6 ms ΔTE, respectively. With improving decay curve SNR, FA estimates became more reliable. Similar RMSE estimates indicated little difference between acquisitions using different ΔTE. Across all noise levels, models and acquisition types, the estimated FAs were 127.8 ± 3.6°, 138.4 ± 3.7°, 148.6 ± 3.9°, 158.9 ± 3.2°, 168.5 ± 3.1°, 173.5 ± 4.2° relative to the true FAs of 130°, 140°, 150°, 160°, 170° and 180°. The apparent difficulty in capturing FA = 180° (see also Fig. 2) is due to the FA limit of 180° with EPG. Small systematic deviations from 180°, however, are of little consequence for the EPG signal decay.
Given that only a limited number of T2i times is used to describe the T2 decay, some variance is expected with respect to nT2 and the choice of T2,min. For a particular acquisition type and model (M2, 48e, MW T2 =10 ms, SNR = 500), changing nT2 yielded on average variations of less than 1° across all FAs. Changing the MW T2 led to arbitrary, nonsystematic variations in the FA estimation. On the other hand, changing T2,min from the lowest to highest (3 ms to 15 ms) resulted in variations of 1° – 4° from the true FA with a tendency towards higher FAs for longer T2,min. Regularization played no role in the FA estimation.
Decay curve SNR and B1+ inhomogeneities: Comparison of simulated and in vivo data
The SNR of the decay curves affects the FA estimation, with low SNR also resulting in significant residuals. Importantly, while lower FAs produce systematically lower signal at the first echo by inefficient re-focusing, a signal increase due to positive noise at the first echo could lead to an overestimation of the MWF. Together B1+ inhomogeneity and the decay curve SNR contribute to the estimated SNR (SNRest) that is obtained from the NNLS fit.
The surface plot in Figure 3A exemplifies the influence of these two parameters on SNRest for M2, 32e. In vivo data acquired at 3 T are depicted for comparison in panel B.
In vivo data yielded SNRest values > 180 and FAs of > 150 for most WM. By comparison, SNRest of the simulation at FAs around 180° and signal SNR = 1000 was just over 100. Thus, particularly in WM, the decay curve SNR at respective FAs in vivo appears to be sufficient for T2 estimation, and the simulated data represent possibly worse signal SNR scenarios than encountered in vivo at 3 T. This is further supported by the direct comparison of the decay curves (Figure 4). At comparable FAs, the decay curves displayed similar stimulated echo and noise patterns for high SNR values. Thus, all subsequent figures and computations will evaluate the decay curves with a simulated decay curve SNR = 1000.
Relationship between T2,min and MW T2 time
To understand the relationship between the choice of T2,min and the MW T2 time, it is most informative to look at the regularized T2 distributions (Figure 5). Independent of T2,min, FA, noise level or MW T2, the IEW peak was centered at the true IEW T2 time. Nonetheless, the gmT2 deviated slightly from the true value (IEW gmT2 time: panel A 66.4 – 67.5 ms, B 69.3 – 70.2 ms, C 71.0 – 71.4 ms), likely introduced by the limited data sampling of the T2 distributions. For instance, with nT2 = 40, 4 – 7 data points characterized the IEW peak. Choosing larger nT 2may provide more stable estimates of the IEW gmT2. But, the gain in better estimating the IEW gmT2, relative to the choice of nT2, is small (about 1ms) and computational time must be weighed against minimally improved accuracy.
Due to its rapid decay and lesser signal, the MW compartment is more sensitive to noise than the IEW component. Hence, the MW pool was typically represented by only one or two elevated points rising towards T2,min, rather than a peak (panels A – D), indicating that our acquisitions and T2 distributions have insufficient range to fully describe the MW signal. On the contrary, T2 distributions representing decay curves with no added Gaussian noise (E & F), explicitly captured the influence of T2,min on the MW peak. In panel F, when T2,min > MW T2, the MW “peak” was a slope, characterized by a single T2i time, pushing towards the lowest allowed T2, i.e. T2,min. Once T2,min < MW T2, small peaks formed that fully depicted the short T2 signal and the MW gmT2 resembled the true MW T2. For MW T2 = 10 ms (panel F), the estimated MW gmT2 was 15, 13, 10, 9.5, 9.4 ms for T2,min of 15, 13, 10, 8 and 5 ms. In the presence of added noise (panels A – D), both T2 peaks became wider, which made it particularly difficult to capture the short T2 component fully.
It needs to be determined how strongly the MWF assessment depends on the selection of T2,min and the estimation of the MW gmT2. Furthermore, it needs to be ascertained how short T2,min can be selected relative to the shortest acquisition TE (e.g. 6, 8 or 10 ms) to recover the true MWF.
Estimating the geometric mean T2 of the intermediate water – IEW – compartment In healthy WM, the intermediate T2 is assigned to water in the axonal lumen, cell bodies and interstitium. The IEW peak is relatively broad, with a gmT2 of approximately 68 – 72 ms at 3 T [59]. Therefore, varying nT2 hardly influenced the determination of IEW gmT2 (variations approximately 1 ms). Increasing regularization progressively reduced the IEW gmT2 from the true time of 70 ms. Lower FAs did not systematically underestimate IEW gmT2. Differences in T2,min changed IEW gmT2 by less than 3 ms. When the IEW gmT2 was not stable with respect to T2,min, it approached the true value when the T2,min was closer to the MW T2 (Supplementary Table 1). If T2,min was longer than the MW T2, IEW gmT2 increased above the true value, in line with the observations from Fig. 5C. In turn, for long MW T2 (20 ms, Fig.5A), the IEW gmT2 was underestimated. The amount of MW did not affect the IEW gmT2 estimation.
Across all FAs, T2,min and MW T2 times, we obtained 66.2 ± 2.2 ms, 66.7 ± 2.1 ms and 66.4 ± 2.2 ms for M1 (32/48/64e,respectively), 66.7 ± 1.6 ms, 66.8 ± 1.6 ms and 66.8 ± 1.6 ms for M2, and 67.6 ± 1.5 ms, 67.7 ± 1.6 and 67.6 ± 1.5 ms for M3, yielding an approximate variation of 3 – 4 ms from the true value . Without the addition of noise to the decay data, IEW gmT2 times were closer to the ground truth (less than 2 ms difference) and demonstrated a dependency on FA. For M1, 32e the obtained IEW gmT2 was 69.5±0.8 ms at 180°, 69.2 ± 0.9 ms at 170°, 68.4 ± 1.4 ms at 150° and 68.4 ± 1.5 ms at 130°.
Estimating the geometric mean T2 of the MW peak and the MWF
When comparing the true and estimated MW gmT2 at different FAs (M2 - 48e), the three MW T2’s of 20, 13 and 8 ms were only accurately reproduced at relatively high FAs and T2,min < MW T2. Once T2,min < MW T2, the MW gmT2 reached a plateau and the solution remained stable in most cases (Figure 6). However, the plateau gmT2 time was typically shorter than the true MW T2, particularly at lower FAs. If TT2,min was very short, MW gmT2 tended to decline further. Moreover, it can be difficult to reach a stable solution, if MW T2 is short, e.g. MW T2 = 8 ms. In the absence of added noise (lower right panel) a similarly plateauing behavior was observed, with some plateau values closer to the true T2 time than in the SNR = 1000 case. Again, the dependence on FA was less pronounced when noise was added to the decay curves. Without added noise, the median MW gmT2 for the 20 ms MW T2 was 19.5, 19.4, 17.5 and 14.6 ms at 180°, 170°, 160°, 150°. For 13 ms MW T2, we obtained 12.7, 12.9, 11.8 and 11.7 ms at the decreasing FAs and 9, 9, 9 and 9 ms relative to the 8 ms true MW T2 for the same M2, 48e acquisition.
Variations with nT2 were on average below 1 ms, with nT2 = 20 T2 times yielding the largest offsets from the other nT2 estimates. Larger nT2 tended to increase the MW gmT2 minimally. Stronger regularization led in some cases to much longer MW gmT2, presumably a result of coalescence of the two water peaks. Correspondingly, with increasing regularization, the MWF tended to decrease initially, and then increase as the peaks merged and the consolidated peak extended into the MW T2 window. Table 2 compares the estimated MW gmT2 time (left) in the three models and for the three different acquisition approaches relative to the selected T2,min, and contrasts gmT2 estimates with the obtained MWF values (right).
MW gmT2 estimates improved in many cases for acquisitions with shorter echo spacing. Particularly the 48e, 8 ms echo spacing acquisition showed stable estimates, close to the true MW T2 time of 20 ms. Notably, even in the presence of unstable MW gmT2 estimates, MWFs were consistent across T2,min, albeit underestimated for the 20 ms MW T2. Figure 7 summarizes the relationship between MWF, MW T2 and selection of T2,min. Data for 48e and 64e are shown; 32e data are provided in Supplementary Figure 1. The top row displays the mean behavior in estimated MWF across MW T2 times of 15, 13, 10 and 8 ms relative to T2,min, since typically the true T2 time of the MW compartment is unknown. The bottom row illustrates at the relationship of MWF and individual MW T2 after selection of T2,min.
Average MWF estimates were relatively stable for T2,min > TE1, although often underestimated (top row). In most cases, the estimated MWF was closest to the true value if T2,min was just below or on the order of TE1. However, the exact relationship depended on the true MW T2. Selecting T2,min according to TE1, i.e. 8 ms for 10 ms TE, 6 ms for 8 ms TE, and 4 ms for 6 ms TE, we obtained 14.63 ± 6.61%, 17.66 ± 5.57% and 17.79 ± 7.28% for 32e, 48e and 64e, across all possible MW T2 times (bottom row, filled circles). These estimates were lower than the true MWF of 21% due to the inclusion of MW T2 times shorter than T2,min. Excluding data for which MW T2 < T2,min, the average MWFs were 17.00 ± 5.82%, 19.31 ± 4.17% and 20.01 ± 3.95% for 32e, 48e and 64e, respectively, for M1, 12.19 ± 5.46%, 12.32 ± 5.00% and 9.32 ± 2.71% for M2 (true fraction 11.6%) and 3.76 ± 3.07%, 5.23 ± 3.92% and 2.45 ± 2.94% for M3 (true fraction 5.1%), at appropriately chosen T2,min. Shorter echo spacing improved the consistency of the MWF estimation across MW T2 times, as reflected in the smaller standard deviations. Regardless, MWFs differed insignificantly between acquisition schemes, if T2,min was chosen as described. Overall, the true MWF was underestimated by 0.13-4 percentage points (at 170°, SNR = 1000, ). In the absence of added noise (filled stars and fitted line), MWF values demonstrated an even higher degree of similarity, deviating from the true value by 0.02 - 0.64 percentage points. Specifically, we obtained 20.36 ± 0.31% for 32e, 21.02 ± 0.90% for 48e and 20.26 ± 0.69% for 64e, close to the true value of 21% MWF. For M2, we found 11.23 0 ±.24%, 11.73 ± 0.49% and 11.00 ± 0.60% and for M3 4.62 0 ±.17%, 4.97 0 ±.14% and 4.62 ± 0.46%, respectively, for 32e, 48e and 64e at appropriately chosen T2,min.
Discussion
Importance of B1+ homogeneity and SNR
The EPG algorithm has much enhanced our ability to accurately interpret T2 decay data in the presence of stimulated echoes [45]. Yet, even when modeling spurious echoes in the T2 decay, imperfect refocusing pulses yield reduced signal amplitudes at early odd echoes, which leads to slight, systematic underestimations in gmT2 time and MWF. Notably, underestimations in MWF and IEW gmT2 also occur in the presence of noise44. Thus, while the influence of sub-optimal refocusing FAs and increased noise are clearly characterized individually, their combined presence leads to less systematic, and therefore less interpretable effects. Although noise affects predominately the residuals of the later echoes, noise in earlier data points may be mistakenly interpreted as stimulated echo artifact and vice versa.
Here, we used small-scale constraints, which increased χ2 to raise accuracy in the NNLS component estimation. Graham et al. showed that least squares-based constraints, as compared to χ2, may be advantageous at intermediate to low SNR in obtaining acceptable accuracy in determining multiple T2 components [42]. In early years of multi-echo T2 analysis, acquisition parameters of TE = 10 ms, 32 echoes and voxel SNR of 100, as determined at TE = 0 ms and the standard deviation of the noise, did not match the optimal TE of 6.4 ms and SNR of 700, then quoted to be required to resolve multiple T2 components by NNLS [42]. As shown in Figure 3, these limits may now be regarded as too strict. With advancing MR hardware and newer acquisitions, we are coming closer to achieving optimal data collection, and traditional restrictions for multi-component T2 analyses may need to be revised. Nevertheless, regularization should be used conservatively, allowing data misfit of only a few percent, typically 1 or 2 % [64] of that of the unregularized χ2, as stronger Tikhonov regularization quickly affects both MWF and IEW T2 measurements. Increasing noise by itself will also lead to widening of the T2 peaks (Fig.5), reflecting the uncertainty in the T2 estimation.
Regardless of the noise level, EPG captured the true FAs on average within an accuracy of 1 – 4°. Errors tended to be larger at lower FAs and lower decay curve SNRs. FAs at 180° appeared to be less accurately estimated. This is in part due to the EPG estimation limit of FAs < 180°, so that FAs will always be underestimated at this limit. Moreover, the trigonometric relationship between signal loss and FA plateaus at 180°, making the identification of the correct FA more challenging. Stimulated artifacts are, however, very small at FAs between 165° and 180°.
Although it is important to compute the actual FAs and characterize stimulated echo artifacts in the T2 decay curves, EPG adds to the computational post-processing time. Replacing the FA estimation by EPG with acquired FA maps could lessen the required processing time. FA maps are typically acquired at a lower spatial resolution than multi-echo T2 data, as B1+ is assumed to be spatially slowly varying. Using spatial constraints on T2 distributions and FAs across anatomical and pathological structures, as recently described [65], has the potential to improve noise robustness for multi-component T2-mapping.
We want to emphasize again that EPG does not reverse signal loss due to low FAs or the resulting underestimation of the MWF. Thus, improving B1+ homogeneity, such that refocusing FAs are near 180°, should be considered a priority. Greater B1+ homogeneity can be achieved by dielectric pads or parallel excitation by multi-element transceiver arrays. At 3 T, underestimations in MWF may be considered less significant than at higher field strengths, where gr eater B1+ inhomogeneity is likely to impose larger spatial variations in MWF. However, as the combination of FA and noise, i.e. the SNRest, will be the determining factor for the MWF estimation, systematic underestimations in MWF measurements may be alleviated as increased SNR may make up for greater B1+ variations at higher field strength. Even more, SNR profiles of multi-channel receive coils, which provide greatest SNR near the edge of the head, oppose the typical B1+ profile. Thus, these coils may avoid the discussed underestimations. Further work is needed to explore B1+ penalties as well as acquisition strategies at 7 T that can appropriately capture the shorter T2 decay of MW within specific absorption rate (SAR) limits.
Regularization
Although regularization is useful for resolving multi-components of the T2 distribution in the presence of noise, it comes at the cost of underestimating the MWF. We also noted that the IEW gmT2 time shortened with increasing regularization, albeit underestimations were less than 1 ms from the true value44. χ2 constraints should only be applied over a narrow range that ensures accurate detection of multiple T2 components [42]. Increasing the “energy” constraint, i.e. enforcing greater smoothing by Tikhonov regularization, widens and subsequently merges different water peaks, leading to shorter IEW gmT2 and reduced apparent MWF. As the coalesced peak becomes even wider, it can partly extend into the a priori defined short T2 interval, increasing measured MWF. Other regularization approaches have focused on data denoising [66] or used spatial constraints for handling of noisy data [65,67–69]. Although these methodologies yield visually appealing results, the impact of regularization should be carefully evaluated. As discussed, noise cannot always be distinguished from decay curve alterations due to stimulated echoes. Moreover, voxels of relatively low myelin content may appear noisy and residual myelin signal may be falsely removed. Spatial regularization may also have limited success for smaller tracts, where partial volume related underestimations in the MWF may propagate into the region-of-interest.
Other methodologies for mapping of myelin water
Different myelin water imaging techniques have been reviewed by Alonso-Ortiz et al.[70]. Multi-echo gradient-echo imaging has gained popularity for multi-compartmental fitting of T2* decay data [71,72]. Despite the simpler and faster acquisition and lower SAR, increased sensitivity to static field inhomogeneities [73] and physiological noise [74] have made it challenging to robustly obtain T2* myelin maps comparable to those obtained using spin-echo based techniques [71,75]. Importantly, while NNLS17 enables determination of an unlimited and unknown number of T2 components [16], most T2*-based techniques assume a fixed number of water compartments, usually two or three [72,76–78]. Currently, no NNLS algorithm exists that can separate the T2* magnitude and frequency offsets that need to be modelled for an unknown number of signal components. A priori imposition of the number of water pools may hamper the use of these techniques and other model-based approaches [79] in pathological cases, where changes in T2 and additional water pools have been reported [23,36,80]. In this work, only the multi-component T2 acquisition and NNLS analysis were considered, although other acquisition types for myelin water imaging have been proposed [81–83] and different analysis techniques may be employed [84,85]. To date, most of these approaches lack thorough histopathological validation and some are known to depend on the presence of magnetization exchange [86,87].
T2,min, MW T2 and MWF
Using NNLS, MW T2 may be inaccurately estimated in the presence of noise as the MW gmT2 computation depends on T2,min and acquisition echo spacing. Typically, MW gmT2 will reach a stable solution if T2,min ≤ MW T2, i.e. when the short T2 peak is fully captured. Note that the MW T2 is currently not regarded as having clinical relevance. Our observation of stable MWF even in the presence of changing MW gmT2 (Tab. 2) further supports the use of MWF over MW gmT2. All dependencies are summarized in Table 3.
We observed that the MW peak was best captured if T2,min was somewhat shorter than TE1 and shorter or equal to the MW T2 in order to reproduce the true MWF. Choosing T2,min much lower than TE1 resulted in artifactually high MWF values due to noise in the first data points. Interestingly, longer MW T2 time did not necessarily result in more accurate gmT2 and MWF values, despite both T2,min and the acquisition TE1 being sufficiently short to capture the MW signal. Previous work suggested that longer MW T2 may require a longer first TE to obtain optimal SNR for the description of the water pools [41]. This is in line with our observations, suggesting that the optimal TE for MWI may need to be close to the MW T2.
Hardware restrictions make it difficult to accurately determine very short T2 times as echo spacing is limited in current acquisitions. Because the in vivo MW T2 is unknown, further work is needed to determine the relevance of ΔTE in accurately estimating the MWF. The increased number of echoes necessary with shorter ΔTE will require longer TR to comply with SAR restrictions, possible posing a limit on measurement time. Undersampling approaches for spatial encoding, such as compressed sensing [88,89], may help to offset such limits.
Finally, one fixed set of T2 boundaries and parameters is typically chosen, which is believed to characterize the IEW and MW pools on average. Note that we did not discuss the T2 cut-off time here that separates thesewater pools. This parameter was well established in our simulations, but typically needs to be determined from the T2 distributions. As shown, single voxel MWF computations in vivo are challenging and should be approached with caution as noise and natural variations in myelin across the WM affect the MWF computation. Voxel-based average MWF measures are most commonly performed. They provide better reproducibility than region-based anal-yses, which would yield only a single regional MWF [44,60]. Across different MW T2 times, in the presence of noise and due to imperfect refocusing FAs, we found that the true MWF can be estimated at 0.13 – 4 percentage point deviation from the true value. Our simulations suggest that in vivo MWF measurements are reproducible [90], albeit underestimated. Reducing T2,min below the first TE appears to reasonably recover some signal lost due to regularization and FA imperfections. For comparison of data from different sites and different acquisition protocols, average SNRest and or FAs may need to be reported to assess MWF differences.
Limitations
The translation of our observations to in vivo scenarios is skewed by our limited knowledge of the true T2 distribution. We attempted to cover a wide range of acquisition and analysis parameters, relevant to the estimation of MWF. We did not simulate non-equidistant refocusing pulse schemes, previously proposed to better capture long T2 components [43], and also limited our simulations to pure spin-echo T2 decays, rather than the gradient and spin-echo acquisition currently used [58]. Non-uniform echo spacing implies a non-linear increase in refocusing pathways, greatly prolonging the computation of all magnetization states with EPG [7]. It is thus not commonly used. Computational time for EPG remains manageable if the echo spacing is changed by multiples of TE. Only Gaussian noise was investigated, as other noise patterns, such as Rician noise, have been described to affect MWF measurements similarly [44]. We chose Gaussian noise, because it is predominantly present in the earlier echoes that capture the rapid signal decay of the MW compartment. The present Bloch simulation ignored effects such as diffusion or exchange [91,92]. In the presence of exchange, an MWF decrease may be associated with a decrease in the IEW gmT2 [93]. Such exchange-associated shift in IEW gmT2 would be larger than what we observed here by altering acquisition and post-processing parameters [94]. We did not cover scenarios with MWF close to zero, as might be the case in neonates, completely demyelinated MS plaques or gray matter [95]. The created MWF scenarios only differed in the number of myelin bilayers, but kept the thickness or spacing of individual sheaths unchanged. Such changes, however, might occur due to swelling or vacuolization. This simplified model geometry created realistic water pool fractions and permits incorporating the above described parameters in the future. Although the pool fractions were the determining factor in this work, less so the actual geometry and distribution of the myelinated axons, the chosen setup allowed us to compute field inhomogeneities [96] and assign different compartmental resonance frequencies, which have been described in the literature. However, without modeling of diffusion, central k-space echoes will only minimally be affected. Finally, our simulations allowed generation of decay curves with stimulated echo artifacts independent of the EPG, which was used to analyze the decay curves.
Conclusions
The selection of shorter echo spacing and an appropriate T2,min improved the estimation of the amount of short T2 signal. We showed by realistic simulations that voxel-based averages were able to estimate the true MWF within 0.13 – 4 percentage points, with increasing accuracy if T2,min was selected slightly shorter than the first echo time. The number of T2 times used to compute the T2 distributions hardly affected the results. By contrast, increasing regularization should be applied with caution. Reporting the estimated SNR and average FA for specific regions of interest may help researchers to assess the systematic underestimation in MWF incurred due to the dependence of the T2 signal decay on noise and B1+ inhomogeneities. Advances in MR hardware may overcome current limitations in multi-echo T2 acquisition schemes to map the MWF more accurately.
ACKNOWLEDGMENTS
This work was funded by the Natural Sciences and Engineering Research Council of Canada and the National Multiple Sclerosis Society. AR is funded by Canada Research Chairs.
Footnotes
The authors have no conflicts of interest to declare.
References
- [1].↵
- [2].↵
- [3].↵
- [4].↵
- [5].↵
- [6].↵
- [7].↵
- [8].↵
- [9].↵
- [10].↵
- [11].↵
- [12].↵
- [13].↵
- [14].
- [15].↵
- [16].↵
- [17].↵
- [18].↵
- [19].↵
- [20].↵
- [21].↵
- [22].↵
- [23].↵
- [24].↵
- [25].↵
- [26].↵
- [27].↵
- [28].↵
- [29].↵
- [30].↵
- [31].
- [32].↵
- [33].↵
- [34].↵
- [35].↵
- [36].↵
- [37].↵
- [38].↵
- [39].
- [40].↵
- [41].↵
- [42].↵
- [43].↵
- [44].↵
- [45].↵
- [46].↵
- [47].↵
- [48].↵
- [49].↵
- [50].↵
- [51].↵
- [52].↵
- [53].↵
- [54].↵
- [55].↵
- [56].↵
- [57].↵
- [58].↵
- [59].↵
- [60].↵
- [61].↵
- [62].↵
- [63].↵
- [64].↵
- [65].↵
- [66].↵
- [67].↵
- [68].
- [69].↵
- [70].↵
- [71].↵
- [72].↵
- [73].↵
- [74].↵
- [75].↵
- [76].↵
- [77].
- [78].↵
- [79].↵
- [80].↵
- [81].↵
- [82].
- [83].↵
- [84].↵
- [85].↵
- [86].↵
- [87].↵
- [88].↵
- [89].↵
- [90].↵
- [91].↵
- [92].↵
- [93].↵
- [94].↵
- [95].↵
- [96].↵