Abstract
Large-scale neuroimaging datasets present unique challenges for automated processing pipelines. Mo-tivated by a large clinical trials dataset of Multiple Sclerosis with over 235,000 MRI scans, we consider the challenge of defacing - anonymisation to remove identifying facial features. The defacing process must undergo quality control (QC) checks to ensure that the facial features have been adequately anonymised and that the brain tissue is left completely intact. Visual QC checks – particularly on a project of this scale – are time-consuming and can cause delays in preparing data. We have developed a convolutional neural network (CNN) that can assist with the QC of the application of MRI defacing; our CNN is able to distinguish between scans that are correctly defaced, and can also classify defacing failures into three sub-types to facilitate parameter tuning during remedial re-defacing. Since integrating the CNN into our anonymisation pipeline, over 75,000 scans have been processed. Strict thresholds have been applied so that ambiguous classifications are referred for visual QC checks, however all scans still undergo a time-efficient verification check before being marked as passed. After applying the thresholds, our network is 92% accurate and is able to classify nearly half of the scans without the need for protracted manual checks. Even with the introduction of the verification checks, incorporation of the CNN has reduced the time spent undertaking QC checks by 42% during initial defacing, and by 35% overall. With the help of the CNN, we have been able to successfully deface 96% of the scans in the project whilst maintaining high QC standards. In a similarly sized new project of this scale, we would expect the model to reduce the time spent on manual QC checks by 125 hours. Our approach is applicable to other projects with the potential to greatly improve the efficiency of imaging anonymisation pipelines.
Competing Interest Statement
The authors have declared no competing interest.