Noise injection for training artificial neural networks: a comparison with weight decay and early stopping

Richard M Zur; Yulei Jiang; Lorenzo L Pesce; Karen Drukker

doi:10.1118/1.3213517

Noise injection for training artificial neural networks: a comparison with weight decay and early stopping

Med Phys. 2009 Oct;36(10):4810-8. doi: 10.1118/1.3213517.

Authors

Richard M Zur¹, Yulei Jiang, Lorenzo L Pesce, Karen Drukker

Affiliation

¹ Department of Radiology, The University of Chicago, 5841 South Maryland Avenue, MC2026, Chicago, Illinois 60637, USA. zur@uchicago.edu

Abstract

The purpose of this study was to investigate the effect of a noise injection method on the "overfitting" problem of artificial neural networks (ANNs) in two-class classification tasks. The authors compared ANNs trained with noise injection to ANNs trained with two other methods for avoiding overfitting: weight decay and early stopping. They also evaluated an automatic algorithm for selecting the magnitude of the noise injection. They performed simulation studies of an exclusive-or classification task with training datasets of 50, 100, and 200 cases (half normal and half abnormal) and an independent testing dataset of 2000 cases. They also compared the methods using a breast ultrasound dataset of 1126 cases. For simulated training datasets of 50 cases, the area under the receiver operating characteristic curve (AUC) was greater (by 0.03) when training with noise injection than when training without any regularization, and the improvement was greater than those from weight decay and early stopping (both of 0.02). For training datasets of 100 cases, noise injection and weight decay yielded similar increases in the AUC (0.02), whereas early stopping produced a smaller increase (0.01). For training datasets of 200 cases, the increases in the AUC were negligibly small for all methods (0.005). For the ultrasound dataset, noise injection had a greater average AUC than ANNs trained without regularization and a slightly greater average AUC than ANNs trained with weight decay. These results indicate that training ANNs with noise injection can reduce overfitting to a greater degree than early stopping and to a similar degree as weight decay.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Algorithms*
Computer Simulation
Decision Making*
Models, Statistical*
Neural Networks, Computer*

Abstract

Publication types

MeSH terms

Grants and funding