
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe, Christian Szegedy
00
2015-02-11
optimizationtraining
Abstract
This paper introduces and evaluates the idea described in “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift”, and reports empirical results that helped shape subsequent work in optimization, training.