Machine learning
Batch Normalization
Batch Normalization is a training technique introduced by Sergey Ioffe and Christian Szegedy in 2015 that normalizes the pre-activation outputs of each layer using the mean and variance computed over the current mini-batch. By stabilizing the input distribution to each layer throughout training, it substantially reduces internal covariate shift, enabling the use of higher learning rates and making deep networks train faster and more reliably.
Open in MethodMindSoonVideoSoon
Read the full method
Members only
Sign inSign in with a free account to read this section.
Sources
- Ioffe, S. & Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the 32nd International Conference on Machine Learning (ICML), PMLR 37, 448–456. link ↗
- Goodfellow, I., Bengio, Y. & Courville, A. (2016). Deep Learning (Ch. 8). MIT Press. ISBN: 978-0-262-03561-3
- Ioffe, S. & Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv preprint arXiv:1502.03167. link ↗