Compare methods
View the selected methods side by side; rows that differ are marked with ≠.
| | AlexNet | Batch Normalization |
|---|---|---|
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year of origin ≠ | 2012 | 2015 |
| Authors ≠ | Krizhevsky, A.; Sutskever, I.; Hinton, G. E. | Ioffe, S.; Szegedy, C. |
| Type ≠ | Deep convolutional neural network (CNN) | Normalization technique (applied per mini-batch during training) |
| Primary source ≠ | Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems, 25, 1097–1105. (Republished: Communications of the ACM, 60(6), 84–90, 2017.) | Ioffe, S., & Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the 32nd International Conference on Machine Learning (ICML), PMLR 37, 448–456. |
| Other names | AlexNet, Krizhevsky net, SuperVision CNN, ImageNet CNN 2012 | BatchNorm, BN, batch norm, mini-batch normalization |
| Related methods ≠ | 3 | 1 |
| Summary ≠ | AlexNet is a deep convolutional neural network (CNN) introduced by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton in 2012. It won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC 2012) with a top-5 error rate of 15.3%, beating the runner-up by more than 10 percentage points and reigniting broad interest in deep learning. The architecture introduced or popularised several techniques (ReLU activations, dropout regularisation, and multi-GPU training) that became standard practice across the field. | Batch Normalization is a training technique introduced by Sergey Ioffe and Christian Szegedy in 2015 that normalizes the pre-activation outputs of each layer using the mean and variance computed over the current mini-batch. By stabilizing the input distribution to each layer throughout training, it was proposed as a way to reduce what the authors call internal covariate shift, enabling higher learning rates and making deep networks train faster and more reliably. |
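
The per-mini-batch normalization described in the Batch Normalization summary can be illustrated with a minimal sketch. It assumes a single feature, training-mode statistics only, and plain Python lists; the function name `batch_norm`, the example values, and the default `eps` are illustrative assumptions, not taken from the paper's code. The scale `gamma` and shift `beta` are learnable parameters in the original method and are held fixed here.

```python
import math


def batch_norm(batch, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize one feature across a mini-batch (hypothetical helper).

    batch: pre-activation values of a single feature, one per example.
    gamma, beta: scale and shift (learned in practice, fixed here).
    eps: small constant for numerical stability (assumed value).
    """
    n = len(batch)
    mean = sum(batch) / n
    # Biased (population) variance over the mini-batch.
    var = sum((x - mean) ** 2 for x in batch) / n
    # Standardize each value, then apply scale and shift.
    return [gamma * (x - mean) / math.sqrt(var + eps) + beta for x in batch]


# Illustrative mini-batch of four pre-activation values.
pre_activations = [2.0, 4.0, 6.0, 8.0]
normalized = batch_norm(pre_activations)
```

With `gamma=1` and `beta=0`, the output has approximately zero mean and unit variance over the mini-batch. At inference time, implementations typically replace the mini-batch statistics with running estimates collected during training, which this sketch omits.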