Bandingkan kaedah
Semak kaedah pilihan anda secara bersebelahan; baris yang berbeza akan diserlahkan.
| ResNet (Residual Network)× | AlexNet× | DenseNet× | Inception Network (GoogLeNet)× | |
|---|---|---|---|---|
| Bidang | Pembelajaran Mendalam | Pembelajaran Mendalam | Pembelajaran Mendalam | Pembelajaran Mendalam |
| Keluarga | Machine learning | Machine learning | Machine learning | Machine learning |
| Tahun asal≠ | 2016 | 2012 | 2017 | 2015 |
| Pengasas≠ | He, K.; Zhang, X.; Ren, S.; Sun, J. | Krizhevsky, A.; Sutskever, I.; Hinton, G. E. | Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K. Q. | Christian Szegedy et al. (Google) |
| Jenis≠ | Deep Convolutional Neural Network with skip connections | Deep Convolutional Neural Network (CNN) | Dense convolutional neural network (feed-forward dense connectivity) | Deep CNN with parallel multi-scale convolutions |
| Sumber perintis≠ | He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. DOI ↗ | Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet Classification with Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems, 25, 1097–1105. (Republished: Communications of the ACM, 60(6), 84–90, 2017.) DOI ↗ | Huang, G., Liu, Z., van der Maaten, L., & Weinberger, K. Q. (2017). Densely Connected Convolutional Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4700–4708. DOI ↗ | Szegedy, C., et al. (2015). Going deeper with convolutions. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1–9. DOI ↗ |
| Alias≠ | ResNet, Residual Network, Deep Residual Learning, ResNet-50 | AlexNet, Krizhevsky net, SuperVision CNN, ImageNet CNN 2012 | DenseNet, Dense Convolutional Network, densely connected CNN, DenseNet-121 | GoogLeNet, Inception v1, Deep Convolutional Neural Network (Google), Başlangıç Ağı |
| Berkaitan≠ | 4 | 3 | 2 | 2 |
| Ringkasan≠ | ResNet (Residual Network) is a deep convolutional neural network architecture introduced by Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun at CVPR 2016. By inserting shortcut (skip) connections that carry the input of a block directly to its output — defining the block's task as learning a residual correction rather than a full mapping — ResNet enabled training of networks with hundreds or even thousands of layers without the vanishing-gradient degradation that had previously made very deep networks impractical. It won the ILSVRC 2015 image recognition competition with a top-5 error of 3.57% and remains the most widely used backbone architecture in computer vision. | AlexNet is a deep convolutional neural network (CNN) introduced by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton in 2012. It won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC 2012) with a top-5 error rate of 15.3%, outstripping the runner-up by more than 10 percentage points and reigniting broad interest in deep learning. The architecture introduced or popularised several techniques — ReLU activations, dropout regularisation, and multi-GPU training — that became standard practice across the field. | DenseNet (Densely Connected Convolutional Network), introduced by Huang, Liu, van der Maaten, and Weinberger at CVPR 2017 (Best Paper Award), connects every layer to every subsequent layer within a dense block so that each layer receives the concatenated feature maps of all preceding layers — maximising feature reuse, strengthening gradient flow, and achieving competitive accuracy with substantially fewer parameters than comparable architectures such as ResNet. | The Inception Network, introduced by Szegedy et al. at Google in 2015 and submitted to CVPR under the name GoogLeNet, is a 22-layer deep convolutional neural network designed for large-scale image recognition. Its defining contribution is the Inception module, which applies convolutions of multiple kernel sizes in parallel and concatenates their outputs, enabling the network to capture spatial features at different scales simultaneously without a proportional increase in computational cost. |
| ScholarGateSet data ↗ |
|
|
|
|