방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| Inception Network (GoogLeNet)× | ResNet (Residual Network)× | |
|---|---|---|
| 분야 | 딥러닝 | 딥러닝 |
| 계열 | Machine learning | Machine learning |
| 기원 연도≠ | 2015 | 2016 |
| 창시자≠ | Christian Szegedy et al. (Google) | He, K.; Zhang, X.; Ren, S.; Sun, J. |
| 유형≠ | Deep CNN with parallel multi-scale convolutions | Deep Convolutional Neural Network with skip connections |
| 원전≠ | Szegedy, C., et al. (2015). Going deeper with convolutions. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1–9. DOI ↗ | He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. DOI ↗ |
| 별칭≠ | GoogLeNet, Inception v1, Deep Convolutional Neural Network (Google), Başlangıç Ağı | ResNet, Residual Network, Deep Residual Learning, ResNet-50 |
| 관련≠ | 2 | 4 |
| 요약≠ | The Inception Network, introduced by Szegedy et al. at Google in 2015 and submitted to CVPR under the name GoogLeNet, is a 22-layer deep convolutional neural network designed for large-scale image recognition. Its defining contribution is the Inception module, which applies convolutions of multiple kernel sizes in parallel and concatenates their outputs, enabling the network to capture spatial features at different scales simultaneously without a proportional increase in computational cost. | ResNet (Residual Network) is a deep convolutional neural network architecture introduced by Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun at CVPR 2016. By inserting shortcut (skip) connections that carry the input of a block directly to its output — defining the block's task as learning a residual correction rather than a full mapping — ResNet enabled training of networks with hundreds or even thousands of layers without the vanishing-gradient degradation that had previously made very deep networks impractical. It won the ILSVRC 2015 image recognition competition with a top-5 error of 3.57% and remains the most widely used backbone architecture in computer vision. |
| ScholarGate데이터셋 ↗ |
|
|