เปรียบเทียบวิธี
ดูวิธีที่เลือกเทียบกันแบบเคียงข้าง แถวที่ต่างกันจะถูกเน้นไว้
| VGGNet (เครือข่ายประสาทคอนโวลูชันแบบลึกมาก)× | ResNet (เครือข่ายส่วนที่เหลือ)× | |
|---|---|---|
| สาขาวิชา | การเรียนรู้เชิงลึก | การเรียนรู้เชิงลึก |
| ตระกูล | Machine learning | Machine learning |
| ปีกำเนิด≠ | 2014 | 2016 |
| ผู้ริเริ่ม≠ | Simonyan, K. & Zisserman, A. (Visual Geometry Group, Oxford) | He, K.; Zhang, X.; Ren, S.; Sun, J. |
| ประเภท≠ | Deep Convolutional Neural Network (image classification) | Deep Convolutional Neural Network with skip connections |
| แหล่งต้นตำรับ≠ | Simonyan, K., & Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv:1409.1556 [cs.CV]. Published at ICLR 2015. DOI ↗ | He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. DOI ↗ |
| ชื่อเรียกอื่น≠ | VGG, VGG-16, VGG-19, Very Deep ConvNet | ResNet, Residual Network, Deep Residual Learning, ResNet-50 |
| ที่เกี่ยวข้อง | 4 | 4 |
| สรุป≠ | VGGNet is a deep convolutional neural network architecture introduced by Karen Simonyan and Andrew Zisserman at the Visual Geometry Group, Oxford, in 2014 (published at ICLR 2015). It demonstrated that network depth — achieved exclusively through stacking small 3x3 convolutional filters — is the single most critical factor for high image-classification accuracy, and its two canonical variants (VGG-16 and VGG-19) became the dominant benchmark architectures for CNN design throughout the mid-2010s. | ResNet (Residual Network) is a deep convolutional neural network architecture introduced by Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun at CVPR 2016. By inserting shortcut (skip) connections that carry the input of a block directly to its output — defining the block's task as learning a residual correction rather than a full mapping — ResNet enabled training of networks with hundreds or even thousands of layers without the vanishing-gradient degradation that had previously made very deep networks impractical. It won the ILSVRC 2015 image recognition competition with a top-5 error of 3.57% and remains the most widely used backbone architecture in computer vision. |
| ScholarGateชุดข้อมูล ↗ |
|
|