Σύγκριση μεθόδων
Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.
| ResNeXt× | DenseNet× | MobileNet: Αποδοτικά Συνελικτικά Νευρωνικά Δίκτυα για Όραση σε Κινητές Συσκευές× | ResNet (Δίκτυο Κατάλοιπων)× | |
|---|---|---|---|---|
| Πεδίο | Βαθιά Μάθηση | Βαθιά Μάθηση | Βαθιά Μάθηση | Βαθιά Μάθηση |
| Οικογένεια | Machine learning | Machine learning | Machine learning | Machine learning |
| Έτος προέλευσης≠ | 2017 | 2017 | 2017 | 2016 |
| Δημιουργός≠ | Xie, S.; Girshick, R.; Dollár, P.; Tu, Z.; He, K. | Huang, G.; Liu, Z.; van der Maaten, L.; Weinberger, K. Q. | Andrew Howard et al. (Google) | He, K.; Zhang, X.; Ren, S.; Sun, J. |
| Τύπος≠ | Convolutional neural network with grouped/cardinality-based residual blocks | Dense convolutional neural network (feed-forward dense connectivity) | Lightweight CNN architecture | Deep Convolutional Neural Network with skip connections |
| Θεμελιώδης πηγή≠ | Xie, S., Girshick, R., Dollár, P., Tu, Z., & He, K. (2017). Aggregated Residual Transformations for Deep Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5987–5995. DOI ↗ | Huang, G., Liu, Z., van der Maaten, L., & Weinberger, K. Q. (2017). Densely Connected Convolutional Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 4700–4708. DOI ↗ | Howard, A. G., et al. (2017). MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint. link ↗ | He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. DOI ↗ |
| Εναλλακτικές ονομασίες≠ | ResNeXt, Aggregated Residual Transformations, grouped convolution residual network, cardinality-based ResNet | DenseNet, Dense Convolutional Network, densely connected CNN, DenseNet-121 | MobileNets, Depthwise Separable CNN, Efficient Mobile Vision Network, Mobil Evrişimli Sinir Ağı | ResNet, Residual Network, Deep Residual Learning, ResNet-50 |
| Συναφείς≠ | 4 | 2 | 2 | 4 |
| Σύνοψη≠ | ResNeXt is a deep convolutional neural network architecture introduced by Xie, Girshick, Dollár, Tu, and He at CVPR 2017. It extends the residual network (ResNet) design by introducing a new architectural dimension called cardinality — the number of independent, parallel transformation paths within each residual block — enabling higher accuracy with fewer parameters and a simpler, more uniform design than its predecessors. | DenseNet (Densely Connected Convolutional Network), introduced by Huang, Liu, van der Maaten, and Weinberger at CVPR 2017 (Best Paper Award), connects every layer to every subsequent layer within a dense block so that each layer receives the concatenated feature maps of all preceding layers — maximising feature reuse, strengthening gradient flow, and achieving competitive accuracy with substantially fewer parameters than comparable architectures such as ResNet. | MobileNet is a family of lightweight convolutional neural network architectures introduced by Howard et al. at Google in 2017. It is designed to run image classification, object detection, and other vision tasks directly on mobile devices and embedded systems with limited computational budgets. By replacing standard convolutions with depthwise separable convolutions and exposing two global hyperparameters, MobileNet dramatically reduces multiply-add operations and model size while retaining competitive accuracy. | ResNet (Residual Network) is a deep convolutional neural network architecture introduced by Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun at CVPR 2016. By inserting shortcut (skip) connections that carry the input of a block directly to its output — defining the block's task as learning a residual correction rather than a full mapping — ResNet enabled training of networks with hundreds or even thousands of layers without the vanishing-gradient degradation that had previously made very deep networks impractical. It won the ILSVRC 2015 image recognition competition with a top-5 error of 3.57% and remains the most widely used backbone architecture in computer vision. |
| ScholarGateΣύνολο δεδομένων ↗ |
|
|
|
|