Sammenlign metoder

Gjennomgå de valgte metodene side om side; rader som avviker, er uthevet.

	YOLO (You Only Look Once)×	ResNet (Residual Network)×
Fagfelt	Dyp læring	Dyp læring
Familie	Machine learning	Machine learning
Opprinnelsesår	2016	2016
Opphavsperson≠	Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A.	He, K.; Zhang, X.; Ren, S.; Sun, J.
Type≠	Single-shot convolutional object detector	Deep Convolutional Neural Network with skip connections
Opprinnelig kilde≠	Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 779–788. DOI ↗	He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. DOI ↗
Alias≠	You Only Look Once, YOLO detector, YOLOv1, single-shot detector	ResNet, Residual Network, Deep Residual Learning, ResNet-50
Relaterte≠	1	4
Sammendrag≠	YOLO (You Only Look Once) is a single-shot, end-to-end convolutional object detector introduced by Redmon, Divvala, Girshick, and Farhadi at CVPR 2016. It reframes object detection as a single regression problem — predicting bounding box coordinates and class probabilities directly from an image in one forward pass — achieving real-time detection speeds that prior two-stage methods such as R-CNN could not match. The original paper spawned a widely adopted family of successors (YOLOv2 through v11) that continues to dominate applied object detection benchmarks.	ResNet (Residual Network) is a deep convolutional neural network architecture introduced by Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun at CVPR 2016. By inserting shortcut (skip) connections that carry the input of a block directly to its output — defining the block's task as learning a residual correction rather than a full mapping — ResNet enabled training of networks with hundreds or even thousands of layers without the vanishing-gradient degradation that had previously made very deep networks impractical. It won the ILSVRC 2015 image recognition competition with a top-5 error of 3.57% and remains the most widely used backbone architecture in computer vision.
ScholarGateDatasett ↗	v1 2 Kilder PUBLISHED	v1 3 Kilder PUBLISHED

Gå til søk → Download slides