مقایسهٔ روشها
روشهای انتخابی خود را کنار هم مرور کنید؛ ردیفهای متفاوت برجسته شدهاند.
| Mask R-CNN: تقسیمبندی نمونه با ماسکهای سطح پیکسل× | Faster R-CNN× | یونِت× | |
|---|---|---|---|
| حوزه | یادگیری عمیق | یادگیری عمیق | یادگیری عمیق |
| خانواده | Machine learning | Machine learning | Machine learning |
| سال پیدایش≠ | 2017 | 2015 | 2015 |
| پدیدآور≠ | Kaiming He et al. (FAIR) | Ren, S.; He, K.; Girshick, R.; Sun, J. (Microsoft Research) | Ronneberger, O., Fischer, P., & Brox, T. |
| نوع≠ | Instance segmentation deep neural network | Two-stage object detection CNN | Encoder-decoder convolutional network with skip connections |
| منبع بنیادین≠ | He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN. IEEE International Conference on Computer Vision (ICCV), 2980–2988. DOI ↗ | Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Advances in Neural Information Processing Systems (NeurIPS), 28, 91–99. link ↗ | Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. In N. Navab et al. (Eds.), Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, LNCS 9351 (pp. 234–241). Springer. DOI ↗ |
| نامهای دیگر≠ | Mask Region-based Convolutional Neural Network, Instance Segmentation R-CNN, He et al. 2017 Segmentation Model, Maske R-CNN | Faster RCNN, Faster-RCNN, RPN-based detector, two-stage object detector | U-Net, UNet, encoder-decoder with skip connections, fully convolutional segmentation network |
| مرتبط≠ | 2 | 2 | 3 |
| خلاصه≠ | Mask R-CNN is a deep learning framework for instance segmentation introduced by Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick at Facebook AI Research (FAIR) in 2017. It extends Faster R-CNN by adding a parallel branch that predicts a binary pixel-level mask for each detected object instance, enabling simultaneous object detection, classification, and fine-grained segmentation in a single forward pass. | Faster R-CNN is a two-stage deep convolutional object detection framework introduced by Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun (Microsoft Research) at NeurIPS 2015. It replaces the slow selective-search region proposal step used in its predecessors R-CNN and Fast R-CNN with a learned Region Proposal Network (RPN) that shares convolutional features with the detection head, enabling the first end-to-end trainable, near-real-time accurate object detector and establishing a long-standing accuracy benchmark on PASCAL VOC and MS COCO. | U-Net is a fully convolutional encoder-decoder architecture, introduced by Ronneberger, Fischer, and Brox at MICCAI 2015, that produces dense pixel-wise segmentation masks by combining a contracting path that captures context with a symmetric expanding path that enables precise localization — all bridged by skip connections that preserve fine spatial detail. It established the standard baseline for biomedical image segmentation and has since become one of the most widely adopted architectures for any pixel-level prediction task. |
| ScholarGateمجموعهداده ↗ |
|
|
|