Compara mètodes
Revisa els mètodes seleccionats l'un al costat de l'altre; les files que difereixen es ressalten.
| Mask R-CNN: Segmentació d'instàncies amb màscares a nivell de píxel× | Faster R-CNN× | U-Net× | |
|---|---|---|---|
| Camp | Aprenentatge profund | Aprenentatge profund | Aprenentatge profund |
| Família | Machine learning | Machine learning | Machine learning |
| Any d'origen≠ | 2017 | 2015 | 2015 |
| Autor original≠ | Kaiming He et al. (FAIR) | Ren, S.; He, K.; Girshick, R.; Sun, J. (Microsoft Research) | Ronneberger, O., Fischer, P., & Brox, T. |
| Tipus≠ | Instance segmentation deep neural network | Two-stage object detection CNN | Encoder-decoder convolutional network with skip connections |
| Font seminal≠ | He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN. IEEE International Conference on Computer Vision (ICCV), 2980–2988. DOI ↗ | Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Advances in Neural Information Processing Systems (NeurIPS), 28, 91–99. link ↗ | Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. In N. Navab et al. (Eds.), Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, LNCS 9351 (pp. 234–241). Springer. DOI ↗ |
| Àlies≠ | Mask Region-based Convolutional Neural Network, Instance Segmentation R-CNN, He et al. 2017 Segmentation Model, Maske R-CNN | Faster RCNN, Faster-RCNN, RPN-based detector, two-stage object detector | U-Net, UNet, encoder-decoder with skip connections, fully convolutional segmentation network |
| Relacionats≠ | 2 | 2 | 3 |
| Resum≠ | Mask R-CNN is a deep learning framework for instance segmentation introduced by Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick at Facebook AI Research (FAIR) in 2017. It extends Faster R-CNN by adding a parallel branch that predicts a binary pixel-level mask for each detected object instance, enabling simultaneous object detection, classification, and fine-grained segmentation in a single forward pass. | Faster R-CNN is a two-stage deep convolutional object detection framework introduced by Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun (Microsoft Research) at NeurIPS 2015. It replaces the slow selective-search region proposal step used in its predecessors R-CNN and Fast R-CNN with a learned Region Proposal Network (RPN) that shares convolutional features with the detection head, enabling the first end-to-end trainable, near-real-time accurate object detector and establishing a long-standing accuracy benchmark on PASCAL VOC and MS COCO. | U-Net is a fully convolutional encoder-decoder architecture, introduced by Ronneberger, Fischer, and Brox at MICCAI 2015, that produces dense pixel-wise segmentation masks by combining a contracting path that captures context with a symmetric expanding path that enables precise localization — all bridged by skip connections that preserve fine spatial detail. It established the standard baseline for biomedical image segmentation and has since become one of the most widely adopted architectures for any pixel-level prediction task. |
| ScholarGateConjunt de dades ↗ |
|
|
|