So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| U-Net× | Mask R-CNN: Phân đoạn đối tượng với mặt nạ cấp pixel× | |
|---|---|---|
| Lĩnh vực | Học sâu | Học sâu |
| Họ | Machine learning | Machine learning |
| Năm ra đời≠ | 2015 | 2017 |
| Người khởi xướng≠ | Ronneberger, O., Fischer, P., & Brox, T. | Kaiming He et al. (FAIR) |
| Loại≠ | Encoder-decoder convolutional network with skip connections | Instance segmentation deep neural network |
| Công trình gốc≠ | Ronneberger, O., Fischer, P., & Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. In N. Navab et al. (Eds.), Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, LNCS 9351 (pp. 234–241). Springer. DOI ↗ | He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN. IEEE International Conference on Computer Vision (ICCV), 2980–2988. DOI ↗ |
| Tên gọi khác≠ | U-Net, UNet, encoder-decoder with skip connections, fully convolutional segmentation network | Mask Region-based Convolutional Neural Network, Instance Segmentation R-CNN, He et al. 2017 Segmentation Model, Maske R-CNN |
| Liên quan≠ | 3 | 2 |
| Tóm tắt≠ | U-Net is a fully convolutional encoder-decoder architecture, introduced by Ronneberger, Fischer, and Brox at MICCAI 2015, that produces dense pixel-wise segmentation masks by combining a contracting path that captures context with a symmetric expanding path that enables precise localization — all bridged by skip connections that preserve fine spatial detail. It established the standard baseline for biomedical image segmentation and has since become one of the most widely adopted architectures for any pixel-level prediction task. | Mask R-CNN is a deep learning framework for instance segmentation introduced by Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick at Facebook AI Research (FAIR) in 2017. It extends Faster R-CNN by adding a parallel branch that predicts a binary pixel-level mask for each detected object instance, enabling simultaneous object detection, classification, and fine-grained segmentation in a single forward pass. |
| ScholarGateBộ dữ liệu ↗ |
|
|