방법 비교
선택한 방법을 나란히 검토하세요. 서로 다른 행은 강조 표시됩니다.
| 인스턴스 분할× | 다중 모달 인스턴스 분할× | |
|---|---|---|
| 분야 | 딥러닝 | 딥러닝 |
| 계열 | Machine learning | Machine learning |
| 기원 연도≠ | 2017 | 2017–present |
| 창시자≠ | He, K., Gkioxari, G., Dollar, P., Girshick, R. | He, K., Gkioxari, G., Dollar, P., Girshick, R. (Mask R-CNN foundation); extended by community to multimodal settings |
| 유형≠ | Pixel-level detection and mask prediction | Supervised deep learning — instance segmentation |
| 원전 | He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗ | He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗ |
| 별칭 | instance-level segmentation, object instance segmentation, mask prediction, panoptic instance segmentation | multimodal Mask R-CNN, RGB-D instance segmentation, multi-sensor instance segmentation, cross-modal instance segmentation |
| 관련≠ | 4 | 5 |
| 요약≠ | Instance segmentation is a computer vision task that simultaneously detects every distinct object in an image and produces a precise pixel-level mask for each individual object instance. Unlike semantic segmentation, which labels every pixel with a class, instance segmentation distinguishes between separate objects of the same class, enabling fine-grained spatial understanding. | Multimodal instance segmentation extends classical instance segmentation — which assigns a per-pixel mask and a class label to every individual object in an image — by incorporating complementary sensor streams such as depth maps, LiDAR point clouds, or infrared frames. Fusing these modalities helps the model handle ambiguous appearances, low light, and occlusion that trip up RGB-only systems. |
| ScholarGate데이터셋 ↗ |
|
|