قارن الطرق
راجع الطرق التي اخترتها جنبًا إلى جنب؛ الصفوف المختلفة مميَّزة.
| تقسيم الكيانات (Instance Segmentation)× | تجزئة الكائنات متعددة الوسائط× | |
|---|---|---|
| المجال | التعلم العميق | التعلم العميق |
| العائلة | Machine learning | Machine learning |
| سنة النشأة≠ | 2017 | 2017–present |
| صاحب الطريقة≠ | He, K., Gkioxari, G., Dollar, P., Girshick, R. | He, K., Gkioxari, G., Dollar, P., Girshick, R. (Mask R-CNN foundation); extended by community to multimodal settings |
| النوع≠ | Pixel-level detection and mask prediction | Supervised deep learning — instance segmentation |
| المصدر التأسيسي | He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗ | He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗ |
| الأسماء البديلة | instance-level segmentation, object instance segmentation, mask prediction, panoptic instance segmentation | multimodal Mask R-CNN, RGB-D instance segmentation, multi-sensor instance segmentation, cross-modal instance segmentation |
| ذات صلة≠ | 4 | 5 |
| الملخص≠ | Instance segmentation is a computer vision task that simultaneously detects every distinct object in an image and produces a precise pixel-level mask for each individual object instance. Unlike semantic segmentation, which labels every pixel with a class, instance segmentation distinguishes between separate objects of the same class, enabling fine-grained spatial understanding. | Multimodal instance segmentation extends classical instance segmentation — which assigns a per-pixel mask and a class label to every individual object in an image — by incorporating complementary sensor streams such as depth maps, LiDAR point clouds, or infrared frames. Fusing these modalities helps the model handle ambiguous appearances, low light, and occlusion that trip up RGB-only systems. |
| ScholarGateمجموعة البيانات ↗ |
|
|