Σύγκριση μεθόδων
Εξετάστε τις επιλεγμένες μεθόδους δίπλα-δίπλα· οι γραμμές που διαφέρουν επισημαίνονται.
| Τμηματική κατάτμηση πολλαπλών τρόπων (Multimodal Instance Segmentation)× | Ανίχνευση Αντικειμένων× | |
|---|---|---|
| Πεδίο | Βαθιά Μάθηση | Βαθιά Μάθηση |
| Οικογένεια | Machine learning | Machine learning |
| Έτος προέλευσης≠ | 2017–present | 2014–2016 |
| Δημιουργός≠ | He, K., Gkioxari, G., Dollar, P., Girshick, R. (Mask R-CNN foundation); extended by community to multimodal settings | Girshick, R. et al. (R-CNN); Redmon, J. et al. (YOLO) |
| Τύπος≠ | Supervised deep learning — instance segmentation | Supervised deep learning (region proposal or single-shot) |
| Θεμελιώδης πηγή≠ | He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗ | Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 580–587. DOI ↗ |
| Εναλλακτικές ονομασίες | multimodal Mask R-CNN, RGB-D instance segmentation, multi-sensor instance segmentation, cross-modal instance segmentation | visual object detection, image object localization, region-based object detection, bounding-box detection |
| Συναφείς≠ | 5 | 3 |
| Σύνοψη≠ | Multimodal instance segmentation extends classical instance segmentation — which assigns a per-pixel mask and a class label to every individual object in an image — by incorporating complementary sensor streams such as depth maps, LiDAR point clouds, or infrared frames. Fusing these modalities helps the model handle ambiguous appearances, low light, and occlusion that trip up RGB-only systems. | Object detection is a computer vision task in which a deep neural network simultaneously locates and classifies every instance of one or more object categories within an image, producing a bounding box and a class label for each detected object. Modern detectors — from the R-CNN family to YOLO and DETR — achieve near-human accuracy at real-time speeds on standard benchmarks. |
| ScholarGateΣύνολο δεδομένων ↗ |
|
|