So sánh phương pháp

Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.

Phân đoạn thể hiện đa phương thức×Phát hiện đối tượng×
Lĩnh vựcHọc sâuHọc sâu
HọMachine learningMachine learning
Năm ra đời2017–present2014–2016
Người khởi xướngHe, K., Gkioxari, G., Dollar, P., Girshick, R. (Mask R-CNN foundation); extended by community to multimodal settingsGirshick, R. et al. (R-CNN); Redmon, J. et al. (YOLO)
LoạiSupervised deep learning — instance segmentationSupervised deep learning (region proposal or single-shot)
Công trình gốcHe, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗Girshick, R., Donahue, J., Darrell, T., & Malik, J. (2014). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 580–587. DOI ↗
Tên gọi khácmultimodal Mask R-CNN, RGB-D instance segmentation, multi-sensor instance segmentation, cross-modal instance segmentationvisual object detection, image object localization, region-based object detection, bounding-box detection
Liên quan53
Tóm tắtMultimodal instance segmentation extends classical instance segmentation — which assigns a per-pixel mask and a class label to every individual object in an image — by incorporating complementary sensor streams such as depth maps, LiDAR point clouds, or infrared frames. Fusing these modalities helps the model handle ambiguous appearances, low light, and occlusion that trip up RGB-only systems.Object detection is a computer vision task in which a deep neural network simultaneously locates and classifies every instance of one or more object categories within an image, producing a bounding box and a class label for each detected object. Modern detectors — from the R-CNN family to YOLO and DETR — achieve near-human accuracy at real-time speeds on standard benchmarks.
ScholarGateBộ dữ liệu
  1. v1
  2. 2 Nguồn tài liệu
  3. PUBLISHED
  1. v1
  2. 2 Nguồn tài liệu
  3. PUBLISHED

Đến trang tìm kiếm Download slides

ScholarGateSo sánh phương pháp: Multimodal Instance Segmentation · Object Detection. Truy cập ngày 2026-06-15 từ https://scholargate.app/vi/compare