ScholarGate
Asistente

Comparar métodos

Revisa los métodos seleccionados uno junto a otro; las filas que difieren aparecen resaltadas.

Segmentación multimodal de instancias×Detección multimodal de objetos×
CampoAprendizaje profundoAprendizaje profundo
FamiliaMachine learningMachine learning
Año de origen2017–present2015–2019
Autor originalHe, K., Gkioxari, G., Dollar, P., Girshick, R. (Mask R-CNN foundation); extended by community to multimodal settingsMultiple contributors (e.g., Chen & Deng, Liang et al.)
TipoSupervised deep learning — instance segmentationFusion-based deep detection
Fuente seminalHe, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗Liu, Y., Zhang, F., Li, Y., & Lv, H. (2022). Multimodal Object Detection via Bayesian Fusion. IEEE Transactions on Image Processing, 31, 5953–5965. link ↗
Aliasmultimodal Mask R-CNN, RGB-D instance segmentation, multi-sensor instance segmentation, cross-modal instance segmentationmulti-sensor object detection, cross-modal detection, RGB-D object detection, fusion-based object detection
Relacionados56
ResumenMultimodal instance segmentation extends classical instance segmentation — which assigns a per-pixel mask and a class label to every individual object in an image — by incorporating complementary sensor streams such as depth maps, LiDAR point clouds, or infrared frames. Fusing these modalities helps the model handle ambiguous appearances, low light, and occlusion that trip up RGB-only systems.Multimodal object detection extends single-modality object detectors by jointly processing signals from multiple sensor types — such as RGB cameras, depth sensors, LiDAR, radar, or text descriptions — to localize and classify objects with higher accuracy and robustness than any single modality alone. Fusion of complementary information is the core design principle.
ScholarGateConjunto de datos
  1. v1
  2. 2 Fuentes
  3. PUBLISHED
  1. v1
  2. 2 Fuentes
  3. PUBLISHED

Ir a la búsqueda Descargar diapositivas

ScholarGateComparar métodos: Multimodal Instance Segmentation · Multimodal Object Detection. Recuperado el 2026-06-15 de https://scholargate.app/es/compare