ScholarGate
助手

方法对比

并排查看您选择的方法;存在差异的行会高亮显示。

多模态语义分割×实例分割×
领域深度学习深度学习
方法族Machine learningMachine learning
起源年份2014–20162017
提出者Multiple contributors (Hazirbas et al., Long et al., and others)He, K., Gkioxari, G., Dollar, P., Girshick, R.
类型Pixel-level classification with multi-sensor fusionPixel-level detection and mask prediction
开创性文献Hazirbas, C., Ma, L., Domokos, C., & Cremers, D. (2016). FuseNet: Incorporating Depth into Semantic Segmentation via Fusion-based CNN Architecture. In Proceedings of the Asian Conference on Computer Vision (ACCV). Springer. link ↗He, K., Gkioxari, G., Dollar, P., & Girshick, R. (2017). Mask R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2961–2969. DOI ↗
别名multimodal scene parsing, multi-sensor semantic segmentation, RGB-D semantic segmentation, cross-modal semantic segmentationinstance-level segmentation, object instance segmentation, mask prediction, panoptic instance segmentation
相关34
摘要Multimodal semantic segmentation assigns a semantic class label to every pixel in a scene by fusing information from two or more sensor modalities — most commonly RGB images paired with depth maps (RGB-D), LiDAR point clouds, thermal cameras, or text descriptions. Deep encoder-decoder networks learn to align and fuse complementary cues from each modality, producing denser and more accurate segmentation than any single-modality approach.Instance segmentation is a computer vision task that simultaneously detects every distinct object in an image and produces a precise pixel-level mask for each individual object instance. Unlike semantic segmentation, which labels every pixel with a class, instance segmentation distinguishes between separate objects of the same class, enabling fine-grained spatial understanding.
ScholarGate数据集
  1. v1
  2. 2 来源
  3. PUBLISHED
  1. v1
  2. 2 来源
  3. PUBLISHED

前往搜索 下载幻灯片

ScholarGate方法对比: Multimodal Semantic Segmentation · Instance Segmentation. 于 2026-06-15 检索自 https://scholargate.app/zh/compare