ScholarGate
助手
Machine learningDeep learning / NLP / CV

多模态命名实体识别

多模态命名实体识别(Multimodal Named Entity Recognition, MNER)通过融合文本序列和互补模态(最常见的是图像)来扩展经典的命名实体识别(NER),以提高对人名、组织机构名、地名等命名实体的识别和分类能力,尤其是在视觉上下文能够消除歧义或弥补文本稀疏性的场景下。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

登录

Method map

The neighbourhood of related methods — select a node to explore.

来源

  1. Moon, S., Neves, L., & Carvalho, V. (2018). Multimodal Named Entity Recognition for Short Social Media Posts. Proceedings of NAACL-HLT 2018, pp. 852–860. Association for Computational Linguistics. link
  2. Lu, D., Neves, L., Carvalho, V., Zhang, N., & Ji, H. (2018). Visual Attention Model for Name Tagging in Multimodal Social Media. Proceedings of ACL 2018, pp. 1990–1999. Association for Computational Linguistics. link

如何引用本页

ScholarGate. (2026, June 3). Multimodal Named Entity Recognition (Text + Visual/Auxiliary Modality NER). ScholarGate. https://scholargate.app/zh/deep-learning/multimodal-named-entity-recognition

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side
ScholarGateMultimodal Named Entity Recognition (Multimodal Named Entity Recognition (Text + Visual/Auxiliary Modality NER)). 于 2026-06-15 检索自 https://scholargate.app/zh/deep-learning/multimodal-named-entity-recognition · 数据集: https://doi.org/10.5281/zenodo.20539026