Machine learningDeep learning / NLP / CV

多模态命名实体识别

多模态命名实体识别（Multimodal Named Entity Recognition, MNER）通过融合文本序列和互补模态（最常见的是图像）来扩展经典的命名实体识别（NER），以提高对人名、组织机构名、地名等命名实体的识别和分类能力，尤其是在视觉上下文能够消除歧义或弥补文本稀疏性的场景下。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

Method map

The neighbourhood of related methods — select a node to explore.

多模态命名实体识别

[需翻译标题：BERT-based Classi…多模态BERT分类多模态问题解答多模态句子嵌入多模态Transformer 命名实体识别 (NER)

来源

Moon, S., Neves, L., & Carvalho, V. (2018). Multimodal Named Entity Recognition for Short Social Media Posts. Proceedings of NAACL-HLT 2018, pp. 852–860. Association for Computational Linguistics. link ↗
Lu, D., Neves, L., Carvalho, V., Zhang, N., & Ji, H. (2018). Visual Attention Model for Name Tagging in Multimodal Social Media. Proceedings of ACL 2018, pp. 1990–1999. Association for Computational Linguistics. link ↗

如何引用本页

ScholarGate. (2026, June 3). Multimodal Named Entity Recognition (Text + Visual/Auxiliary Modality NER). ScholarGate. https://scholargate.app/zh/deep-learning/multimodal-named-entity-recognition

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

[需翻译标题：BERT-based Classification...]深度学习↔ compare
多模态BERT分类深度学习↔ compare
多模态问题解答深度学习↔ compare
多模态句子嵌入深度学习↔ compare
多模态Transformer深度学习↔ compare
命名实体识别 (NER)文本挖掘↔ compare

Compare side by side →

发现本页有问题？报告或提出修改建议 →

阅读完整方法

Method map

来源

如何引用本页

相关方法

Which method?