ScholarGate

Multimodal Named Entity Recognition

Multimodal Named Entity Recognition (MNER) extends classical NER by fusing text sequences with complementary modalities, most often images, to improve the identification and classification of named entities such as persons, organizations, and locations in scenarios where visual context disambiguates ambiguous or sparse text.
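
To make the fusion idea concrete, here is a minimal sketch in plain Python. It assumes each token and each image region has already been encoded as a vector of the same size, lets every token attend over the image regions, concatenates the resulting visual context onto the token vector, and scores BIO tags with a linear layer. The function names, the tag set, the example tokens, and the random weights are all illustrative assumptions; this is a generic attention-based fusion, not the specific architecture of any published MNER model, and with untrained weights the predicted tags are meaningless.

```python
import math
import random

# Hypothetical BIO tag set for persons, organizations, and locations.
TAGS = ["O", "B-PER", "I-PER", "B-ORG", "I-ORG", "B-LOC", "I-LOC"]


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_text_and_image(token_feats, region_feats):
    """Attend from each token over image regions and concatenate.

    token_feats:  T vectors of size d (one per token)
    region_feats: R vectors of size d (one per image region)
    Returns (fused, attention): T vectors of size 2d, and T rows of R weights.
    """
    d = len(token_feats[0])
    fused, attention = [], []
    for tok in token_feats:
        # Scaled dot-product attention weights over the image regions.
        weights = softmax([dot(tok, reg) / math.sqrt(d) for reg in region_feats])
        # Visual context: attention-weighted average of the region vectors.
        context = [
            sum(w * reg[k] for w, reg in zip(weights, region_feats))
            for k in range(d)
        ]
        fused.append(tok + context)  # list concatenation: [text ; visual]
        attention.append(weights)
    return fused, attention


def tag_tokens(fused, weight_rows, bias):
    """Linear tag scorer: one weight row (size 2d) and one bias per tag."""
    tags = []
    for vec in fused:
        logits = [dot(vec, row) + b for row, b in zip(weight_rows, bias)]
        tags.append(TAGS[logits.index(max(logits))])
    return tags


# Toy inputs: random vectors stand in for real text and image encoders.
rng = random.Random(0)
d = 8
tokens = ["Visiting", "Paris", "with", "Alex"]
token_feats = [[rng.gauss(0, 1) for _ in range(d)] for _ in tokens]
region_feats = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(5)]
weight_rows = [[rng.gauss(0, 1) for _ in range(2 * d)] for _ in TAGS]
bias = [0.0] * len(TAGS)

fused, attention = fuse_text_and_image(token_feats, region_feats)
predicted = tag_tokens(fused, weight_rows, bias)
for token, tag in zip(tokens, predicted):
    print(token, tag)
```

In a real system the random vectors would come from trained text and image encoders, the tag scorer would be learned jointly with the fusion step, and a sequence decoder such as a CRF would often replace the per-token argmax.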

Sources

  1. Moon, S., Neves, L., & Carvalho, V. (2018). Multimodal Named Entity Recognition for Short Social Media Posts. Proceedings of NAACL-HLT 2018, pp. 852–860. Association for Computational Linguistics.
  2. Lu, D., Neves, L., Carvalho, V., Zhang, N., & Ji, H. (2018). Visual Attention Model for Name Tagging in Multimodal Social Media. Proceedings of ACL 2018, pp. 1990–1999. Association for Computational Linguistics.

How to cite this page

ScholarGate. (2026, June 3). Multimodal Named Entity Recognition (Text + Visual/Auxiliary Modality NER). ScholarGate. https://scholargate.app/da/deep-learning/multimodal-named-entity-recognition

ScholarGate. Multimodal Named Entity Recognition (Text + Visual/Auxiliary Modality NER). Retrieved 2026-06-15 from https://scholargate.app/da/deep-learning/multimodal-named-entity-recognition · Dataset: https://doi.org/10.5281/zenodo.20539026