ScholarGate
Assistent
Machine learningDeep learning / NLP / CV

Multimodal grafnevrale nettverk

Et Multimodal Graph Neural Network (MM-GNN) kombinerer data fra flere modaliteter – som tekst, bilder og strukturerte trekk – til en enhetlig grafstruktur og anvender grafbasert meldingsutveksling for å lære felles representasjoner. Det muliggjør relasjonell resonnering på tvers av heterogene datakilder, og går utover hva unimodale eller enkle konkateneringsmetoder kan fange opp.

Åpne i MethodMindSnartVideoSnartDownload slides

Les hele metoden

Kun for medlemmer

Logg inn med en gratis konto for å lese denne delen.

Logg inn

Method map

The neighbourhood of related methods — select a node to explore.

Kilder

  1. Kipf, T. N., & Welling, M. (2017). Semi-Supervised Classification with Graph Convolutional Networks. International Conference on Learning Representations (ICLR). link
  2. Zhang, Z., Lin, H., & Zhao, X. (2020). Multimodal Graph Neural Network for Knowledge-Based Visual Question Answering. Information Processing & Management, 57(6), 102382. link

Slik siterer du denne siden

ScholarGate. (2026, June 3). Multimodal Graph Neural Network (MM-GNN). ScholarGate. https://scholargate.app/no/deep-learning/multimodal-graph-neural-network

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

Referert av

ScholarGateMultimodal Graph Neural Network (Multimodal Graph Neural Network (MM-GNN)). Hentet 2026-06-15 fra https://scholargate.app/no/deep-learning/multimodal-graph-neural-network · Datasett: https://doi.org/10.5281/zenodo.20539026