ScholarGate
助手
Machine learningDeep learning / NLP / CV

多模态图神经网络

多模态图神经网络(MM-GNN)将来自文本、图像和结构化特征等多种模态的数据整合到统一的图结构中,并应用基于图的消息传递机制来学习联合表示。它能够对异构数据源进行关系推理,超越了单模态或简单拼接方法所能捕捉的能力。

在 MethodMind 中打开即将推出视频即将推出Download slides

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

登录

Method map

The neighbourhood of related methods — select a node to explore.

来源

  1. Kipf, T. N., & Welling, M. (2017). Semi-Supervised Classification with Graph Convolutional Networks. International Conference on Learning Representations (ICLR). link
  2. Zhang, Z., Lin, H., & Zhao, X. (2020). Multimodal Graph Neural Network for Knowledge-Based Visual Question Answering. Information Processing & Management, 57(6), 102382. link

如何引用本页

ScholarGate. (2026, June 3). Multimodal Graph Neural Network (MM-GNN). ScholarGate. https://scholargate.app/zh/deep-learning/multimodal-graph-neural-network

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side

被引用于

ScholarGateMultimodal Graph Neural Network (Multimodal Graph Neural Network (MM-GNN)). 于 2026-06-15 检索自 https://scholargate.app/zh/deep-learning/multimodal-graph-neural-network · 数据集: https://doi.org/10.5281/zenodo.20539026