قارن الطرق

راجع الطرق التي اخترتها جنبًا إلى جنب؛ الصفوف المختلفة مميَّزة.

	شبكة الرسم البياني متعددة الوسائط ×	المشفر التلقائي التبايني متعدد الوسائط ×
المجال	التعلم العميق	التعلم العميق
العائلة	Machine learning	Machine learning
سنة النشأة≠	2019–2020	2018
صاحب الطريقة≠	Kipf & Welling (GNN foundation); extended to multimodal settings by multiple research groups c. 2019–2020	Wu, M. and Goodman, N.
النوع≠	Graph-based deep learning with multimodal input fusion	Generative latent-variable model
المصدر التأسيسي≠	Kipf, T. N., & Welling, M. (2017). Semi-Supervised Classification with Graph Convolutional Networks. International Conference on Learning Representations (ICLR). link ↗	Wu, M., & Goodman, N. (2018). Multimodal Generative Models for Scalable Weakly-Supervised Learning. Advances in Neural Information Processing Systems (NeurIPS), 31. link ↗
الأسماء البديلة	MM-GNN, Multimodal GNN, Multi-modal Graph Network, Cross-modal Graph Neural Network	MVAE, multimodal VAE, multi-modal variational autoencoder, multimodal generative model
ذات صلة≠	6	3
الملخص≠	A Multimodal Graph Neural Network (MM-GNN) combines data from multiple modalities — such as text, images, and structured features — into a unified graph structure and applies graph-based message passing to learn joint representations. It enables relational reasoning across heterogeneous data sources, going beyond what unimodal or simple concatenation approaches can capture.	The Multimodal Variational Autoencoder (MVAE) is a deep generative model that learns a shared latent representation across two or more data modalities — such as images and captions — using a product-of-experts fusion of modality-specific encoders, enabling generation and inference even when only a subset of modalities is observed at test time.
ScholarGateمجموعة البيانات ↗	v1 2 المصادر PUBLISHED	v1 2 المصادر PUBLISHED

انتقل إلى البحث → تنزيل الشرائح