So sánh phương pháp
Xem các phương pháp đã chọn cạnh nhau; những hàng khác biệt được làm nổi bật.
| Multimodal Variational Autoencoder× | Bộ tự mã hóa biến phân× | |
|---|---|---|
| Lĩnh vực | Học sâu | Học sâu |
| Họ | Machine learning | Machine learning |
| Năm ra đời≠ | 2018 | 2014 |
| Người khởi xướng≠ | Wu, M. and Goodman, N. | Kingma, D. P. & Welling, M. |
| Loại≠ | Generative latent-variable model | Deep generative latent-variable model (encoder–decoder) |
| Công trình gốc≠ | Wu, M., & Goodman, N. (2018). Multimodal Generative Models for Scalable Weakly-Supervised Learning. Advances in Neural Information Processing Systems (NeurIPS), 31. link ↗ | Kingma, D. P. & Welling, M. (2014). Auto-Encoding Variational Bayes. International Conference on Learning Representations (ICLR). link ↗ |
| Tên gọi khác | MVAE, multimodal VAE, multi-modal variational autoencoder, multimodal generative model | Değişkensel Otokodlayıcı (VAE), VAE, auto-encoding variational Bayes, deep latent variable model |
| Liên quan≠ | 3 | 5 |
| Tóm tắt≠ | The Multimodal Variational Autoencoder (MVAE) is a deep generative model that learns a shared latent representation across two or more data modalities — such as images and captions — using a product-of-experts fusion of modality-specific encoders, enabling generation and inference even when only a subset of modalities is observed at test time. | The Variational Autoencoder (VAE) is a deep generative latent-variable model, introduced by Diederik Kingma and Max Welling in 2014, that encodes data as a probability distribution in a latent space and samples from that distribution to generate new examples. It is used for data generation, anomaly detection, and feature learning. |
| ScholarGateBộ dữ liệu ↗ |
|
|