Method Comparison
Review the selected methods side by side. Rows that differ are highlighted (marked ≠).
| | Explainable Variational Autoencoder | Multimodal Variational Autoencoder |
|---|---|---|
| Field | Deep learning | Deep learning |
| Family | Machine learning | Machine learning |
| Year of origin≠ | 2013–2017 | 2018 |
| Originators≠ | Kingma, D. P. & Welling, M. (VAE); Higgins et al. (beta-VAE for disentanglement) | Wu, M. & Goodman, N. |
| Type≠ | Generative model with interpretable latent space | Generative latent-variable model |
| Primary source≠ | Kingma, D. P., & Welling, M. (2014). Auto-Encoding Variational Bayes. In Proceedings of the 2nd International Conference on Learning Representations (ICLR 2014). | Wu, M., & Goodman, N. (2018). Multimodal Generative Models for Scalable Weakly-Supervised Learning. Advances in Neural Information Processing Systems (NeurIPS), 31. |
| Aliases | XVAE, Interpretable VAE, Disentangled Variational Autoencoder, Explainable Generative Model | MVAE, multimodal VAE, multi-modal variational autoencoder, multimodal generative model |
| Related≠ | 4 | 3 |
| Summary≠ | An Explainable Variational Autoencoder (XVAE) extends the standard VAE framework with techniques that make its latent space interpretable: either disentangling latent dimensions so that each corresponds to a human-understandable factor, or applying post-hoc attribution methods (SHAP, integrated gradients) that trace reconstructions back to input features. It retains the VAE's generative power while adding the transparency required in scientific and high-stakes applications. | The Multimodal Variational Autoencoder (MVAE) is a deep generative model that learns a shared latent representation across two or more data modalities, such as images and captions, using a product-of-experts fusion of modality-specific encoders. This enables generation and inference even when only a subset of modalities is observed at test time. |
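
The product-of-experts fusion mentioned in the MVAE summary can be illustrated with a small sketch. It assumes each modality-specific encoder outputs a diagonal Gaussian (a mean and a variance per latent dimension) and that a standard normal prior acts as an additional expert. The function name `product_of_experts` and the toy numbers are hypothetical and chosen only for illustration; this is not the reference implementation from the cited paper.

```python
from typing import List, Optional, Tuple

# (mean, variance) per latent dimension for one expert
Gaussian = Tuple[List[float], List[float]]


def product_of_experts(experts: List[Optional[Gaussian]], latent_dim: int) -> Gaussian:
    """Fuse diagonal Gaussian experts; None marks a missing modality."""
    # Assumption: a standard normal prior expert (mean 0, variance 1)
    # is always included, so it contributes precision 1 and weighted mean 0.
    precision_sum = [1.0] * latent_dim
    weighted_mean_sum = [0.0] * latent_dim

    for expert in experts:
        if expert is None:
            continue  # an unobserved modality contributes nothing
        mean, var = expert
        for d in range(latent_dim):
            precision = 1.0 / var[d]
            precision_sum[d] += precision
            weighted_mean_sum[d] += mean[d] * precision

    # A product of Gaussians is Gaussian: precisions add, and the fused mean
    # is the precision-weighted average of the expert means.
    fused_var = [1.0 / p for p in precision_sum]
    fused_mean = [m * v for m, v in zip(weighted_mean_sum, fused_var)]
    return fused_mean, fused_var


# Toy encoder outputs for two modalities (illustrative values only).
image_expert = ([2.0, 0.0], [1.0, 0.5])
caption_expert = ([0.0, 3.0], [1.0, 0.5])

both = product_of_experts([image_expert, caption_expert], latent_dim=2)
image_only = product_of_experts([image_expert, None], latent_dim=2)
```

Because a missing modality is simply skipped, the same function covers the case where only a subset of modalities is observed: `image_only` is a valid posterior estimate, just with larger variance than `both`.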