Multimodal Variational Autoencoder

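The Multimodal Variational Autoencoder (MVAE) is a deep generative model that learns a shared latent representation across two or more data modalities, such as images and captions. It fuses the modality-specific encoders with a product of experts (PoE), which enables generation and inference even when only a subset of the modalities is observed at test time.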
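The product-of-experts fusion can be illustrated with a minimal sketch. It assumes each modality encoder outputs a diagonal Gaussian (a mean and a variance per latent dimension) and that a standard-normal prior is included as one extra expert. For Gaussian experts the product is again Gaussian: precisions add, and the fused mean is the precision-weighted average of the expert means. The function name `product_of_experts`, the two-dimensional latent space, and the expert values below are hypothetical and chosen only for illustration; a real model would produce these values from trained neural encoders.

```python
from typing import List, Optional, Sequence, Tuple

# (mean, variance) per latent dimension; an illustrative type alias.
Gaussian = Tuple[List[float], List[float]]


def product_of_experts(
    experts: Sequence[Optional[Gaussian]], latent_dim: int
) -> Gaussian:
    """Fuse diagonal-Gaussian experts; None marks an unobserved modality."""
    # Assumption: a standard-normal prior N(0, I) acts as one expert,
    # contributing precision 1 and mean 0 in every dimension.
    precision = [1.0] * latent_dim
    weighted_mean = [0.0] * latent_dim
    for expert in experts:
        if expert is None:
            continue  # a missing modality is simply left out of the product
        mean, var = expert
        for d in range(latent_dim):
            precision[d] += 1.0 / var[d]
            weighted_mean[d] += mean[d] / var[d]
    fused_var = [1.0 / p for p in precision]
    fused_mean = [m * v for m, v in zip(weighted_mean, fused_var)]
    return fused_mean, fused_var


# Hypothetical encoder outputs for one image/caption pair.
image_expert = ([2.0, -1.0], [1.0, 0.5])
caption_expert = ([0.0, 1.0], [1.0, 0.5])

both = product_of_experts([image_expert, caption_expert], latent_dim=2)
image_only = product_of_experts([image_expert, None], latent_dim=2)
```

Because a missing modality only removes one factor from the product, the same function serves the fully observed and the partially observed case. The fused variance with both modalities is smaller than with the image alone, which reflects that each additional expert sharpens the posterior.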

Method map

The neighbourhood of related methods:

Generative Adversarial Network, Mixture of Experts, Variational Autoencoder, Explainable Variational Autoencoder, Multimodal Diffusion Model, Multimodal GAN, Multimodal Graph Neural Network, Self-Supervised Variational Autoencoder

Sources

Wu, M., & Goodman, N. (2018). Multimodal Generative Models for Scalable Weakly-Supervised Learning. Advances in Neural Information Processing Systems (NeurIPS), 31.
Kingma, D. P., & Welling, M. (2014). Auto-Encoding Variational Bayes. International Conference on Learning Representations (ICLR).

How to cite this page

ScholarGate. (2026, June 3). Multimodal Variational Autoencoder (MVAE). ScholarGate. https://scholargate.app/ko/deep-learning/multimodal-variational-autoencoder

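Entries that reference this method

Explainable Variational Autoencoder, Multimodal Diffusion Model, Multimodal GAN, Multimodal Graph Neural Network, Self-Supervised Variational Autoencoder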
