Multimodal Convolutional Neural Network

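A Multimodal Convolutional Neural Network (MM-CNN) processes two or more input modalities (for example, images and text, or video and audio) through dedicated convolutional branches and fuses their outputs to learn a shared representation that captures complementary signals from each source. The fused representation then drives downstream tasks such as classification, regression, or retrieval.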
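The branch-then-fuse structure described above can be illustrated with a toy forward pass. This is a minimal sketch under several assumptions, not a reference implementation: each modality is stood in for by a short 1-D sequence, each branch is a single convolution layer with ReLU and global max pooling, fusion is plain concatenation (one of several possible fusion strategies), and weights are random rather than trained. All function and parameter names (`conv1d_relu`, `branch`, `fuse`, `init_params`, `mm_cnn_forward`) are hypothetical and chosen for illustration only.

```python
import random


def conv1d_relu(signal, kernel):
    """Valid 1-D cross-correlation of signal with kernel, followed by ReLU."""
    k = len(kernel)
    return [
        max(0.0, sum(signal[i + j] * kernel[j] for j in range(k)))
        for i in range(len(signal) - k + 1)
    ]


def branch(signal, kernels):
    """Modality-specific branch: one feature per kernel via global max pooling."""
    return [max(conv1d_relu(signal, kernel)) for kernel in kernels]


def fuse(features_a, features_b):
    """Fusion by concatenation (an assumed choice; other schemes exist)."""
    return features_a + features_b


def linear(features, weights, bias):
    """Dense head mapping the fused representation to task outputs."""
    return [
        sum(w * f for w, f in zip(row, features)) + b
        for row, b in zip(weights, bias)
    ]


def init_params(n_kernels_a, n_kernels_b, kernel_size, n_outputs, seed=0):
    """Random, untrained parameters; a real model would learn these."""
    rng = random.Random(seed)

    def rand(n):
        return [rng.uniform(-1.0, 1.0) for _ in range(n)]

    return {
        "kernels_a": [rand(kernel_size) for _ in range(n_kernels_a)],
        "kernels_b": [rand(kernel_size) for _ in range(n_kernels_b)],
        "head_w": [rand(n_kernels_a + n_kernels_b) for _ in range(n_outputs)],
        "head_b": [0.0] * n_outputs,
    }


def mm_cnn_forward(x_a, x_b, params):
    """Run each modality through its own branch, fuse, then apply the head."""
    feats_a = branch(x_a, params["kernels_a"])
    feats_b = branch(x_b, params["kernels_b"])
    fused = fuse(feats_a, feats_b)
    outputs = linear(fused, params["head_w"], params["head_b"])
    return fused, outputs


# Two toy inputs of different lengths, standing in for two modalities.
params = init_params(n_kernels_a=3, n_kernels_b=2, kernel_size=3, n_outputs=4)
x_a = [0.1, 0.5, -0.2, 0.8, 0.3, -0.6]
x_b = [1.0, 0.0, -1.0, 0.5]
fused, outputs = mm_cnn_forward(x_a, x_b, params)
print(len(fused), len(outputs))  # 5 4
```

Because each branch pools down to a fixed number of features, the two inputs may differ in length and the fused vector still has a fixed size (here 3 + 2 = 5), which is what lets a single head serve classification, regression, or retrieval-style scoring.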

Sources

Ngiam, J., Khosla, A., Kim, M., Nam, J., Lee, H., & Ng, A. Y. (2011). Multimodal deep learning. In Proceedings of the 28th International Conference on Machine Learning (ICML), 689–696.
Zhang, C., Yang, Z., He, X., & Deng, L. (2020). Multimodal intelligence: Representation learning, information fusion, and applications. IEEE Journal of Selected Topics in Signal Processing, 14(3), 478–493. DOI: 10.1109/JSTSP.2020.2987728

How to cite this page

ScholarGate. (2026, June 3). Multimodal Convolutional Neural Network (MM-CNN). ScholarGate. https://scholargate.app/ko/deep-learning/multimodal-convolutional-neural-network
