Machine learningDeep learning / NLP / CV

Multimodal Doc2Vec

Multimodal Doc2Vec은 텍스트와 이미지, 오디오 또는 구조화된 메타데이터와 같은 하나 이상의 양식에서 정보를 통합하여 여러 소스의 의미를 동시에 포착하는 공유 문서 수준 임베딩을 생성하는 Doc2Vec 단락 벡터 프레임워크를 확장합니다. 이는 교차 양식 검색, 다중 소스 분류 및 텍스트만으로는 불충분한 문서 표현에 사용됩니다.

MethodMind에서 열기곧 제공동영상곧 제공Download slides

방법 전문 읽기

회원 전용

무료 계정으로 로그인하면 이 섹션을 읽을 수 있습니다.

로그인

Method map

The neighbourhood of related methods — select a node to explore.

Multimodal Doc2Vec

Doc2Vec 멀티모달 BERT 기반 분류 다중 양식 문장 임베딩 다중 모달 트랜스포머 다중모드 워드투벡터 문장 임베딩

출처

Le, Q. V., & Mikolov, T. (2014). Distributed Representations of Sentences and Documents. Proceedings of the 31st International Conference on Machine Learning (ICML), PMLR 32(2), 1188–1196. link ↗
Ngiam, J., Khosla, A., Kim, M., Nam, J., Lee, H., & Ng, A. Y. (2011). Multimodal Deep Learning. Proceedings of the 28th International Conference on Machine Learning (ICML), 689–696. link ↗

이 페이지 인용 방법

ScholarGate. (2026, June 3). Multimodal Doc2Vec (Paragraph Vector with Multi-Source Input). ScholarGate. https://scholargate.app/ko/deep-learning/multimodal-doc2vec

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side →

이 방법을 참조하는 항목

다중모드 워드투벡터

이 페이지에서 오류를 발견하셨나요? 신고하거나 수정을 제안하세요 →