Machine learningDeep learning / NLP / CV

다중 양식 문장 임베딩

다중 양식 문장 임베딩은 텍스트와 이미지를 (때로는 오디오나 비디오까지) 공유되는 연속 벡터 공간으로 매핑하여, 서로 다른 양식의 의미론적으로 관련된 쌍이 가깝게 위치하도록 합니다. 대규모 쌍형 코퍼스에 대한 대조 학습 목표로 훈련된 이러한 표현은 교차 양식 검색, 제로샷 분류 및 시각-언어 추론에 사용됩니다.

MethodMind에서 열기곧 제공동영상곧 제공Download slides

방법 전문 읽기

회원 전용

무료 계정으로 로그인하면 이 섹션을 읽을 수 있습니다.

로그인

Method map

The neighbourhood of related methods — select a node to explore.

다중 양식 문장 임베딩

CLIP Multimodal Doc2Vec 다중 양식 그래프 신경망 다중 양식 이미지 분류 Multimodal Multilayer Pe…다중 양식 명사 개체 인식 다중 양식 질의응답 다중 모달 RoBERTa 기반 분류 다중 양식 토픽 모델링 다중모드 워드투벡터

출처

Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., ... & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML), pp. 8748–8763. PMLR. link ↗
Frome, A., Corrado, G. S., Shlens, J., Bengio, S., Dean, J., Ranzato, M., & Mikolov, T. (2013). DeViSE: A deep visual-semantic embedding model. In Advances in Neural Information Processing Systems (NeurIPS), Vol. 26. link ↗

이 페이지 인용 방법

ScholarGate. (2026, June 3). Multimodal Sentence Embeddings (Joint Vision-Language Representation Learning). ScholarGate. https://scholargate.app/ko/deep-learning/multimodal-sentence-embeddings

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

CLIP딥러닝↔ compare

Compare side by side →

이 방법을 참조하는 항목

Multimodal Doc2Vec 다중 양식 그래프 신경망 다중 양식 이미지 분류 Multimodal Multilayer Perceptron 다중 양식 명사 개체 인식 다중 양식 질의응답 다중 모달 RoBERTa 기반 분류 다중 양식 토픽 모델링 다중모드 워드투벡터

이 페이지에서 오류를 발견하셨나요? 신고하거나 수정을 제안하세요 →

방법 전문 읽기

Method map

출처

이 페이지 인용 방법

관련 방법

Which method?

이 방법을 참조하는 항목