Machine learningDeep learning / NLP / CV

다중 양식 텍스트 요약

다중 양식 텍스트 요약은 텍스트와 이미지, 또는 비디오 프레임이나 오디오와 같은 여러 입력 양식을 딥러닝 모델을 사용하여 공동으로 처리함으로써 간결한 텍스트 요약을 생성합니다. 이 모델들은 시각적 표현과 언어적 표현을 정렬하며, 결과물은 사용 가능한 모든 양식의 핵심 내용을 포착하는 자연어 요약입니다.

MethodMind에서 열기곧 제공동영상곧 제공Download slides

방법 전문 읽기

회원 전용

무료 계정으로 로그인하면 이 섹션을 읽을 수 있습니다.

로그인

Method map

The neighbourhood of related methods — select a node to explore.

다중 양식 텍스트 요약

BERT 기반 분류 미세 조정 텍스트 요약 멀티모달 BERT 기반 분류 다중 양식 질의응답 다중 모달 트랜스포머 도메인 적응 텍스트 요약

출처

Zhu, J., Li, H., Liu, T., Zhou, Y., Zhang, J., & Zong, C. (2018). MSMO: Multimodal Summarization with Multimodal Output. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), 4154–4164. link ↗
Zhu, J., Zhou, Y., Zhang, J., Li, H., Zong, C., & Li, C. (2020). Multimodal Summarization with Guidance of Multimodal Reference. Proceedings of the AAAI Conference on Artificial Intelligence, 34(05), 9749–9756. link ↗

이 페이지 인용 방법

ScholarGate. (2026, June 3). Multimodal Text Summarization (Cross-Modal Abstractive and Extractive Summarization). ScholarGate. https://scholargate.app/ko/deep-learning/multimodal-text-summarization

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

Compare side by side →

이 방법을 참조하는 항목

도메인 적응 텍스트 요약 다중 양식 질의응답

이 페이지에서 오류를 발견하셨나요? 신고하거나 수정을 제안하세요 →

방법 전문 읽기

Method map

출처

이 페이지 인용 방법

관련 방법

Which method?

이 방법을 참조하는 항목