Process / pipeline

자동 텍스트 평가 — BLEU, ROUGE, BERTScore

자동 텍스트 평가는 기계 생성 텍스트(예: 번역, 요약 또는 자연어 생성(NLG) 출력)의 품질을 하나 이상의 사람이 작성한 참조 텍스트와 비교하여 측정하는 참조 기반 지표 계열입니다. 2002년 Papineni 등이 BLEU를 통해 개척한 이 분야는 n-그램 중복 지표(BLEU, ROUGE)와 표면 단어 일치를 넘어서는 의미를 포착하는 의미론적으로 인식하는 지표(BERTScore, MoverScore)를 포함하도록 성장했습니다.

MethodMind에서 열기곧 제공동영상곧 제공Download slides

방법 전문 읽기

회원 전용

무료 계정으로 로그인하면 이 섹션을 읽을 수 있습니다.

로그인

Method map

The neighbourhood of related methods — select a node to explore.

자동 텍스트 평가

BERT 임베딩 감성 분석 텍스트 분류 토픽 모델링 자연어 생성 텍스트 일관성 점수화

출처

Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). BLEU: A Method for Automatic Evaluation of Machine Translation. Proceedings of ACL 2002. link ↗
Zhang, T., Kishore, V., Wu, F., Weinberger, K. Q., & Artzi, Y. (2020). BERTScore: Evaluating Text Generation with BERT. Proceedings of ICLR 2020. link ↗

이 페이지 인용 방법

ScholarGate. (2026, June 1). Automatic Text Evaluation (BLEU, ROUGE, BERTScore). ScholarGate. https://scholargate.app/ko/text-mining/automatic-text-evaluation

Which method?

Set this method beside its closest kin and read them side by side — the library lays the books on the table; the choice is yours.

BERT 임베딩텍스트 마이닝↔ compare
감성 분석텍스트 마이닝↔ compare
텍스트 분류텍스트 마이닝↔ compare
토픽 모델링딥러닝↔ compare

Compare side by side →

이 방법을 참조하는 항목

자연어 생성 텍스트 일관성 점수화

이 페이지에서 오류를 발견하셨나요? 신고하거나 수정을 제안하세요 →