مقایسهٔ روشها
روشهای انتخابی خود را کنار هم مرور کنید؛ ردیفهای متفاوت برجسته شدهاند.
| تولید زبان طبیعی× | ارزیابی خودکار متن× | |
|---|---|---|
| حوزه | متنکاوی | متنکاوی |
| خانواده | Process / pipeline | Process / pipeline |
| سال پیدایش≠ | 1970s (rule-based origins); 2000s (probabilistic); 2017+ (neural/transformer era) | 2002 (BLEU); 2004 (ROUGE); 2020 (BERTScore) |
| پدیدآور≠ | Reiter & Dale (classical pipeline, 2000); Gatt & Krahmer (modern survey, 2018) | BLEU: Papineni et al. (2002); ROUGE: Lin (2004); BERTScore: Zhang et al. (2020) |
| نوع≠ | NLP generative task — structured data to natural language | Reference-based NLG evaluation metric suite |
| منبع بنیادین≠ | Gatt, A. & Krahmer, E. (2018). Survey of the State of the Art in Natural Language Generation: Core Tasks, Applications and Evaluation. Journal of Artificial Intelligence Research, 61, 65-170. link ↗ | Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). BLEU: A Method for Automatic Evaluation of Machine Translation. Proceedings of ACL 2002. link ↗ |
| نامهای دیگر≠ | NLG, data-to-text, text generation, Doğal Dil Üretimi (NLG) | Otomatik Metin Değerlendirme (BLEU, ROUGE, BERTScore), NLG evaluation, MT evaluation metrics |
| مرتبط≠ | 7 | 4 |
| خلاصه≠ | Natural Language Generation (NLG) is the branch of natural language processing that automatically produces fluent, human-readable text from structured data, knowledge graphs, or semantic representations. Formalised in the classical pipeline by Reiter and Dale (2000) and surveyed comprehensively by Gatt and Krahmer (2018), NLG powers applications ranging from automated financial reporting and weather bulletins to data storytelling and conversational agents. | Automatic text evaluation is a family of reference-based metrics used to measure the quality of machine-generated text — such as translations, summaries, or natural-language-generation (NLG) outputs — by comparing them to one or more human-written reference texts. Pioneered by Papineni et al. with BLEU in 2002, the field has grown to include n-gram overlap metrics (BLEU, ROUGE) and semantically aware metrics (BERTScore, MoverScore) that capture meaning beyond surface word matches. |
| ScholarGateمجموعهداده ↗ |
|
|