Comparer des méthodes
Examinez les méthodes sélectionnées côte à côte ; les lignes qui diffèrent sont mises en évidence.
| Détection des hallucinations× | Reconnaissance d'entités nommées (REN)× | |
|---|---|---|
| Domaine | Fouille de textes | Fouille de textes |
| Famille | Process / pipeline | Process / pipeline |
| Année d'origine≠ | 2020 (faithfulness framing); 2023 (SelfCheckGPT) | — |
| Auteur d'origine≠ | Established as a formal task by Maynez et al. (2020); SelfCheckGPT zero-resource variant by Manakul et al. (2023) | — |
| Type≠ | NLP evaluation / quality-assurance pipeline | NLP sequence-labelling task |
| Source fondatrice≠ | Maynez, J., Narayan, S., Bohnet, B., & McDonald, R. (2020). On Faithfulness and Factuality in Abstractive Summarization. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 1906-1919. link ↗ | Nadeau, D. & Sekine, S. (2007). A survey of named entity recognition. Lingvisticae Investigationes. link ↗ |
| Alias≠ | factual consistency checking, faithfulness evaluation, LLM output verification, Hallüsinasyon Tespiti (Factual Consistency) | NER, entity tagging, Adlandırılmış Varlık Tanıma (NER) |
| Apparentées≠ | 5 | 3 |
| Résumé≠ | Hallucination detection is a natural-language-processing pipeline that measures whether the output of a language model is consistent with a reference source document or with verifiable facts. Formalised as a faithfulness evaluation task by Maynez et al. (2020) and extended to a zero-resource black-box setting by Manakul et al. (2023) with SelfCheckGPT, the approach is used to flag unreliable LLM outputs in high-stakes domains such as medicine, law, and journalism. | Named entity recognition (NER) is a natural-language-processing task that automatically detects and labels entities in text — such as people, organisations, locations, and dates. Surveyed by Nadeau and Sekine (2007) and later advanced with neural architectures by Lample et al. (2016), it turns free-running text into tagged spans that downstream tools can use. |
| ScholarGateJeu de données ↗ |
|
|