Process / pipeline

Hallucination Detection — Factual Consistency Checking for LLM Outputs

Hallucination detection is a natural-language-processing pipeline that measures whether the output of a language model is consistent with a reference source document or with verifiable facts. Formalised as a faithfulness evaluation task by Maynez et al. (2020) and extended to a zero-resource black-box setting by Manakul et al. (2023) with SelfCheckGPT, the approach is used to flag unreliable LLM outputs in high-stakes domains such as medicine, law, and journalism.

Open in MethodMindSoonVideoSoon

Read the full method

Members only

Sign in with a free account to read this section.

Sign in

Sources

  1. Maynez, J., Narayan, S., Bohnet, B., & McDonald, R. (2020). On Faithfulness and Factuality in Abstractive Summarization. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 1906-1919. link
  2. Manakul, P., Liusie, A., & Gales, M.J.F. (2023). SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 9004-9017. link

Related methods

ScholarGateHallucination Detection (Hallucination Detection (Factual Consistency)). Retrieved 2026-06-04 from https://scholargate.app/en/text-mining/hallucination-detection