
Hallucination Detection — Factual Consistency Checking for LLM Outputs

Hallucination Detection (Factual Consistency) · Also known as: factual consistency checking, faithfulness evaluation, LLM output verification, Hallüsinasyon Tespiti (Factual Consistency)

Hallucination detection is a natural-language-processing pipeline that measures whether the output of a language model is consistent with a reference source document or with verifiable facts. Formalised as a faithfulness evaluation task by Maynez et al. (2020) and extended to a zero-resource black-box setting by Manakul et al. (2023) with SelfCheckGPT, the approach is used to flag unreliable LLM outputs in high-stakes domains such as medicine, law, and journalism.


When to use it

Hallucination detection applies whenever a language model generates text that is supposed to be grounded in a source document or in verifiable facts — abstractive summarisation, retrieval-augmented generation, medical report drafting, legal document analysis, and news generation are the canonical use cases. A source document or verification basis must be available; without one, only the sampling-based (SelfCheckGPT) variant can operate, and it requires the ability to draw multiple samples from the model. The minimum corpus for evaluation is around ten reference-output pairs. The method does not apply to purely creative generation where factual grounding is not expected.

Strengths & limitations

Strengths

Provides a principled, automated way to audit LLM outputs for factual consistency before they reach end users.
Multiple verification strategies — NLI, QA, sampling — cover both white-box and black-box model access scenarios.
Fine-grained claim-level scoring reveals exactly which sentences are hallucinated, enabling targeted correction.

Limitations

NLI and QA verifiers are themselves language models and can fail on complex, multi-hop, or ambiguous claims.
The sampling-based approach (SelfCheckGPT) requires multiple inference calls per prompt, which increases computational cost and latency.
Performance depends on the quality and coverage of the reference source; incomplete or outdated references lead to false positives.

Frequently asked

What is the difference between an intrinsic and an extrinsic hallucination?

An intrinsic hallucination directly contradicts information present in the source document — for example, a summary that says a patient was treated with drug A when the source says drug B. An extrinsic hallucination introduces information that is neither supported nor contradicted by the source, such as adding a biographical detail that does not appear anywhere in the input. The distinction matters because extrinsic hallucinations are harder to catch with a simple contradiction check.
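The distinction can be sketched as a mapping from NLI labels to hallucination types. This is a minimal illustration, assuming a verifier that returns one of the three standard NLI labels for a (source, claim) pair; the `Verdict` enum and `classify_claim` function are hypothetical names introduced here, and real NLI models may spell their labels differently.

```python
from enum import Enum


class Verdict(Enum):
    SUPPORTED = "supported"
    INTRINSIC = "intrinsic hallucination"
    EXTRINSIC = "extrinsic hallucination"


# Assumed label strings; check the label set of the NLI model actually used.
_LABEL_TO_VERDICT = {
    "entailment": Verdict.SUPPORTED,     # source supports the claim
    "contradiction": Verdict.INTRINSIC,  # source contradicts the claim
    "neutral": Verdict.EXTRINSIC,        # source neither supports nor contradicts
}


def classify_claim(nli_label: str) -> Verdict:
    """Map an NLI label for a (source, claim) pair to a hallucination type."""
    key = nli_label.strip().lower()
    if key not in _LABEL_TO_VERDICT:
        raise ValueError(f"unknown NLI label: {nli_label!r}")
    return _LABEL_TO_VERDICT[key]
```

Under this mapping, a detector that only looks for the contradiction label would let every extrinsic hallucination through, which is why the neutral case needs its own handling.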

Do I need a reference document to run hallucination detection?

For NLI-based and QA-based verification you do need a reference source. The SelfCheckGPT approach is the exception: it draws multiple samples from the model for the same prompt and treats cross-sample consistency as a proxy for factuality, so no external reference is required — but you do need the ability to call the model multiple times, and you need to interpret the resulting consistency score carefully.
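The cross-sample idea can be illustrated with a toy score. The sketch below is not the SelfCheckGPT scoring method: it swaps in plain token overlap as a stand-in for the model-based comparison, purely to show how a sentence is scored against several resampled outputs. All function names are hypothetical.

```python
import re
from typing import List, Set


def _tokens(text: str) -> Set[str]:
    """Lowercased alphanumeric tokens of a text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def sentence_support(sentence: str, sample: str) -> float:
    """Fraction of the sentence's tokens that also occur in the sample."""
    sent = _tokens(sentence)
    if not sent:
        return 1.0  # nothing to check
    return len(sent & _tokens(sample)) / len(sent)


def inconsistency_score(sentence: str, samples: List[str]) -> float:
    """Mean of (1 - support) over all samples; higher means less consistent."""
    if not samples:
        raise ValueError("at least one sample is required")
    return sum(1.0 - sentence_support(sentence, s) for s in samples) / len(samples)


# Usage: 'samples' would be extra generations drawn for the same prompt.
samples = ["The capital of France is Paris", "Paris is France's capital city"]
stable = inconsistency_score("Paris is the capital of France", samples)
unstable = inconsistency_score("Berlin hosts the Eiffel Tower", samples)
```

A sentence the samples keep repeating gets a low score, while one that appears in no sample gets a high score. Token overlap misses paraphrases and negation, which is one reason the score needs careful interpretation.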

Which verification approach should I use — NLI, QA-based, or sampling?

If you have a well-defined source document (e.g., in a RAG pipeline or summarisation task), an NLI model such as a DeBERTa model fine-tuned on MNLI is a natural starting point and gives per-claim entailment labels. If question generation is feasible, QA-based verification can be more precise on factual claims. If you have black-box API access only and no reference document, SelfCheckGPT is the appropriate choice. In high-stakes domains, combining methods and accepting a claim only when all of them judge it supported reduces false negatives (missed hallucinations), at the cost of more false alarms.
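The guidance above can be written down as a small decision helper. This is a sketch under stated assumptions: the function names, the method labels, and the reading of "requiring agreement" as "accept a claim only if every verifier marks it supported" are illustrative choices, not a prescribed procedure.

```python
from typing import Dict, List


def candidate_methods(has_reference: bool,
                      can_generate_questions: bool,
                      can_sample: bool) -> List[str]:
    """Suggest verification methods from what is available."""
    methods: List[str] = []
    if has_reference:
        methods.append("nli")          # natural starting point with a source
        if can_generate_questions:
            methods.append("qa")       # optional, more precise on factual claims
    elif can_sample:
        methods.append("sampling")     # no reference: fall back to resampling
    return methods


def accept_claim(supported_by: Dict[str, bool]) -> bool:
    """Accept a claim only if every verifier judged it supported."""
    if not supported_by:
        raise ValueError("no verifier verdicts given")
    return all(supported_by.values())
```

With this rule, a single dissenting verifier is enough to flag a claim, so fewer hallucinations slip through but more faithful claims are sent for review.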

How is hallucination detection evaluated?

A labelled dataset of reference-output pairs annotated by human experts for faithfulness is the gold standard. The detector's predictions are compared against these labels using precision, recall, and F1 at the claim or sentence level. Published benchmarks such as FActScore and the SummaC benchmark provide standardised evaluation sets for this purpose.
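The comparison against human labels can be sketched as follows. The example assumes one boolean per claim, where `True` means "hallucinated"; the function name and the toy labels are illustrative and not taken from any benchmark.

```python
from typing import List, Tuple


def precision_recall_f1(gold: List[bool], pred: List[bool]) -> Tuple[float, float, float]:
    """Claim-level precision, recall, and F1 for the 'hallucinated' class."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    tp = sum(1 for g, p in zip(gold, pred) if g and p)        # correctly flagged
    fp = sum(1 for g, p in zip(gold, pred) if not g and p)    # false alarms
    fn = sum(1 for g, p in zip(gold, pred) if g and not p)    # missed hallucinations
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Toy labels for five claims: human annotation vs. detector output.
gold = [True, True, False, False, True]
pred = [True, False, True, False, True]
precision, recall, f1 = precision_recall_f1(gold, pred)
```

Here the detector flags three claims, two of them correctly, and misses one hallucinated claim, so precision, recall, and F1 all come out at 2/3.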

Sources

Maynez, J., Narayan, S., Bohnet, B., & McDonald, R. (2020). On Faithfulness and Factuality in Abstractive Summarization. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 1906-1919.
Manakul, P., Liusie, A., & Gales, M.J.F. (2023). SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), 9004-9017.

How to cite this page

ScholarGate. (2026, June 1). Hallucination Detection (Factual Consistency). ScholarGate. https://scholargate.app/en/text-mining/hallucination-detection



