ScholarGate
アシスタント

手法を比較

選択した手法を並べて確認できます。異なる行はハイライト表示されます。

幻覚検出×質問応答 (QA)×
分野テキストマイニングテキストマイニング
系統Process / pipelineProcess / pipeline
提唱年2020 (faithfulness framing); 2023 (SelfCheckGPT)
提唱者Established as a formal task by Maynez et al. (2020); SelfCheckGPT zero-resource variant by Manakul et al. (2023)
種類NLP evaluation / quality-assurance pipelineNLP text-comprehension task
原典Maynez, J., Narayan, S., Bohnet, B., & McDonald, R. (2020). On Faithfulness and Factuality in Abstractive Summarization. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 1906-1919. link ↗Rajpurkar, P. et al. (2016). SQuAD: 100,000+ Questions for Machine Comprehension of Text. EMNLP. DOI ↗
別名factual consistency checking, faithfulness evaluation, LLM output verification, Hallüsinasyon Tespiti (Factual Consistency)QA, machine reading comprehension, Soru Cevaplama (Question Answering)
関連54
概要Hallucination detection is a natural-language-processing pipeline that measures whether the output of a language model is consistent with a reference source document or with verifiable facts. Formalised as a faithfulness evaluation task by Maynez et al. (2020) and extended to a zero-resource black-box setting by Manakul et al. (2023) with SelfCheckGPT, the approach is used to flag unreliable LLM outputs in high-stakes domains such as medicine, law, and journalism.Question answering is a natural-language-processing task that automatically answers natural-language questions grounded in a given context passage, using either extractive or generative approaches. The task was crystallised by the SQuAD benchmark of Rajpurkar et al. (2016), and later models such as XLNet (Yang et al., 2019) pushed reading-comprehension accuracy higher.
ScholarGateデータセット
  1. v1
  2. 2 出典
  3. PUBLISHED
  1. v1
  2. 2 出典
  3. PUBLISHED

検索へ スライドをダウンロード

ScholarGate手法を比較: Hallucination Detection · Question Answering. 2026-06-18に以下より取得 https://scholargate.app/ja/compare