ScholarGate
助手

方法对比

并排查看您选择的方法;存在差异的行会高亮显示。

TREC Pooling and Relevance Judgments×Cranfield Evaluation Paradigm×
领域Library Information ScienceLibrary Information Science
方法族Process / pipelineProcess / pipeline
起源年份20051967
提出者Ellen M. Voorhees & Donna K. Harman (NIST TREC)Cyril W. Cleverdon
类型Pooled relevance-assessment pipeline for large test collectionsTest-collection evaluation pipeline for retrieval effectiveness
开创性文献Voorhees, E. M., & Harman, D. K. (Eds.). (2005). TREC: Experiment and Evaluation in Information Retrieval. MIT Press. ISBN: 9780262220736Cleverdon, C. W. (1967). The Cranfield tests on index language devices. Aslib Proceedings, 19(6), 173-194. DOI ↗
别名Pooling Method, Depth Pooling, TREC Pooling, Pooled Relevance AssessmentCranfield Methodology, Test Collection Evaluation, Cranfield Tests, Laboratory IR Evaluation
相关33
摘要Pooling is the technique that lets the Cranfield evaluation paradigm scale to collections of millions of documents, where judging every document for every topic is impossible. Developed and institutionalized at the US National Institute of Standards and Technology for the Text REtrieval Conference (TREC), pooling gathers the top-ranked documents returned by many participating systems for each topic, merges them into a single pool, has human assessors judge only that pool, and treats every unjudged document as non-relevant. The result is a reusable test collection — documents, topics, and pooled relevance judgments (qrels) — on which new systems can later be scored without further assessment. Pooling is what made large-scale, reproducible retrieval evaluation feasible.The Cranfield evaluation paradigm is the foundational experimental design for measuring how well an information retrieval system finds relevant documents. Devised by Cyril Cleverdon at the College of Aeronautics in Cranfield during the 1960s, it fixes three ingredients — a document collection, a set of search requests, and human relevance judgments linking requests to documents — and then holds them constant so that competing indexing methods or retrieval algorithms can be compared on recall and precision under controlled, repeatable conditions. By abstracting evaluation away from any single live user and turning it into a reusable laboratory experiment, Cranfield made retrieval effectiveness a measurable quantity and supplied the template that every later large-scale campaign, including TREC, has built upon.
ScholarGate数据集
  1. v1
  2. 3 来源
  3. PUBLISHED
  1. v1
  2. 3 来源
  3. PUBLISHED

前往搜索 下载幻灯片

ScholarGate方法对比: TREC Pooling and Relevance Judgments · Cranfield Evaluation Paradigm. 于 2026-06-25 检索自 https://scholargate.app/zh/compare