Process / pipelineInformation retrieval evaluation

TREC Pooling and Relevance Judgments

Pooling is the technique that lets the Cranfield evaluation paradigm scale to collections of millions of documents, where judging every document for every topic is impossible. Developed and institutionalized at the US National Institute of Standards and Technology for the Text REtrieval Conference (TREC), pooling gathers the top-ranked documents returned by many participating systems for each topic, merges them into a single pool, has human assessors judge only that pool, and treats every unjudged document as non-relevant. The result is a reusable test collection — documents, topics, and pooled relevance judgments (qrels) — on which new systems can later be scored without further assessment. Pooling is what made large-scale, reproducible retrieval evaluation feasible.

在 MethodMind 中打开即将推出应用、比较、获取指导

工具与资源

下载幻灯片

学习与探索

视频即将推出

阅读完整方法

仅限会员

使用免费账户登录即可阅读本节。

方法图谱

相关方法的邻域——选择一个节点以展开探索。

TREC Pooling and Relevance Judgments

Cranfield Evaluation Par…Query Expansion Evaluati…Relevance Feedback Evalu…Inter-Indexer Consistency

来源

Voorhees, E. M., & Harman, D. K. (Eds.). (2005). TREC: Experiment and Evaluation in Information Retrieval. MIT Press. ISBN: 9780262220736
Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to Information Retrieval. Cambridge University Press. ISBN: 9780521865715
Cleverdon, C. W. (1967). The Cranfield tests on index language devices. Aslib Proceedings, 19(6), 173-194. DOI: 10.1108/eb050097 ↗

如何引用本页

ScholarGate. (2026, June 23). TREC Pooling and Relevance Judgments (Scalable Construction of Reusable Test Collections). ScholarGate. https://scholargate.app/zh/library-information-science/trec-pooling-relevance-judgments