TREC Pooling and Relevance Judgments
Pooling is the technique that lets the Cranfield evaluation paradigm scale to collections of millions of documents, where judging every document for every topic is impossible. Developed and institutionalized at the US National Institute of Standards and Technology for the Text REtrieval Conference (TREC), pooling gathers the top-ranked documents returned by many participating systems for each topic, merges them into a single pool, has human assessors judge only that pool, and treats every unjudged document as non-relevant. The result is a reusable test collection — documents, topics, and pooled relevance judgments (qrels) — on which new systems can later be scored without further assessment. Pooling is what made large-scale, reproducible retrieval evaluation feasible.
阅读完整方法
使用免费账户登录即可阅读本节。
方法图谱
相关方法的邻域——选择一个节点以展开探索。
来源
- Voorhees, E. M., & Harman, D. K. (Eds.). (2005). TREC: Experiment and Evaluation in Information Retrieval. MIT Press. ISBN: 9780262220736
- Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to Information Retrieval. Cambridge University Press. ISBN: 9780521865715
- Cleverdon, C. W. (1967). The Cranfield tests on index language devices. Aslib Proceedings, 19(6), 173-194. DOI: 10.1108/eb050097 ↗
如何引用本页
ScholarGate. (2026, June 23). TREC Pooling and Relevance Judgments (Scalable Construction of Reusable Test Collections). ScholarGate. https://scholargate.app/zh/library-information-science/trec-pooling-relevance-judgments
选用哪种方法?
将本方法与其最相近的同类并置,并排研读——本馆将书籍铺陈于案上,取舍则由您定夺。
- Cranfield Evaluation ParadigmLibrary Information Science↔ 比较
- Query Expansion EvaluationLibrary Information Science↔ 比较
- Relevance Feedback EvaluationLibrary Information Science↔ 比较