BM25 Probabilistic Ranking (Okapi)
BM25, the Okapi 'Best Matching 25' function, is the dominant classical ranking function in information retrieval and the workhorse term-weighting scheme behind most lexical search engines and bibliographic databases. Developed by Stephen Robertson, Karen Spärck Jones and colleagues at City University London and formalized in Robertson and Zaragoza's 2009 monograph on the Probabilistic Relevance Framework, BM25 scores a document against a query as a sum, over query terms, of inverse-document-frequency weights multiplied by a saturating, length-normalized transform of within-document term frequency. Two free parameters control how quickly repeated terms stop adding evidence (k1) and how strongly document length is penalized (b). BM25 consistently outperformed plain TF-IDF in the TREC evaluations and remains the standard first-stage retrieval baseline against which modern neural rankers are measured.
阅读完整方法
使用免费账户登录即可阅读本节。
方法图谱
相关方法的邻域——选择一个节点以展开探索。
来源
- Robertson, S., & Zaragoza, H. (2009). The Probabilistic Relevance Framework: BM25 and Beyond. Foundations and Trends in Information Retrieval, 3(4), 333-389. DOI: 10.1561/1500000019 ↗
- Robertson, S. E., Walker, S., Jones, S., Hancock-Beaulieu, M. M., & Gatford, M. (1995). Okapi at TREC-3. In Overview of the Third Text REtrieval Conference (TREC-3), NIST Special Publication 500-225, 109-126. link ↗
如何引用本页
ScholarGate. (2026, June 23). BM25 Probabilistic Ranking (Okapi BM25 Term-Weighting and Document Scoring). ScholarGate. https://scholargate.app/zh/bibliometrics/bm25-ranking
选用哪种方法?
将本方法与其最相近的同类并置,并排研读——本馆将书籍铺陈于案上,取舍则由您定夺。
- Citation Context and Sentiment Analysis文献计量学↔ 比较
- Mean Average Precision (MAP)文献计量学↔ 比较
- Normalized Discounted Cumulative Gain (nDCG)文献计量学↔ 比较