What is it about?

A more up-to-date comparison of 17 term-independence retrieval models, including the SPUD language model, MATF-like retrieval models, machine-generated retrieval models, divergence-from-independence retrieval models and the common models (such as BM25, the language model and PL2). The best-performing model is used as the baseline against which the other models are compared using a one-tailed test. Under a multiple-hypothesis testing method, no model performs consistently without a statistically significant difference from the top-performing model across all the test collections.
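
To make the multiple-hypothesis testing step concrete, here is a minimal sketch in Python. The summary does not say which correction the paper uses, so the Holm-Bonferroni procedure below is an assumption chosen only for illustration. The function name, the collection names and the p-values are all hypothetical and are not results from the paper.

```python
def holm_bonferroni(p_values, alpha=0.05):
    """Return one boolean per p-value: True where the null hypothesis is rejected.

    Illustrative Holm-Bonferroni step-down procedure (an assumed choice,
    not necessarily the correction used in the paper).
    """
    m = len(p_values)
    # Visit the hypotheses from the smallest p-value to the largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        # The (rank + 1)-th smallest p-value is compared with alpha / (m - rank).
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break  # once one comparison fails, all larger p-values are retained
    return reject


# Hypothetical one-tailed p-values for one model compared against the
# top-performing model on three made-up test collections.
p_by_collection = {
    "collection_A": 0.001,
    "collection_B": 0.020,
    "collection_C": 0.300,
}
names = list(p_by_collection)
decisions = holm_bonferroni([p_by_collection[n] for n in names])
significantly_worse = dict(zip(names, decisions))
```

In this made-up case the model would be judged significantly worse than the top performer on the first two collections but not on the third, so it would not count as statistically indistinguishable from the best model across all collections.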

Why is it important?

Some models do perform well on most test collections, and perhaps those models should be used as the baselines for future comparisons.


This is perhaps one of the more extensive comparisons of term-independence retrieval models for ad hoc retrieval. The results of this comparison can serve as a reference for those who want to replicate or reproduce the retrieval performance.

Robert Luk
Hong Kong Polytechnic University

Read the Original

This page is a summary of: A Comparison between Term-Independence Retrieval Models for Ad Hoc Retrieval, ACM Transactions on Information Systems, July 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3483612.