All Stories

  1. Can LLMs Uphold Research Integrity? Evaluating the Role of LLMs in Peer Review Quality
  2. Query Performance Prediction Using Neural Query Space Proximity
  3. ProActLLM: Proactive Conversational Information Seeking with Large Language Models
  4. Building Trustworthy Peer Review Quality Assessment Systems
  5. RottenReviews: Benchmarking Review Quality with Human and LLM-Based Judgments
  6. A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
  7. VAP3: Variation-Aware Prompt Performance Prediction
  8. IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
  9. Benchmarking LLM-based Relevance Judgment Methods
  10. Query Performance Prediction: Theory, Techniques and Applications
  11. Query Performance Prediction: Techniques and Applications in Modern Information Retrieval
  12. Evaluating Relative Retrieval Effectiveness with Normalized Residual Gain
  13. Offline Evaluation of Set-Based Text-to-Image Generation
  14. Reviewerly: Modeling the Reviewer Assignment Task as an Information Retrieval Problem
  15. Enhanced Retrieval Effectiveness through Selective Query Generation
  16. Retrieving Supporting Evidence for Generative Question Answering
  17. Noisy Perturbations for Estimating Query Difficulty in Dense Retrievers
  18. A is for Adele: An Offline Evaluation Metric for Instant Search
  19. Quantifying Ranker Coverage of Different Query Subspaces
  20. A Preference Judgment Tool for Authoritative Assessment
  21. Gender Fairness in Information Retrieval Systems
  22. Addressing Gender-related Performance Disparities in Neural Rankers
  23. Predicting Efficiency/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection
  24. MS MARCO Chameleons: Challenging the MS MARCO Leaderboard with Extremely Obstinate Queries
  25. Matches Made in Heaven: Toolkit and Large-Scale Datasets for Supervised Query Reformulation
  26. BERT-QPP: Contextualized Pre-trained transformers for Query Performance Prediction
  27. On the Orthogonality of Bias and Utility in Ad hoc Retrieval
  28. Geometric Estimation of Specificity within Embedding Spaces