All Stories

  1. Unbiased Reasoning for Knowledge-Intensive Tasks in Large Language Models via Conditional Front-Door Adjustment
  2. Revisiting Pre-processing Group Fairness: A Modular Benchmarking Framework
  3. Temporal-Aware User Behaviour Simulation with Large Language Models for Recommender Systems
  4. PUB: An LLM-Enhanced Personality-Driven User Behaviour Simulator for Recommender System Evaluation
  5. Towards Better Evaluation of Recommendation Algorithms with Bi-directional Item Response Theory
  6. Testing fairness measures in machine learning using an approach for designing tests in education
  7. Off-policy Evaluation for Multiple Actions in the Presence of Unobserved Confounders