All Stories

  1. PILs of Knowledge: A Synthetic Benchmark for Evaluating Question Answering Systems in Healthcare
  2. The Magnitude of Truth: On Using Magnitude Estimation for Truthfulness Assessment
  3. Efficiency and Effectiveness of LLM-Based Summarization of Evidence in Crowdsourced Fact-Checking
  4. Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas
  5. Report on the 14th Italian Information Retrieval Workshop (IIR 2024)
  6. The Elusiveness of Detecting Political Bias in Language Models: The Impact of Question Wording
  7. Generative AI for Energy: Multi-Horizon Power Consumption Forecasting using Large Language Models
  8. Understanding the Barriers to Running Longitudinal Studies on Crowdsourcing Platforms
  9. Combining Large Language Models and Crowdsourcing for Hybrid Human-AI Misinformation Detection
  10. Data Bias Management
  11. How Many Crowd Workers Do I Need? On Statistical Power When Crowdsourcing Relevance Judgments
  12. Combining Human and Machine Confidence in Truthfulness Assessment
  13. Preferences on a Budget: Prioritizing Document Pairs when Crowdsourcing Relevance Judgments
  14. Crowd_Frame: A Simple and Complete Framework to Deploy Complex Crowdsourcing Tasks Off-the-Shelf