What is it about?
When you type a question into a search engine, it has to decide which documents to show first. A new family of systems, called generative retrieval, treats this like a guessing game: the model “spells out” the ID of a document, token by token, and the first guess becomes rank #1, the next guess rank #2, and so on. The snag? These models practise spelling, not ranking, so their top guesses are not always the most relevant.

Our work introduces DDRO – Direct Document-Relevance Optimisation. Instead of teaching the model with a complex reward function and reinforcement learning, we give it simple “win / lose” examples: for the same question, we show that document A should beat document B. We keep the model close to its original behaviour with a gentle KL-divergence constraint (think of it as a leash that allows exploration but prevents the model from wandering too far).
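The “win / lose” training signal with a KL leash resembles a direct-preference objective. As a rough sketch only – not the paper’s exact implementation, and with made-up function and variable names – the per-pair loss could be computed from the model’s and a frozen reference model’s log-probabilities of the winning and losing document IDs:

```python
import math

def ddro_pairwise_loss(logp_win, logp_lose, ref_logp_win, ref_logp_lose, beta=0.1):
    """Hypothetical sketch of a KL-regularised pairwise ("win/lose") loss.

    logp_win / logp_lose: the model's log-probability of spelling out the
    winning / losing document ID for the same query.
    ref_logp_win / ref_logp_lose: the same quantities under a frozen
    reference model (the "leash" that keeps behaviour close to the original).
    beta: strength of the KL constraint (assumed hyperparameter).
    """
    # How much more the model prefers the winner than the reference does,
    # minus the same quantity for the loser.
    margin = beta * ((logp_win - ref_logp_win) - (logp_lose - ref_logp_lose))
    # Negative log-sigmoid of the margin: small when the model correctly
    # prefers the winning document, large when it prefers the loser.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Toy example: the model prefers the winner more than the reference does,
# so the loss is lower than in the reversed case.
loss_good = ddro_pairwise_loss(-1.0, -3.0, -2.0, -2.0)
loss_bad = ddro_pairwise_loss(-3.0, -1.0, -2.0, -2.0)
```

Note the design choice this illustrates: the reference log-probabilities enter only through differences, so documents the model and reference agree on contribute no gradient, and only genuine preference shifts are rewarded.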
Why is it important?
- Sharper answers where users look first. Most people click one of the first few results; boosting top-hit accuracy by 15–35% means more questions are answered on the first click.
- Methodological simplicity and reproducibility. By replacing reinforcement-learning machinery with a single KL-regularised pairwise loss, the approach can be replicated with standard sequence-to-sequence tooling and publicly available relevance labels.
- Resource-efficient fine-tuning. DDRO trains on one GPU with ordinary click-log or qrel data, lowering the computational barrier to high-quality generative-retrieval research.

In short, DDRO delivers better early-rank relevance and lowers the cost of entry.
Perspectives
I enjoyed simplifying generative retrieval: swapping a bulky RL loop for a single pair-wise loss made the idea easy to test on one GPU, yet it still delivered strong early-rank gains. I hope this lighter recipe helps others experiment with GenIR models more easily.
Kidist Amde Mekonnen
University of Amsterdam
Read the Original
This page is a summary of: Lightweight and Direct Document Relevance Optimization for Generative Information Retrieval, July 2025, ACM (Association for Computing Machinery), DOI: 10.1145/3726302.3730023.