TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework

Chao Zhang; Yuhao Wang; Derong Xu; Haoxin Zhang; Yuanjie Lyu; Yuhao Chen; Shuochen Liu; Tong Xu; Xiangyu Zhao; Yan Gao; Yao Hu; Enhong Chen

doi:10.1145/3818621

What is it about?

Artificial Intelligence models often struggle to answer complex questions without reading and processing massive amounts of text. This requires a huge number of "tokens" (the basic units of data AI uses), making the process slow and expensive. We developed TeaRAG, a new framework that helps AI find and process information much more efficiently. It acts like a smart research assistant that not only filters and compresses the information it retrieves but also shortens the logical steps the AI takes to reach an answer. By combining traditional text search with a map of connected concepts (a knowledge graph), TeaRAG provides more accurate answers while cutting the amount of data processed by about 60%. This makes AI systems faster, cheaper, and more precise.

Photo by Numan Ali on Unsplash

Why is it important?

As large language models are increasingly deployed in real-world applications, the high computing costs and slow response times caused by processing massive amounts of context have become major bottlenecks. What makes TeaRAG unique is its dual-compression approach: it streamlines both the external information retrieved by the AI and the AI's internal reasoning steps. By introducing a novel training method (Iterative Process-aware DPO) and graph-based retrieval, TeaRAG achieves the rare feat of actually improving answer accuracy (by 2% to 4%) while simultaneously slashing computational token usage by nearly 60%. This breakthrough is highly timely, as it makes advanced, agent-based AI systems significantly more affordable, scalable, and sustainable for everyday business applications.

This page is a summary of: TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework, ACM Transactions on Information Systems, June 2026, ACM (Association for Computing Machinery),
DOI: 10.1145/3818621.
You can read the full text:

Read

Contributors

The following have contributed to this page

Chao Zhang
City University of Hong Kong

TeaRAG: A Faster and Cheaper Way for AI to Answer Complex Questions

What is it about?

Why is it important?

Contributors

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

TeaRAG: A Faster and Cheaper Way for AI to Answer Complex Questions

What is it about?

Featured Image

Why is it important?

Read the Original

Contributors

Share this page:

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management