What is it about?
We propose PANDA, a novel dataflow architecture that improves performance through adaptive prefetching and decentralized scheduling. Rather than relying on rigid memory access and centralized control, PANDA introduces an application-adaptive prefetching method with a reconfigurable on-chip memory architecture, as well as a distributed dynamic scheduling strategy built on decentralized processing elements (PEs). Through hardware-software co-optimization of the dataflow execution model, PANDA effectively reduces memory access latency and improves hardware utilization. Evaluated on a wide range of real-world applications, PANDA achieves up to 2.53× performance improvement and 1.79× energy-efficiency improvement over state-of-the-art dataflow architectures.
Featured Image
Photo by A Chosen Soul on Unsplash
Why is it important?
Dataflow architectures offer an attractive balance of performance, energy efficiency, and flexibility, making them a strong candidate for future computing systems. However, their practical performance is often limited by two key bottlenecks: inefficient memory access and centralized task scheduling. Existing designs struggle to efficiently support both regular and irregular memory access patterns, while centralized control can introduce communication overhead and reduce hardware utilization. PANDA addresses these challenges through adaptive prefetching, a reconfigurable on-chip memory architecture, and decentralized dynamic scheduling. By reducing memory access latency and improving processing element utilization, PANDA provides a practical way to make dataflow architectures faster and more efficient for real-world applications.
Perspectives
For edge computing, the real bottleneck of dataflow architectures lies in how well they can adapt to diverse data access patterns and concurrent computation demands.
Shantian Qin
Institute of Computing Technology, Chinese Academy of Sciences
Read the Original
This page is a summary of: PANDA: Adaptive Prefetching and Decentralized Scheduling for Dataflow Architectures, ACM Transactions on Architecture and Code Optimization, June 2025, ACM (Association for Computing Machinery),
DOI: 10.1145/3721288.
You can read the full text:
Contributors
The following have contributed to this page







