What is it about?
This paper is about making edge-cloud vision systems adapt to real-world variability instead of assuming uniform workloads. In practice, video tasks vary widely in complexity, yet most systems process them identically. CLAP tackles this by introducing a scene-aware mechanism that estimates how complex each task is, and then dynamically decides how to process it through an adaptive pipeline spanning edge and cloud. Instead of fixed offloading or static scheduling, it builds a flexible, cross-layer decision system that adjusts computation stages, resource allocation, and execution paths on the fly.
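To make the core idea concrete, here is a minimal sketch of complexity-driven path selection. This is not CLAP's actual algorithm: the complexity score, the thresholds, and the path names (`edge-only`, `edge+cloud-pipeline`, `cloud-heavy`) are all invented for illustration of the general pattern of mapping scene complexity to an execution path.

```python
# Hypothetical sketch: estimate a scene-complexity score, then pick how much
# of the vision pipeline runs on the edge versus the cloud. All heuristics
# and names here are illustrative, not taken from the CLAP paper.

from dataclasses import dataclass

@dataclass
class Task:
    object_count: int    # e.g. detections from a lightweight edge model
    motion_level: float  # 0.0 (static scene) .. 1.0 (highly dynamic)

def complexity(task: Task) -> float:
    """Toy complexity score combining object density and motion."""
    return min(1.0, 0.1 * task.object_count + 0.5 * task.motion_level)

def schedule(task: Task) -> str:
    """Map the complexity score to an execution path across edge and cloud."""
    c = complexity(task)
    if c < 0.3:
        return "edge-only"           # simple scene: cheap local model suffices
    elif c < 0.7:
        return "edge+cloud-pipeline" # split pipeline stages across edge and cloud
    else:
        return "cloud-heavy"         # complex scene: offload most stages

print(schedule(Task(object_count=1, motion_level=0.1)))   # edge-only
print(schedule(Task(object_count=10, motion_level=1.0)))  # cloud-heavy
```

The point is the shape of the decision, not the heuristic: a cheap signal about scene difficulty gates how much computation each task receives, so simple scenes stop wasting resources and complex ones get the processing they need.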
Why is it important?
The key issue in current edge-cloud systems is not lack of resources, but inefficient use of them. Treating all tasks equally leads to wasted computation on simple scenes and insufficient processing for complex ones. This mismatch directly hurts latency, energy efficiency, and even accuracy. CLAP shows that by aligning scheduling decisions with actual scene complexity, systems can simultaneously improve throughput, reduce delay and energy consumption, and even boost accuracy. In other words, smarter scheduling—not more hardware—is what unlocks real performance gains in large-scale vision systems.
Perspectives
What I find interesting about this work is that it shifts the focus from “where to compute” to “how to adapt computation continuously.” It suggests that future edge-cloud systems should not rely on static architectures or fixed pipelines, but behave more like adaptive organisms that respond to task variability in real time. This idea could extend beyond vision systems to other multimodal or large-model inference scenarios. More broadly, it raises a thought: as systems become more complex, efficiency will increasingly depend on intelligence in orchestration rather than raw computational power—and that might be where the next wave of innovation really happens.
Zheming Yang
University of the Chinese Academy of Sciences
Read the Original
This page is a summary of: CLAP: Cross-Layer Adaptive Pipelining Inference Scheduling for Resource-Efficient Edge-Cloud Vision Systems, ACM Transactions on Architecture and Code Optimization, March 2026, ACM (Association for Computing Machinery). DOI: 10.1145/3803806.