Atomic Action Slicing: Planner-Aligned Options for Generalist VLA Agents

Stefan Tabakov; Asen Popov; Dimitar Dimitrov; S. Ensiye Kiyamousavi; Boris Kraychev; Vladimir Hristov

doi:10.1145/3748522.3779892

What is it about?

This publication presents Atomic Action Slicing, a method for helping generalist Vision-Language-Action (VLA) robots learn long tasks more effectively. Instead of treating a full robot demonstration as one continuous sequence, the method breaks it into smaller, meaningful steps such as reaching, grasping, moving, and placing an object. These shorter “atomic actions” are easier to label, understand, plan with, and learn from. Using the LIBERO robot manipulation benchmark, the work creates a validated dataset of 2,124 atomic action segments and shows that training a VLA model on these segments can improve performance on long-horizon robotic tasks.
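To make the idea of slicing concrete, here is a minimal sketch in Python. It is an illustration only and not the authors' pipeline: it assumes each frame of a demonstration already carries an atomic action label, which this summary does not describe, and the names `Segment` and `slice_demo` are hypothetical.

```python
from dataclasses import dataclass
from itertools import groupby
from typing import List, Sequence


@dataclass
class Segment:
    label: str  # atomic action name, e.g. "reach"
    start: int  # index of the first frame (inclusive)
    end: int    # index one past the last frame (exclusive)


def slice_demo(frame_labels: Sequence[str]) -> List[Segment]:
    """Group consecutive frames that share a label into one segment.

    Assumption: per-frame labels are given. How the real method
    obtains and validates segment boundaries is not shown here.
    """
    segments = []
    pos = 0
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)
        segments.append(Segment(label, pos, pos + length))
        pos += length
    return segments


# A toy 11-frame demonstration of a pick-and-place task.
demo = ["reach"] * 3 + ["grasp"] * 2 + ["move"] * 4 + ["place"] * 2
segments = slice_demo(demo)
for seg in segments:
    print(seg.label, seg.start, seg.end)
```

Each resulting segment is a short, separately labeled piece of the original demonstration, which is the kind of unit the summary describes as easier to plan with and learn from.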


Why is it important?

Modern robot foundation models can follow language instructions, but they often struggle when a task requires several steps, new object combinations, or careful planning. This work helps bridge the gap between high-level planning and low-level robot control by aligning robot demonstrations with structured atomic actions. This makes robot learning more interpretable and more reusable: instead of memorizing full demonstrations, a robot can learn smaller skills that can be recombined for new tasks. The results show improved success on LIBERO-Goal and LIBERO-Long after fine-tuning CLIP-RT+ with the atomic dataset.

Perspectives

For me, this publication is an important step in my research journey because it connects several areas I am deeply interested in: computer vision, robotics, machine learning, planning, and Vision-Language-Action models. I was especially excited by the idea that robots should not only imitate demonstrations, but also understand their internal structure. Contributing to this work helped me better understand how dataset design, action segmentation, and planner-aligned representations can make robot learning more scalable and useful for real-world long-horizon tasks.
Asen Popov
Technical University of Sofia

This page is a summary of: Atomic Action Slicing: Planner-Aligned Options for Generalist VLA Agents, March 2026, ACM (Association for Computing Machinery),
DOI: 10.1145/3748522.3779892.

Resources

Data
GATE-VLAP Dataset on Hugging Face (AAS method)
This dataset contains the planner-aligned robotic demonstration data used in our work on Atomic Action Slicing. It provides structured robot task data for studying long-horizon robot learning, Vision-Language-Action models, and action decomposition. The dataset is publicly available through Hugging Face to support reproducibility and further research in robot learning.

Contributors

The following have contributed to this page

Asen Popov
Technical University of Sofia

Teaching robots to break complex tasks into simple actions