Leveraging Symbolic Models in Reinforcement Learning for Multi-skill Chaining
Paper in proceedings, 2025
We envision robots learning new skills as efficiently as possible. A key challenge in this pursuit is that the efficiency of learning systems such as Reinforcement Learning (RL) deteriorates with task complexity. For instance, building a tower of cubes requires multiple subtasks to be executed in the correct sequence to succeed at this long-horizon task. As the sequence grows, determining the correct order of actions becomes increasingly difficult, particularly for RL methods that rely on trial and error. To tackle this, we propose a new method that integrates symbolic models into RL to boost learning efficiency in long-horizon tasks. Symbolic models offer task abstraction and can enhance the sample efficiency of RL agents through high-level operators. Our approach decomposes tasks in alignment with the structure of Automated Planning (AP) operators, enabling RL agents to synthesize individual skills for specific subtasks and thereby require fewer learning samples. The decomposition is designed with two goals: 1) reducing errors when chaining subsequent skills together and 2) enhancing skill reusability for downstream tasks with a similar structure. In simulated robot manipulation tasks, such as stacking two cubes, experiments demonstrate superior sample efficiency for our approach (a 2x reduction in training cost) compared to most RL baselines. Furthermore, our method generalises robustly to unseen rearrangement tasks with minimal interaction steps (fewer than 100), achieving an average success rate approximately 50% higher than baselines, which often struggle to make progress.
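To make the idea of operator-aligned skill chaining concrete, the following is a minimal, hypothetical sketch (all names are illustrative, not the paper's implementation): each Automated Planning operator has a symbolic precondition and effect, one skill is learned per operator, and a plan succeeds only if every skill's precondition holds when it is invoked. For brevity, executing a skill here simply applies the operator's symbolic effect, standing in for rolling out a learned RL policy.

```python
from typing import Dict, List, Tuple

# Symbolic state: predicate name -> truth value (hypothetical representation).
State = Dict[str, bool]

class Skill:
    """A per-operator skill with a symbolic precondition and effect."""
    def __init__(self, name: str, pre: Dict[str, bool], eff: Dict[str, bool]):
        self.name, self.pre, self.eff = name, pre, eff

    def applicable(self, s: State) -> bool:
        # Precondition check: every required predicate must match.
        return all(s.get(k, False) == v for k, v in self.pre.items())

    def execute(self, s: State) -> State:
        # Placeholder for a learned RL policy rollout: apply the
        # operator's symbolic effect to produce the next state.
        out = dict(s)
        out.update(self.eff)
        return out

def chain_skills(s: State, plan: List[Skill]) -> Tuple[State, bool]:
    """Run skills in plan order; abort if a precondition is unmet."""
    for skill in plan:
        if not skill.applicable(s):
            return s, False  # chaining error: the link between skills broke
        s = skill.execute(s)
    return s, True

# Toy stacking subtask sequence: pick up cube A, then place it on cube B.
pick = Skill("pick(A)",
             pre={"clear(A)": True, "hand_empty": True},
             eff={"holding(A)": True, "hand_empty": False})
place = Skill("place(A,B)",
              pre={"holding(A)": True, "clear(B)": True},
              eff={"on(A,B)": True, "holding(A)": False, "hand_empty": True})

start = {"clear(A)": True, "clear(B)": True, "hand_empty": True}
final, ok = chain_skills(start, [pick, place])  # ok is True; on(A,B) holds
```

Chaining fails exactly when a skill's precondition is violated at invocation time, which is why the decomposition emphasises reducing errors at the links between subsequent skills.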