Robot decision-making in partially observable, real-time, dynamic, and multi-agent environments remains a difficult and unsolved challenge. Model-free reinforcement learning (RL) is a promising approach to learning decisionmaking in such domains, however, end-to-end RL in complex environments is often intractable. To address this challenge in the RoboCup Standard Platform League (SPL) domain, we developed a novel architecture integrating RL within a classical robotics stack, while employing a multi-fidelity sim2real approach and decomposing behavior into learned sub-behaviors with heuristic selection. Our architecture led to victory in the 2024 RoboCup SPL Challenge Shield Division. In this work, we fully describe our system's architecture and empirically analyze key design decisions that contributed to its success. Our approach demonstrates how RL-based behaviors can be integrated into complete robot behavior architectures.
Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer
Adam Labiosa,Zhihan Wang,Siddhant Agarwal,William Cong,Geethika Hemkumar,A. Harish,Benjamin Hong,Josh Kelle,Chen Li,Yuhao Li,Zisen Shao,Peter Stone,Josiah P. Hanna
Published 2024 in IEEE International Conference on Robotics and Automation
ABSTRACT
PUBLICATION RECORD
- Publication year
2024
- Venue
IEEE International Conference on Robotics and Automation
- Publication date
2024-12-12
- Fields of study
Computer Science, Engineering
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-45 of 45 references · Page 1 of 1
CITED BY
Showing 1-4 of 4 citing papers · Page 1 of 1