A Comparison of Action Spaces for Learning Manipulation Tasks
Patrick Varin, Lev Grossman, S. Kuindersma
Published in 2019 at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

ABSTRACT
Designing reinforcement learning (RL) problems that can produce delicate and precise manipulation policies requires careful choice of the reward function, state, and action spaces. Much prior work on applying RL to manipulation tasks has defined the action space in terms of direct joint torques or reference positions for a joint-space proportional-derivative (PD) controller. In practice, it is often possible to add structure by taking advantage of model-based controllers that support both accurate positioning and control of the dynamic response of the manipulator. In this paper, we evaluate how the choice of action space for dynamic manipulation tasks affects both the sample complexity and the final quality of learned policies. We compare learning performance across three tasks (peg insertion, hammering, and pushing), four action spaces (torque, joint PD, inverse dynamics, and impedance control), and two modern reinforcement learning algorithms (Proximal Policy Optimization and Soft Actor-Critic). Our results support the hypothesis that learning references for a task-space impedance controller significantly reduces the number of samples needed to achieve good performance across all tasks and algorithms.
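The four action spaces under comparison differ only in how a policy's output is converted into the joint torques sent to the robot. The Python sketch below illustrates one plausible form of each mapping; the gains, the dynamics quantities (mass matrix M, bias forces h, Jacobian J, gravity vector g), and the toy state values are illustrative assumptions, and the paper's exact controller formulations may differ.

```python
import numpy as np

# Each function maps a policy action `a` (plus current robot state) to joint
# torques. All gains and dynamics quantities are illustrative placeholders,
# not values taken from the paper.

def torque(a):
    """Direct torque action space: the policy output is the joint torque."""
    return a

def joint_pd(a, q, qd, kp=50.0, kd=5.0):
    """Joint PD: the action is a reference joint position q_ref; a fixed
    proportional-derivative law turns the tracking error into torque."""
    return kp * (a - q) - kd * qd

def inverse_dynamics(a, q, qd, M, h):
    """Inverse dynamics: the action is a desired joint acceleration qdd_des.
    The manipulator equation tau = M(q) qdd + h(q, qd) maps it to torque,
    where h collects Coriolis, centrifugal, and gravity terms."""
    return M @ a + h

def task_space_impedance(a, x, xd, J, g, Kp, Kd):
    """Task-space impedance: the action is an end-effector reference x_ref.
    A Cartesian spring-damper produces a desired wrench, which the Jacobian
    transpose maps to joint torque, plus gravity compensation g(q)."""
    f = Kp @ (a - x) - Kd @ xd
    return J.T @ f + g

# Toy usage for a 3-DoF arm with a 2-D task space (placeholder state).
nq, nx = 3, 2
q, qd = np.zeros(nq), np.zeros(nq)
M, h, g = np.eye(nq), np.zeros(nq), np.zeros(nq)
J = np.random.randn(nx, nq)
x, xd = np.zeros(nx), J @ qd
Kp, Kd = 100.0 * np.eye(nx), 10.0 * np.eye(nx)

print(torque(np.ones(nq)))
print(joint_pd(np.ones(nq), q, qd))
print(inverse_dynamics(np.ones(nq), q, qd, M, h))
print(task_space_impedance(np.ones(nx), x, xd, J, g, Kp, Kd))
```

The design distinction the abstract highlights is that the impedance controller lets the policy act in task space while a fixed spring-damper handles the fast dynamic response, which is the structure the reported results favor.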
PUBLICATION RECORD
- Publication year: 2019
- Venue: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
- Publication date: 2019-08-23
- Fields of study: Computer Science, Engineering