Differentiable Physics Models for Real-world Offline Model-based Reinforcement Learning

M. Lutter, Johannes Silberbauer, Joe Watson, Jan Peters

Published 2020 in IEEE International Conference on Robotics and Automation

ABSTRACT

A limitation of model-based reinforcement learning (MBRL) is the exploitation of errors in the learned models. Black-box models can fit complex dynamics with high fidelity, but their behavior is undefined outside of the data distribution. Physics-based models extrapolate better, due to the general validity of their informed structure, but underfit in the real world due to the presence of unmodeled phenomena. In this work, we demonstrate experimentally that, in the offline model-based reinforcement learning setting, physics-based models can be beneficial compared to high-capacity function approximators if the mechanical structure is known. With offline MBRL, physics-based models can learn to perform the ball in a cup (BiC) task on a physical manipulator using only 4 minutes of sampled data. We find that black-box models consistently produce unviable policies for BiC, as all predicted trajectories diverge to physically impossible states, despite having access to more data than the physics-based model. In addition, we generalize the approach of physics parameter identification from modeling holonomic multi-body systems to systems with nonholonomic dynamics using end-to-end automatic differentiation.

Videos: https://sites.google.com/view/ball-in-a-cup-in-4-minutes/
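
To make the idea of identifying physics parameters through end-to-end automatic differentiation concrete, here is a minimal sketch on a toy problem. It is not the authors' implementation: the damped pendulum, the constants, the hand-written forward-mode `Dual` class, and the Gauss-Newton update are all assumptions chosen for illustration. The sketch differentiates a short simulated rollout with respect to the pendulum length and uses that derivative to recover the length from observed angles.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class Dual:
    """Forward-mode AD number: a value and its derivative w.r.t. one parameter."""
    val: float
    dot: float = 0.0

    @staticmethod
    def lift(x):
        return x if isinstance(x, Dual) else Dual(float(x))

    def __add__(self, other):
        o = Dual.lift(other)
        return Dual(self.val + o.val, self.dot + o.dot)
    __radd__ = __add__

    def __neg__(self):
        return Dual(-self.val, -self.dot)

    def __sub__(self, other):
        return self + (-Dual.lift(other))

    def __rsub__(self, other):
        return Dual.lift(other) + (-self)

    def __mul__(self, other):
        o = Dual.lift(other)
        return Dual(self.val * o.val, self.dot * o.val + self.val * o.dot)
    __rmul__ = __mul__

    def __truediv__(self, other):
        o = Dual.lift(other)
        return Dual(self.val / o.val,
                    (self.dot * o.val - self.val * o.dot) / (o.val * o.val))

    def __rtruediv__(self, other):
        return Dual.lift(other) / self

def dsin(x):
    x = Dual.lift(x)
    return Dual(math.sin(x.val), math.cos(x.val) * x.dot)

# Hypothetical toy system (assumed constants, not taken from the paper).
G, DAMPING, DT, STEPS = 9.81, 0.1, 0.01, 40
TRUE_LENGTH = 0.5

def rollout(length, theta0=1.0):
    """Semi-implicit Euler rollout of a damped pendulum; returns Dual angles."""
    theta, omega = Dual.lift(theta0), Dual(0.0)
    angles = []
    for _ in range(STEPS):
        acc = -(G / length) * dsin(theta) - DAMPING * omega
        omega = omega + DT * acc
        theta = theta + DT * omega
        angles.append(theta)
    return angles

def identify_length(observed, guess=0.6, iterations=10):
    """Gauss-Newton on the rollout error, using derivatives from the Dual numbers."""
    for _ in range(iterations):
        predicted = rollout(Dual(guess, 1.0))  # seed d(length)/d(length) = 1
        residuals = [p - y for p, y in zip(predicted, observed)]
        grad = sum(r.val * r.dot for r in residuals)
        curv = sum(r.dot * r.dot for r in residuals)
        guess -= grad / curv
    return guess

# "Measured" data comes from the same simulator, so the fit is noise-free.
observed = [a.val for a in rollout(TRUE_LENGTH)]
estimate = identify_length(observed)
print("estimated length:", estimate)
```

The rollout is kept short (0.4 s) so that the error stays well-behaved in the length parameter. On real data, with noise and unmodeled effects, a plain update like this may need damping or a reverse-mode framework to scale to many parameters.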

PUBLICATION RECORD

Publication year
2020
Venue
IEEE International Conference on Robotics and Automation
Publication date
2020-11-03
Fields of study
Physics, Computer Science
Identifiers
DOI 10.1109/ICRA48506.2021.9561805 arXiv 2011.01734
Source metadata
Semantic Scholar

REFERENCES

Learning to Simulate Complex Physics with Graph Networks
2020, cited by this paper
Encoding Physical Constraints in Differentiable Newton-Euler Algorithm
2020, cited by this paper
High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards
2020, cited by this paper
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
2020, cited by this paper
Lagrangian Neural Networks
2020, cited by this paper
A Differentiable Newton Euler Algorithm for Multi-body Model Learning
2020, cited by this paper
Learning to Play Cup-and-Ball with Noisy Camera Observations
2020, cited by this paper
ADD
2020, cited by this paper
Symplectic ODE-Net: Learning Hamiltonian Dynamics with Control
2019, cited by this paper
TuneNet: One-Shot Residual Tuning for System Identification and Sim-to-Real Robot Task Transfer
2019, cited by this paper
Assessing Transferability From Simulation to Reality for Reinforcement Learning
2019, cited by this paper
Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems
2019, cited by this paper
Benchmarking Model-Based Reinforcement Learning
2019, cited by this paper
When to Trust Your Model: Model-Based Policy Optimization
2019, cited by this paper
BayesSim: adaptive domain randomization via probabilistic inference for robotics simulators
2019, cited by this paper
Hamiltonian Neural Networks
2019, cited by this paper
Deep Lagrangian Networks: Using Physics as Model Prior for Deep Learning
2019, cited by this paper
Learning agile and dynamic motor skills for legged robots
2019, cited by this paper
Interactive Differentiable Simulation
2019, cited by this paper
Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup
2019, cited by this paper
A General Framework for Structured Learning of Mechanical Systems
2019, cited by this paper
Variational Integrator Networks for Physically Structured Embeddings
2019, cited by this paper
Self-Paced Contextual Reinforcement Learning
2019, cited by this paper
DiffTaichi: Differentiable Programming for Physical Simulation
2019, cited by this paper
Graph networks as learnable physics engines for inference and control
2018, cited by this paper
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models
2018, cited by this paper
Differentiable Physics and Stable Modes for Tool-Use and Manipulation Planning
2018, cited by this paper
Closing the Sim-to-Real Loop: Adapting Simulation Randomization with Real World Experience
2018, cited by this paper
End-to-End Differentiable Physics for Learning and Control
2018, cited by this paper
Using probabilistic movement primitives in robotics
2017, cited by this paper
Linear Matrix Inequalities for Physically Consistent Inertial Parameter Identification: A Statistical Perspective on the Mass Distribution
2017, cited by this paper
Nonholonomic Mechanics And Control
2016, cited by this paper
Identification of fully physical consistent inertial parameters using optimization on manifolds
2016, cited by this paper
A Differentiable Physics Engine for Deep Learning in Robotics
2016, cited by this paper
Probabilistic Movement Primitives
2013, cited by this paper
A Survey on Policy Search for Robotics
2013, cited by this paper
Lie Group Formulation of Articulated Rigid Body Dynamics
2012, influential reference
MuJoCo: A physics engine for model-based control
2012, influential reference
Batch Reinforcement Learning
2012, cited by this paper
Using model knowledge for learning inverse dynamics
2010, cited by this paper
Model Learning with Local Gaussian Process Regression
2009, cited by this paper
Rigid Body Dynamics Algorithms
2007, cited by this paper
A Bayesian Approach to Nonlinear Parameter Identification for Rigid Body Dynamics
2006, cited by this paper
Gaussian process model based predictive control
2004, cited by this paper
Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning
2002, cited by this paper
Locally Weighted Learning for Control
1997, cited by this paper
Long Short-Term Memory
1997, cited by this paper
Learning an Accurate Neural Model of the Dynamics of a Typical Industrial Robot
1994, cited by this paper
Estimation of Inertial Parameters of Manipulator Loads and Links
1986, cited by this paper
Automatic Differentiation: Techniques and Applications
1981, cited by this paper
System identification: A survey
1971, cited by this paper
Policy Search for Motor Primitives in Robotics
year unknown, cited by this paper

CITED BY

Safe Online Mobile Network Optimization Through Digital Twin-Enhanced Monte Carlo Tree Search
2025, cites this paper
Autonomous Vehicle Path Planning by Searching With Differentiable Simulation
2025, cites this paper
Floating-Base Deep Lagrangian Networks
2025, cites this paper
TAG-K: Tail-Averaged Greedy Kaczmarz for Computationally Efficient and Performant Online Inertial Parameter Estimation
2025, cites this paper
Parameter Identification of a Differentiable Human Arm Musculoskeletal Model without Deep Muscle EMG Reconstruction
2025, cites this paper
Diff-MSM: Differentiable MusculoSkeletal Model for Simultaneous Identification of Human Muscle and Bone Parameters
2025, cites this paper
Newtonian and Lagrangian Neural Networks: A Comparison Towards Efficient Inverse Dynamics Identification
2025, cites this paper
Reinforcement Twinning for Hybrid Control of Flapping-Wing Drones
2025, cites this paper
Accelerating Model-Based Reinforcement Learning with State-Space World Models
2025, cites this paper
Unlocking Efficient Vehicle Dynamics Modeling via Analytic World Models
2025, cites this paper
Discovering Artificial Viscosity Models for Discontinuous Galerkin Approximation of Conservation Laws using Physics-Informed Machine Learning
2024, cites this paper
Diminishing Return of Value Expansion Methods
2024, cites this paper
Learning Object Properties Using Robot Proprioception via Differentiable Robot-Object Interaction
2024, cites this paper
Enhanced Prediction of Multi-Agent Trajectories via Control Inference and State-Space Dynamics
2024, cites this paper
A Review of Differentiable Simulators
2024, influential citation
Lyapunov-Based Physics-Informed Long Short-Term Memory (LSTM) Neural Network-Based Adaptive Control
2024, cites this paper
Deep Lyapunov-Based Physics-Informed Neural Networks (DeLb-PINN) for Adaptive Control Design
2023, cites this paper
Differentiable Trajectory Generation for Car-like Robots with Interpolating Radial Basis Function Networks
2023, influential citation
Data-efficient, explainable and safe box manipulation: Illustrating the advantages of physical priors in model-predictive control
2023, cites this paper
Adaptive Robotic Information Gathering via non-stationary Gaussian processes
2023, cites this paper
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
2023, cites this paper
A Survey on Physics Informed Reinforcement Learning: Review and Open Problems
2023, cites this paper
On Physical Origins of Learning
2023, cites this paper
Reinforcement Twinning: from digital twins to model-based reinforcement learning
2023, cites this paper
TWIST: Teacher-Student World Model Distillation for Efficient Sim-to-Real Transfer
2023, cites this paper
Physics-Informed Machine Learning: A Survey on Problems, Methods and Applications
2022, cites this paper
Learning Tool Morphology for Contact-Rich Manipulation Tasks with Differentiable Simulation
2022, cites this paper
Inferring Smooth Control: Monte Carlo Posterior Policy Iteration with Gaussian Processes
2022, influential citation
j-Wave: An open-source differentiable wave simulator
2022, cites this paper
Rethinking Optimization with Differentiable Simulation from a Global Perspective
2022, cites this paper
Learning an Accurate State Transition Dynamics Model by Fitting both a Function and Its Derivative
2022, cites this paper
DiffCloud: Real-to-Sim from Point Clouds with Differentiable Simulation and Rendering of Deformable Objects
2022, cites this paper
A Recurrent Differentiable Engine for Modeling Tensegrity Robots Trainable with Low-Frequency Data
2022, cites this paper
Parameter Identification and Motion Control for Articulated Rigid Body Robots Using Differentiable Position-based Dynamics
2022, cites this paper
Neural Posterior Domain Randomization
2021, cites this paper
Benchmarking Structured Policies and Policy Optimization for Real-World Dexterous Object Manipulation
2021, cites this paper
Using Physics Knowledge for Learning Rigid-body Forward Dynamics with Gaussian Process Force Priors
2021, influential citation
Robot Learning From Randomized Simulations: A Review
2021, cites this paper
A Differentiable Newton-Euler Algorithm for Real-World Robotics
2021, cites this paper
Combining physics and deep learning to learn continuous-time dynamics models
2021, influential citation
Learning Dynamics Models for Model Predictive Agents
2021, cites this paper
NeuralSim: Augmenting Differentiable Simulators with Neural Networks
2020, cites this paper