Meta-Reinforcement Learning With Evolving Gradient Regularization
Jiaxing Chen, Ao Ma, Shaofei Chen, Weilin Yuan, Zhenzhen Hu, Peng Li
Published 2025 in IEEE Robotics and Automation Letters

ABSTRACT
Deep reinforcement learning (DRL) typically requires retraining from scratch for each new task, which limits generalization because knowledge is not transferred across tasks. Meta-reinforcement learning (Meta-RL) addresses this by enabling rapid adaptation from prior task experience, yet existing gradient-based methods such as MAML perform poorly out of distribution because they overfit to narrow task distributions. To overcome this limitation, we propose Evolving Gradient Regularization MAML (ER-MAML). By integrating evolving gradient regularization into the MAML framework, ER-MAML optimizes meta-gradients while constraining adaptation directions via a regularization policy. This dual mechanism prevents overparameterization and enhances robustness across diverse task distributions. Experiments demonstrate that ER-MAML outperforms state-of-the-art baselines by 14.6% in out-of-distribution success rate and achieves strong online adaptation performance on the Meta-World benchmark. These results validate ER-MAML's effectiveness in improving Meta-RL generalization under distribution shift.
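The abstract describes the mechanism only at a high level: a MAML-style bilevel optimization in which a regularization policy constrains the inner-loop adaptation direction. Below is a minimal sketch in PyTorch of one way such a scheme could look; the toy regression tasks, the names `ref_w`, `ref_b`, and `lam`, and the quadratic pull of the task gradient toward a learned reference direction are all illustrative assumptions, not the paper's actual formulation.

```python
# Minimal sketch, NOT the authors' implementation: MAML with a learned
# regularizer that biases the inner-loop adaptation direction. The penalty
# form and all names below are assumptions made for illustration.
import torch

torch.manual_seed(0)

def predict(w, b, x):
    return x @ w + b

def task_loss(w, b, x, y):
    return ((predict(w, b, x) - y) ** 2).mean()

# Meta-learned initialization (the standard MAML quantity).
w = torch.zeros(1, 1, requires_grad=True)
b = torch.zeros(1, requires_grad=True)
# Hypothetical "regularization policy": here just a learned reference
# direction per parameter. The paper's policy is richer and evolves during
# meta-training; this sketch treats it as ordinary meta-parameters.
ref_w = torch.zeros(1, 1, requires_grad=True)
ref_b = torch.zeros(1, requires_grad=True)

inner_lr, lam = 0.1, 0.1
opt = torch.optim.Adam([w, b, ref_w, ref_b], lr=1e-2)

for step in range(200):
    opt.zero_grad()
    meta_loss = 0.0
    for amp in (1.0, 2.0):  # two synthetic linear-regression "tasks"
        x = torch.randn(16, 1)
        y = amp * x
        # Inner loop: one gradient step whose direction is pulled toward
        # the regularizer's reference direction (assumed quadratic penalty).
        gw, gb = torch.autograd.grad(
            task_loss(w, b, x, y), (w, b), create_graph=True
        )
        w_ad = w - inner_lr * (gw + lam * (gw - ref_w))
        b_ad = b - inner_lr * (gb + lam * (gb - ref_b))
        # Outer objective: post-adaptation loss on fresh data from the task.
        xq = torch.randn(16, 1)
        meta_loss = meta_loss + task_loss(w_ad, b_ad, xq, amp * xq)
    meta_loss.backward()  # meta-gradient flows through the inner step
    opt.step()

print("meta-learned init:", w.item(), b.item())
```

Because `w_ad` depends on both the task gradient and the reference direction, the outer update trains the initialization and the regularizer jointly; how the regularization policy actually evolves in ER-MAML is not specified in this record.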
PUBLICATION RECORD
- Publication year: 2025
- Venue: IEEE Robotics and Automation Letters
- Publication date: 2025-06-01
- Fields of study: Computer Science
- Source metadata: Semantic Scholar