Adaptive Exploration Strategy With Multi-Attribute Decision-Making for Reinforcement Learning

Published 2020 in IEEE Access

ABSTRACT

Reinforcement Learning (RL) agents often encounter the bottleneck of the performance when the dilemma of exploration and exploitation arises. In this study, an adaptive exploration strategy with multi-attribute decision-making is proposed to address the trade-off problem between exploration and exploitation. Firstly, the proposed method decomposes a complex task into several sub-tasks and trains each sub-task using the same training method individually. Then, the proposed method uses a multi-attribute decision-making method to develop an action policy integrating the training results of these trained sub-tasks. There are practical advantages to improve learning performance by allowing multiple learners to learn in parallel. An adaptive exploration strategy determines the probability of exploration depending on the information entropy instead of the suffocating work of empirical tuning. Finally, transfer learning extends the applicability of the proposed method. The experiment of the robot migration, the robot confrontation, and the real wheeled mobile robot are used to demonstrate the availability and practicability of the proposed method.

PUBLICATION RECORD

Publication year
2020
Venue
IEEE Access
Publication date
Unknown publication date
Fields of study
Computer Science
Identifiers
DOI 10.1109/ACCESS.2020.2973169
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Domain Adaptation With Neural Embedding Matching
2020cited by this paper
Behavior fusion for deep reinforcement learning.
2020cited by this paper
Structured optimal graph based sparse feature extraction for semi-supervised learning
2020cited by this paper
A Fuzzy Adaptive Approach to Decoupled Visual Servoing for a Wheeled Mobile Robot
2020influential reference
Incipient winding fault detection and diagnosis for squirrel-cage induction motors equipped on CRH trains.
2020cited by this paper
Discriminative low-rank preserving projection for dimensionality reduction
2019cited by this paper
Adaptive Image-Based Visual Servoing With Temporary Loss of the Visual Signal
2019cited by this paper
A Descriptor System Approach for Estimation of Incipient Faults With Application to High-Speed Railway Traction Devices
2019cited by this paper
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning
2018cited by this paper
Safe Exploration Algorithms for Reinforcement Learning Controllers
2018cited by this paper
Manifold Regularized Reinforcement Learning
2018cited by this paper
Self-Paced Prioritized Curriculum Learning With Coverage Penalty in Deep Reinforcement Learning
2018cited by this paper
Event-Triggered Distributed Control of Nonlinear Interconnected Systems Using Online Reinforcement Learning With Exploration
2018cited by this paper
Decoupled Visual Servoing With Fuzzy Q-Learning
2018influential reference
Pheromone-Based Planning Strategies in Dyna-Q Learning
2017cited by this paper
Parameter Space Noise for Exploration
2017cited by this paper
Balancing exploration and exploitation in reinforcement learning using a value of information criterion
2017cited by this paper
Extending the Peak Bandwidth of Parameters for Softmax Selection in Reinforcement Learning
2017cited by this paper
Multiattribute decision making based on interval-valued intuitionistic fuzzy values and linear programming methodology
2017cited by this paper
Curiosity-Driven Exploration by Self-Supervised Prediction
2017cited by this paper
A nonlinear-programming methodology for multi-attribute decision-making problem with interval-valued intuitionistic fuzzy soft sets information
2017cited by this paper
Predicting links based on knowledge dissemination in complex network
2017cited by this paper
Hybrid Whale Optimization Algorithm with simulated annealing for feature selection
2017cited by this paper
Asynchronous Methods for Deep Reinforcement Learning
2016cited by this paper
RL-IAC: An exploration policy for online saliency learning on an autonomous mobile robot
2016cited by this paper
Deep Exploration via Bootstrapped DQN
2016cited by this paper
Convergence Analysis of Two Loss Functions in Soft-Max Regression
2016cited by this paper
KELEA (Kinetic Energy Limiting Electrostatic Attraction) Offers an Alternative Explanation to Existing Concepts Regarding Wave-Particle Duality, Cold Fusion and Superconductivity
2016cited by this paper
Active transfer learning of matching query results across multiple sources
2015cited by this paper
Evolving Robocode tanks for Evo Robocode
2014cited by this paper
Analysis and Classification of Sleep Stages Based on Difference Visibility Graphs From a Single-Channel EEG Signal
2014cited by this paper
A Simple Scheme for Formation Control Based on Weighted Behavior Learning
2014cited by this paper
Fusion of Multiple Behaviors Using Layered Reinforcement Learning
2012cited by this paper
Unified Behavior Framework for Reactive Robot Control
2009cited by this paper
Transfer of Reinforcement Learning:The State of the Art
2008cited by this paper
A phased reinforcement learning algorithm for complex control problems
2007cited by this paper
Information Entropy
2006cited by this paper
Technical Note: Q-Learning
2004cited by this paper
Finite-time Analysis of the Multiarmed Bandit Problem
2002cited by this paper
The ordered weighted geometric averaging operators
2002cited by this paper
Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
2000cited by this paper
Induced ordered weighted averaging operators
1999cited by this paper
Reinforcement Learning: An Introduction
1998cited by this paper
The convergence of TD(λ) for general λ
1992cited by this paper

CITED BY

Unveiling New Approaches to Hybrid Intrinsic-Extrinsic Rewards in Actor-Critic Reinforcement Learning: A Systematic Review of Engagement, Performance, and Generalization
2025cites this paper
Optimal Energy Management of Plug-In Hybrid Electric Vehicles Through Ensemble Reinforcement Learning With Exploration-to-Exploitation Ratio Control
2025cites this paper
Comparative Analysis of A3C and PPO Algorithms in Reinforcement Learning: A Survey on General Environments
2024cites this paper
Optimal Energy Management of Plug-in Hybrid Vehicles Through Exploration-to-Exploitation Ratio Control in Ensemble Reinforcement Learning
2023cites this paper
Visible light communication and WiFi hybrid networks based on dynamic resource allocation algorithm
2023cites this paper
Deep reinforcement learning for the rapid on-demand design of mechanical metamaterials with targeted nonlinear deformation responses
2023cites this paper
Visual servoing with deep reinforcement learning for rotor unmanned helicopter
2022cites this paper
"Deep Reinforcement Learning for Engineering Design Through Topology Optimization of Elementally Discretized Design Domains"
2022cites this paper
Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
2021cites this paper
A Confrontation Decision-Making Method with Deep Reinforcement Learning and Knowledge Transfer for Multi-Agent System
2020cites this paper
Distributed Hybrid Kalman Temporal Differences for Reinforcement Learning
2020cites this paper
A Fuzzy Ensemble Method With Deep Learning for Multi-Robot System
2020cites this paper
A Situation Assessment Method with an Improved Fuzzy Deep Neural Network for Multiple UAVs
2020cites this paper
MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning
2020cites this paper
A Hierarchical Decision-Making Method with a Fuzzy Ant Colony Algorithm for Mission Planning of Multiple UAVs
2020cites this paper
An Experience Aggregative Reinforcement Learning With Multi-Attribute Decision-Making for Obstacle Avoidance of Wheeled Mobile Robot
2020cites this paper