Opponent Aware Reinforcement Learning
Víctor Gallego, Roi Naveiro, D. Insua, D. Gómez-Ullate
Published 2019 in arXiv.org
ABSTRACT
We introduce Threatened Markov Decision Processes (TMDPs) as an extension of the classical Markov Decision Process framework for Reinforcement Learning (RL). TMDPs support a decision maker facing potential opponents in an RL context. We also propose a level-k thinking scheme that yields a novel learning approach for TMDPs. After introducing our framework and deriving theoretical results, we provide relevant empirical evidence via extensive experiments, showing the benefits of accounting for adversaries in RL while the agent learns.
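The abstract describes learning in a TMDP, where the decision maker's value of an action depends on an opponent's simultaneous action, so the agent augments Q-learning with an opponent model and averages over it. The following is a minimal illustrative sketch of that idea in tabular form, not the paper's actual implementation: the toy repeated game, all names, and the empirical (Laplace-smoothed) opponent model are assumptions.

```python
import random
from collections import defaultdict

# Illustrative opponent-aware tabular Q-learning sketch (assumed details,
# not the paper's code): the agent keeps Q(s, a, b) over joint actions and
# an empirical model of the opponent, and acts on the expected Q-value.

ACTIONS = [0, 1]          # agent's actions
OPP_ACTIONS = [0, 1]      # opponent's actions
ALPHA, GAMMA, EPS = 0.1, 0.9, 0.1

# Q[s][(a, b)]: value of agent action a when the opponent plays b in state s
Q = defaultdict(lambda: defaultdict(float))
# Empirical opponent model with a Laplace prior of 1 per opponent action
opp_counts = defaultdict(lambda: {b: 1.0 for b in OPP_ACTIONS})

def opp_prob(s, b):
    counts = opp_counts[s]
    return counts[b] / sum(counts.values())

def expected_q(s, a):
    # Expectation of Q over the opponent model
    return sum(opp_prob(s, b) * Q[s][(a, b)] for b in OPP_ACTIONS)

def choose_action(s):
    if random.random() < EPS:
        return random.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: expected_q(s, a))

def update(s, a, b, r, s_next):
    opp_counts[s][b] += 1.0
    target = r + GAMMA * max(expected_q(s_next, a2) for a2 in ACTIONS)
    Q[s][(a, b)] += ALPHA * (target - Q[s][(a, b)])

# Toy repeated game: the agent earns +1 for matching a biased opponent
# that plays action 0 with probability 0.8, and -1 otherwise.
random.seed(0)
s = 0  # single-state repeated game
for _ in range(5000):
    a = choose_action(s)
    b = 0 if random.random() < 0.8 else 1
    r = 1.0 if a == b else -1.0
    update(s, a, b, r, s)

best = max(ACTIONS, key=lambda a: expected_q(s, a))
print(best)
```

The key departure from plain Q-learning is that both the action choice and the bootstrap target average over the learned opponent distribution; in the level-k scheme the opponent model would itself be such a learner, one level down.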
PUBLICATION RECORD
- Publication year: 2019
- Venue: arXiv.org
- Publication date: 2019-08-22
- Fields of study: Mathematics, Computer Science
- Source metadata: Semantic Scholar
REFERENCES
- 42 references
CITED BY
- 7 citing papers