Reinforcement Learning and Inverse Reinforcement Learning with System 1 and System 2

Published 2018 in AAAI/ACM Conference on AI, Ethics, and Society

ABSTRACT

Inferring a person's goal from their behavior is an important problem in applications of AI (e.g. automated assistants, recommender systems). The workhorse model for this task is the rational actor model - this amounts to assuming that people have stable reward functions, discount the future exponentially, and construct optimal plans. Under the rational actor assumption techniques such as inverse reinforcement learning (IRL) can be used to infer a person's goals from their actions. A competing model is the dual-system model. Here decisions are the result of an interplay between a fast, automatic, heuristic-based system 1 and a slower, deliberate, calculating system 2. We generalize the dual system framework to the case of Markov decision problems and show how to compute optimal plans for dual-system agents. We show that dual-system agents exhibit behaviors that are incompatible with rational actor assumption. We show that naive applications of rational-actor IRL to the behavior of dual-system agents can generate wrong inference about the agents' goals and suggest interventions that actually reduce the agent's overall utility. Finally, we adapt a simple IRL algorithm to correctly infer the goals of dual system decision-makers. This allows us to make interventions that help, rather than hinder, the dual-system agent's ability to reach their true goals.

PUBLICATION RECORD

Publication year
2018
Venue
AAAI/ACM Conference on AI, Ethics, and Society
Publication date
2018-11-19
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1145/3306618.3314259 arXiv 1811.08549
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Recommender Systems
2021cited by this paper
J. R. McNeill and George Vrtis, (Eds): Mining North America. An Environmental History since 1522
2018cited by this paper
Planning Complexity Registers as a Cost in Metacontrol
2018influential reference
Maintaining cooperation in complex social dilemmas using deep reinforcement learning
2017cited by this paper
Multi-agent Reinforcement Learning in Sequential Social Dilemmas
2017cited by this paper
Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems
2017cited by this paper
Consequentialist conditional cooperation in social dilemmas with imperfect information
2017cited by this paper
Cooperative Inverse Reinforcement Learning
2016cited by this paper
Habits of Virtue: Creating Norms of Cooperation and Defection in the Laboratory
2016cited by this paper
Recency, Records, and Recaps
2016cited by this paper
Intuition, deliberation, and the evolution of cooperation
2016cited by this paper
Planning Problems for Sophisticated Agents with Present Bias
2016cited by this paper
Learning the Preferences of Ignorant, Inconsistent Agents
2015cited by this paper
Moral tribes: Emotion, reason, and the gap between us and them
2015cited by this paper
Time-inconsistent planning: a computational problem in behavioral economics
2014cited by this paper
Goal Recognition Design
2014cited by this paper
How to commit (if you must): Commitment contracts and the dual-self model
2014cited by this paper
Recency, records and recaps: learning and non-equilibrium behavior in a simple decision problem
2014cited by this paper
Recommender Systems: Kembellec/Recommender Systems
2014cited by this paper
Social heuristics shape intuitive cooperation
2014cited by this paper
The expected value of control: an integrative theory of anterior cingulate cortex function.
2013cited by this paper
Timing and Self-Control
2012cited by this paper
Spontaneous giving and calculated greed
2012cited by this paper
Depression Babies: Do Macroeconomic Experiences Affect Risk-Taking?
2011cited by this paper
A Structural Model of Sponsored Search Advertising Auctions
2011cited by this paper
Thinking, Fast and Slow
2011cited by this paper
Commitment Devices
2009cited by this paper
Self-Control in Decision-Making Involves Modulation of the vmPFC Valuation System
2009cited by this paper
The description-experience gap in risky choice.
2009cited by this paper
Temporal-Difference Reinforcement Learning with Distributed Representations
2009cited by this paper
Maximum Entropy Inverse Reinforcement Learning
2008influential reference
Reasoning: Breakdown of Will
2008cited by this paper
Bayesian Inverse Reinforcement Learning
2007cited by this paper
A Dual-Self Model of Impulse Control.
2006cited by this paper
Neural Systems Responding to Degrees of Uncertainty in Human Decision-Making
2005cited by this paper
Separate Neural Systems Value Immediate and Delayed Monetary Rewards
2004cited by this paper
Moral Tribes: Emotion, Reason, and the Gap Between Us and Them
2001cited by this paper
Pharmacokinetics of a novel formulation of ivermectin after administration to goats
2000cited by this paper
Doing It Now or Later
1999influential reference
Temptation and Self‐Control
1999cited by this paper
Introduction to Reinforcement Learning
1998cited by this paper
Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria
1998cited by this paper
Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces
1997cited by this paper
Golden Eggs and Hyperbolic Discounting
1997cited by this paper
The Winner s Curse
1991cited by this paper
An Economic Theory of Self-Control
1981cited by this paper

CITED BY

Domain-driven Metrics for Reinforcement Learning: A Case Study on Epidemic Control using Agent-based Simulation
2025cites this paper
Reinforcement Learning for Intrusion Detection: Recent Advances and Datasets
2025cites this paper
Anomalous ride-hailing driver detection with deep transfer inverse reinforcement learning
2024cites this paper
An Ontology-based Data-driven Architecture for Analyzing Cognitive Biases in Decision-making
2023cites this paper
Designing Fiduciary Artificial Intelligence
2023cites this paper
HMIway-env: A Framework for Simulating Behaviors and Preferences to Support Human-AI Teaming in Driving
2022cites this paper
Meaningful human control: actionable properties for AI system development
2021cites this paper
Building Healthy Recommendation Sequences for Everyone: A Safe Reinforcement Learning Approach
2021cites this paper
AI Alignment and Human Reward
2021cites this paper
Meaningful human control over AI systems: beyond talking the talk
2021cites this paper
Model-free conventions in multi-agent reinforcement learning with heterogeneous preferences
2020cites this paper
Robust Multi-agent Counterfactual Prediction
2019cites this paper
Graph Model Approach to Hierarchy Control Network
2019cites this paper
Distributed Ledger Technology and Cyber-Physical Systems. Multi-agent Systems. Concepts and Trends
2019cites this paper