Reinforcement Learning for Mapping Instructions to Actions

S. Branavan,Harr Chen,Luke Zettlemoyer,R. Barzilay

Published 2009 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function that defines the quality of the executed actions. During training, the learner repeatedly constructs action sequences for a set of documents, executes those actions, and observes the resulting reward. We use a policy gradient algorithm to estimate the parameters of a log-linear model for action selection. We apply our method to interpret instructions in two domains --- Windows troubleshooting guides and game tutorials. Our results demonstrate that this method can rival supervised learning techniques while requiring few or no annotated training examples.

PUBLICATION RECORD

Publication year
2009
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
2009-08-02
Fields of study
Computer Science
Identifiers
DOI 10.3115/1687878.1687892
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Learning Language from Its Perceptual Context
2011cited by this paper
Learning to Connect Language and Perception
2008cited by this paper
Learning to sportscast: a test of grounded language acquisition
2008cited by this paper
Submission Category : Applications , Preference : ORAL Reinforcement Learning for Spoken Dialogue Systems
2007cited by this paper
Intentional Context in Situated Natural Language Learning
2005cited by this paper
On the Integration of Grounding Language and Learning Objects
2004cited by this paper
Autonomous Helicopter Flight via Reinforcement Learning
2003cited by this paper
Automatic learning of dialogue strategy using dialogue simulation and reinforcement learning
2002cited by this paper
Learning words from sights and sounds: a computational model
2002cited by this paper
Learning the semantics of words and pictures
2001cited by this paper
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
2001cited by this paper
Grounding knowledge in sensors: unsupervised learning for language and planning
2001cited by this paper
Spoken Dialogue Management Using Probabilistic Reasoning
2000cited by this paper
Automatic Optimization of Dialogue Management
2000cited by this paper
Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic
1999cited by this paper
Policy Gradient Methods for Reinforcement Learning with Function Approximation
1999cited by this paper
Introduction to Reinforcement Learning
1998influential reference
Reinforcement Learning: An Introduction
1998cited by this paper
Inducing Features of Random Fields
1995cited by this paper
Understanding Natural Language Instructions: The Case of Purpose Clauses
1992cited by this paper
Understanding natural language
1972cited by this paper

CITED BY

AutoEval: A Practical Framework for Autonomous Evaluation of Mobile Agents
2025cites this paper
WebSight: A Vision-First Architecture for Robust Web Agents
2025cites this paper
Creative Planning with Language Models: Practice, Evaluation and Applications
2025cites this paper
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark
2025cites this paper
Cost–benefit analysis of deploying shallow, deep learning and generative models for legal text classification
2025cites this paper
Efficiency-Driven Adaptive Task Planning for Household Robot Based on Hierarchical Item-Environment Cognition
2025cites this paper
UGIF-DataSet: A New Dataset for Cross-lingual, Cross-modal Sequential actions on the UI
2024cites this paper
NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild
2024cites this paper
Augmenting the action space with conventions to improve multi-agent cooperation in Hanabi
2024cites this paper
Prompt2Task: Automating UI Tasks on Smartphones from Textual Prompts
2024cites this paper
The broader spectrum of in-context learning
2024cites this paper
Natural Language-based State Representation in Deep Reinforcement Learning
2024cites this paper
Problem Solving Through Human-AI Preference-Based Cooperation
2024cites this paper
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
2024cites this paper
AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents
2024cites this paper
Mobile User Interface Adaptation Based on Usability Reward Model and Multi-Agent Reinforcement Learning
2024cites this paper
A Systematic Survey on Instructional Text: From Representation Formats to Downstream NLP Tasks
2024cites this paper
NaRuto: Automatically Acquiring Planning Models from Narrative Texts
2024cites this paper
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
2024cites this paper
Language-guided Skill Learning with Temporal Variational Inference
2024cites this paper
Beyond Browsing: API-Based Web Agents
2024cites this paper
Autonomous Evaluation and Refinement of Digital Agents
2024influential citation
Navigating WebAI: Training Agents to Complete Web Tasks with Large Language Models and Reinforcement Learning
2024cites this paper
HeaP: Hierarchical Policies for Web Actions using LLMs
2023cites this paper
Predictive Chemistry Augmented with Text Retrieval
2023cites this paper
Using Planning to Improve Semantic Parsing of Instructional Texts
2023cites this paper
Lexi: Self-Supervised Learning of the UI Language
2023cites this paper
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
2023cites this paper
WebArena: A Realistic Web Environment for Building Autonomous Agents
2023cites this paper
LLM-driven Instruction Following: Progresses and Concerns
2023cites this paper
Federated Reinforcement Learning in IoT: Applications, Opportunities and Open Challenges
2023cites this paper
Having Difficulty Understanding Manuals? Automatically Converting User Manuals into Instructional Videos
2023cites this paper
Automated Action Model Acquisition from Narrative Texts
2023cites this paper
Toward Interactive Dictation
2023cites this paper
ChatMPC: Natural Language Based MPC Personalization
2023cites this paper
An SQL query generator for cross-domain human language based questions based on NLP model
2023cites this paper
How-to Guides for Specific Audiences: A Corpus and Initial Findings
2023cites this paper
SteP: Stacked LLM Policies for Web Actions
2023cites this paper
Communicative Feedback in language acquisition
2023cites this paper
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
2023cites this paper
On the Effectiveness of Offline RL for Dialogue Response Generation
2023cites this paper
Text Editing as Imitation Game
2022cites this paper
Interactive Language: Talking to Robots in Real Time
2022cites this paper
RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents
2022cites this paper
Overcoming Referential Ambiguity in Language-Guided Goal-Conditioned Reinforcement Learning
2022cites this paper
A General Framework of Task Understanding for Tour-Guide Robots in Exhibition Environments
2022cites this paper
RLang: A Declarative Language for Expressing Prior Knowledge for Reinforcement Learning
2022cites this paper
Natural Language-Based Automatic Programming for Industrial Robots
2022cites this paper
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence
2022influential citation
ℓ Gym: Natural Language Visual Reasoning with Reinforcement Learning
2022cites this paper
Ask Before You Act: Generalising to Novel Environments by Asking Questions
2022cites this paper
Inferring Rewards from Language in Context
2022cites this paper
Integrating AI Planning with Natural Language Processing: A Combination of Explicit and Tacit Knowledge
2022cites this paper
71 . Cognitive Human–Robot Interaction
2022cites this paper
Improving Intrinsic Exploration with Language Abstractions
2022cites this paper
A comprehensive review of task understanding of command-triggered execution of tasks for service robots
2022cites this paper
Leveraging Language for Accelerated Learning of Tool Manipulation
2022cites this paper
Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches
2022cites this paper
Building Assistive Sensorimotor Interfaces through Human-in-the-Loop Machine Learning
2022cites this paper
SPRINT: Scalable Semantic Policy Pre-Training via Language Instruction Relabeling
2022cites this paper
Recent Advances in Artificial Intelligence and Tactical Autonomy: Current Status, Challenges, and Perspectives
2022cites this paper
Pragmatics in Grounded Language Learning: Phenomena, Tasks, and Modeling Approaches
2022cites this paper
Credit-cognisant reinforcement learning for multi-agent cooperation
2022cites this paper
G^3: Geolocation via Guidebook Grounding
2022cites this paper
Few-shot Subgoal Planning with Language Models
2022cites this paper
SPRINT: S CALABLE S EMANTIC P OLICY P RE - T RAINING VIA L ANGUAGE I NSTRUCTION R ELABELING Anonymous authors
2022cites this paper
Learning to play: understanding in-game tutorials with a pilot study on implicit tutorials
2022cites this paper
UGIF: UI Grounded Instruction Following
2022cites this paper
Reasoning about Procedures with Natural Language Processing: A Tutorial
2022cites this paper
Diagnosing Vision-and-Language Navigation: What Really Matters
2021cites this paper
gComm: An environment for investigating generalization in Grounded Language Acquisition
2021cites this paper
Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games
2021cites this paper
Episodic Transformer for Vision-and-Language Navigation
2021cites this paper
Learning Grounded Pragmatic Communication
2021influential citation
A Survey on Federated Learning: The Journey From Centralized to Distributed On-Site Learning and Beyond
2021cites this paper
A Natural Language Instruction Disambiguation Method for Robot Grasping
2021cites this paper
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games
2021cites this paper
Compositional Data and Task Augmentation for Instruction Following
2021cites this paper
Learning UI Navigation through Demonstrations composed of Macro Actions
2021cites this paper
Interactive Robot Learning: An Overview
2021cites this paper
Bridging Natural Language and Graphical User Interfaces
2021cites this paper
Interactive Hierarchical Guidance using Language
2021cites this paper
Skill Induction and Planning with Latent Language
2021cites this paper
Etna: Harvesting Action Graphs from Websites
2021cites this paper
Glider: A Reinforcement Learning Approach to Extract UI Scripts from Websites
2021cites this paper
Service skill improvement for home robots: Autonomous generation of action sequence based on reinforcement learning
2021cites this paper
Generalization in Instruction Following Systems
2021cites this paper
A Conformal Mapping-based Framework for Robot-to-Robot and Sim-to-Real Transfer Learning
2021cites this paper
Feudal Reinforcement Learning by Reading Manuals
2021cites this paper
GPT3-to-plan: Extracting plans from text using GPT-3
2021cites this paper
Guiding Safe Reinforcement Learning Policies Using Structured Language Constraints
2020cites this paper
Program Guided Agent
2020cites this paper
CLAI: A Platform for AI Skills on the Command Line
2020cites this paper
Spatial Reasoning from Natural Language Instructions for Robot Manipulation
2020cites this paper
One-Shot Learning of PDDL Models from Natural Language Process Manuals
2020cites this paper
FLIN: A Flexible Natural Language Interface for Web Navigation
2020influential citation
Conversational Learning
2020cites this paper
Lifelong Learning Dialogue Systems: Chatbots that Self-Learn On the Job
2020cites this paper
An Application-Independent Approach to Building Task-Oriented Chatbots with Interactive Continual Learning
2020cites this paper
Guaranteeing Sound Reactions to Long-Tailed Changes: A Syntax-Directed Annotation Approach
2020cites this paper