Situated Mapping of Sequential Instructions to Actions with Single-step Reward Observation

Published 2018 in Annual Meeting of the Association for Computational Linguistics

ABSTRACT

We propose a learning approach for mapping context-dependent sequential instructions to actions. We address the problem of discourse and state dependencies with an attention-based model that considers both the history of the interaction and the state of the world. To train from start and goal states without access to demonstrations, we propose SESTRA, a learning algorithm that takes advantage of single-step reward observations and immediate expected reward maximization. We evaluate on the SCONE domains, and show absolute accuracy improvements of 9.8%-25.3% across the domains over approaches that use high-level logical representations.

PUBLICATION RECORD

Publication year
2018
Venue
Annual Meeting of the Association for Computational Linguistics
Publication date
2018-05-25
Fields of study
Computer Science
Identifiers
DOI 10.18653/v1/P18-1193 arXiv 1805.10209
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Reinforcement learning
2019cited by this paper
Learning to Map Context-Dependent Sentences to Executable Formal Queries
2018cited by this paper
Learning Interpretable Spatial Operations in a Rich 3D Blocks World
2017cited by this paper
From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood
2017cited by this paper
Mapping Instructions and Visual Observations to Actions with Reinforcement Learning
2017cited by this paper
Unified Pragmatic Models for Generating and Following Instructions
2017cited by this paper
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments
2017cited by this paper
Mean Actor Critic
2017cited by this paper
Source-Target Inference Models for Spatial Instruction Understanding
2017cited by this paper
Representation Learning for Grounded Spatial Reasoning
2017cited by this paper
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision
2016cited by this paper
Natural Language Communication with Robots
2016cited by this paper
Simpler Context-Dependent Logical Forms via Model Projections
2016influential reference
Alignment-Based Compositional Semantics for Instruction Following
2015cited by this paper
Learning to Search Better than Your Teacher
2015influential reference
Traversing Knowledge Graphs in Vector Space
2015cited by this paper
Listen, Attend, and Walk: Neural Mapping of Navigational Instructions to Action Sequences
2015cited by this paper
Imitation Learning of Agenda-based Semantic Parsers
2015cited by this paper
Effective Approaches to Attention-based Neural Machine Translation
2015cited by this paper
Adam: A Method for Stochastic Optimization
2014cited by this paper
Neural Machine Translation by Jointly Learning to Align and Translate
2014cited by this paper
Semantic Parsing via Paraphrasing
2014cited by this paper
Learning Compact Lexicons for CCG Semantic Parsing
2014cited by this paper
Semantic Parsing on Freebase from Question-Answer Pairs
2013cited by this paper
Scaling Semantic Parsers with On-the-Fly Ontology Matching
2013cited by this paper
Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions
2013cited by this paper
Fast Online Lexicon Learning for Grounded Language Acquisition
2012cited by this paper
Learning to Interpret Natural Language Navigation Instructions from Observations
2011cited by this paper
Learning Dependency-Based Compositional Semantics
2011cited by this paper
Driving Semantic Parsing from the World’s Response
2010cited by this paper
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
2010cited by this paper
What is left to be understood in ATIS?
2010cited by this paper
Understanding the difficulty of training deep feedforward neural networks
2010cited by this paper
Learning Context-Dependent Mappings from Sentences to Logical Form
2009cited by this paper
Walk the Talk: Connecting Language, Knowledge, and Action in Route Instructions
2006cited by this paper
Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping
1999cited by this paper
Reinforcement learning
1998cited by this paper
Long Short-Term Memory
1997cited by this paper
A Fully Statistical Approach to Natural Language Interfaces
1996cited by this paper
Expanding the Scope of the ATIS Task: The ATIS-3 Corpus
1994cited by this paper
The ATIS Spoken Language Systems Pilot Corpus
1990cited by this paper

CITED BY

Improving Agent Interactions in Virtual Environments with Language Models
2024cites this paper
Can Sequence-to-Sequence Transformers Naturally Understand Sequential Instructions?
2023cites this paper
Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling
2023cites this paper
Learning to Execute Actions or Ask Clarification Questions
2022cites this paper
LISA: Learning Interpretable Skill Abstractions from Language
2022cites this paper
Inferring Rewards from Language in Context
2022cites this paper
LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training
2022cites this paper
GoalNet: Inferring Conjunctive Goal Predicates from Human Plan Demonstrations for Robot Instruction Following
2022cites this paper
A Modular Vision Language Navigation and Manipulation Framework for Long Horizon Compositional Tasks in Indoor Environment
2021cites this paper
Learning to execute or ask clariﬁcation questions
2021cites this paper
A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution
2021cites this paper
Dynamic Hybrid Relation Network for Cross-Domain Context-Dependent Semantic Parsing
2021cites this paper
Learning to execute instructions in a Minecraft dialogue
2020cites this paper
Programming in Natural Language with fuSE: Synthesizing Methods from Spoken Utterances Using Deep Natural Language Understanding
2020cites this paper
Achieving Common Ground in Multi-modal Dialogue
2020cites this paper
Context Dependent Semantic Parsing: A Survey
2020influential citation
Sentiment Analysis for Arabic Language using Attention-Based Simple Recurrent Unit
2019cites this paper
Executing Instructions in Situated Collaborative Interactions
2019cites this paper
Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions
2019influential citation
FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension
2019cites this paper
Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing
2019cites this paper
Automated Curriculum Learning for Turn-level Spoken Language Understanding with Weak Supervision
2019cites this paper
CONVERSATIONAL MACHINE COMPREHENSION
2019influential citation
Learning to Map Natural Language Instructions to Physical Quadcopter Control using Simulated Flight
2019cites this paper
Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions
2019influential citation
Knowledge-Aware Conversational Semantic Parsing Over Web Tables
2018cites this paper
FlowQA: Grasping Flow in History for Conversational Machine Comprehension
2018influential citation
Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction
2018cites this paper
Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base
2018cites this paper
Value-based Search in Execution Space for Mapping Instructions to Programs
2018influential citation
Mapping Navigation Instructions to Continuous Control Actions with Position-Visitation Prediction
2018cites this paper
Simple Recurrent Units for Highly Parallelizable Recurrence
2017cites this paper
Training RNNs as Fast as CNNs
2017cites this paper