PlatoLTL: Learning to Generalize Across Symbols in LTL Instructions for Multi-Task RL

Jacques Cloete,Mathias Jackermeier,Ioannis Havoutis,Alessandro Abate

Published 2026 in arXiv.org

ABSTRACT

A central challenge in multi-task reinforcement learning (RL) is to train generalist policies capable of performing tasks not seen during training. To facilitate such generalization, linear temporal logic (LTL) has recently emerged as a powerful formalism for specifying structured, temporally extended tasks to RL agents. While existing approaches to LTL-guided multi-task RL demonstrate successful generalization across LTL specifications, they are unable to generalize to unseen vocabularies of propositions (or"symbols"), which describe high-level events in LTL. We present PlatoLTL, a novel approach that enables policies to zero-shot generalize not only compositionally across LTL formula structures, but also parametrically across propositions. We achieve this by treating propositions as instances of parameterized predicates rather than discrete symbols, allowing policies to learn shared structure across related propositions. We propose a novel architecture that embeds and composes predicates to represent LTL specifications, and demonstrate successful zero-shot generalization to novel propositions and tasks across challenging environments.

PUBLICATION RECORD

Publication year
2026
Venue
arXiv.org
Publication date
2026-01-30
Fields of study
Computer Science
Identifiers
DOI 10.48550/arXiv.2601.22891 arXiv 2601.22891
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Reinforcement Learning of Flexible Policies for Symbolic Instructions With Adjustable Mapping Specifications
2025influential reference
Zero-Shot Instruction Following in RL via Structured LTL Representations
2025influential reference
One Subgoal at a Time: Zero-Shot Generalization to Arbitrary Linear Temporal Logic Requirements in Multi-Task Reinforcement Learning
2025influential reference
LTL-Constrained Policy Optimization with Cycle Experience Replay
2024cited by this paper
Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation
2024cited by this paper
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents
2024cited by this paper
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
2024cited by this paper
Generalization of temporal logic tasks via future dependent options
2024cited by this paper
Eventual Discounting Temporal Logic Counterfactual Experience Replay
2023cited by this paper
Instructing Goal-Conditioned Reinforcement Learning Agents with Temporal Logic Objectives
2023cited by this paper
Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning
2022cited by this paper
Modular Deep Reinforcement Learning for Continuous Motion Planning With Temporal Logic
2021cited by this paper
LTL2Action: Generalizing LTL Instructions for Multi-Task RL
2021cited by this paper
In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications
2021cited by this paper
Compositional Reinforcement Learning from Logical Specifications
2021cited by this paper
Plato
2020cited by this paper
Deep Reinforcement Learning with Temporal Logics
2020cited by this paper
Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
2020cited by this paper
A Composable Specification Language for Reinforcement Learning Tasks
2020cited by this paper
Certified Reinforcement Learning with Logic Guidance
2019cited by this paper
LTL and Beyond: Formal Languages for Reward Function Specification in Reinforcement Learning
2019cited by this paper
Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning
2019cited by this paper
Modular Deep Reinforcement Learning with Temporal Logic Specifications
2019cited by this paper
Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network
2018cited by this paper
Omega-Regular Objectives in Model-Free Reinforcement Learning
2018cited by this paper
Graph Neural Networks: A Review of Methods and Applications
2018cited by this paper
Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning
2018cited by this paper
Rabinizer 4: From LTL to Your Favourite Deterministic Automaton
2018cited by this paper
Semi-Supervised Classification with Graph Convolutional Networks
2016cited by this paper
On the Properties of Neural Machine Translation: Encoder–Decoder Approaches
2014cited by this paper
Principles of model checking
2008cited by this paper
Reinforcement Learning: An Introduction
1998cited by this paper
Backpropagation Applied to Handwritten Zip Code Recognition
1989cited by this paper
Temporal Logic of Programs
1987cited by this paper
A hierarchy of temporal properties
1987cited by this paper
Learning representations by back-propagating errors
1986cited by this paper
Symposium on Decision Problems: On a Decision Method in Restricted Second Order Arithmetic
1966cited by this paper

CITED BY

Zero-Shot Instruction Following in RL via Structured LTL Representations
2025cites this paper