Optimal control of the future via prospective learning with control

Yuxin Bai,Aranyak Acharyya,Ashwin De Silva,Zeyu Shen,James Hassett,J. Vogelstein

Published 2025 in Unknown venue

ABSTRACT

Optimal control of the future is the next frontier for AI. Current approaches to this problem are typically rooted in reinforcement learning (RL). RL is mathematically distinct from supervised learning, which has been the main workhorse for the recent achievements in AI. Moreover, RL typically operates in a stationary environment with episodic resets, limiting its utility. Here, we extend supervised learning to address learning to \textit{control} in non-stationary, reset-free environments. Using this framework, called''Prospective Learning with Control''(PL+C), we prove that under certain fairly general assumptions, empirical risk minimization (ERM) asymptotically achieves the Bayes optimal policy. We then consider a specific instance of prospective learning with control, foraging -- which is a canonical task for any mobile agent -- be it natural or artificial. We illustrate that modern RL algorithms fail to learn in these non-stationary reset-free environments, and even with modifications, they are orders of magnitude less efficient than our prospective foraging agents.

PUBLICATION RECORD

Publication year
2025
Venue
Unknown venue
Publication date
2025-11-11
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 2511.08717
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Prospective Learning in Retrospect
2025influential reference
Prospective Learning: Learning for a Dynamic Future
2024cited by this paper
What is foraging?
2024cited by this paper
Continual Learning as Computationally Constrained Reinforcement Learning
2023cited by this paper
A Definition of Continual Reinforcement Learning
2023cited by this paper
You Only Live Once: Single-Life Reinforcement Learning
2022cited by this paper
Simple Lifelong Learning Machines
2020cited by this paper
Reset-Free Lifelong Learning with Skill-Space Planning
2020cited by this paper
Towards Continual Reinforcement Learning: A Review and Perspectives
2020cited by this paper
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
2018cited by this paper
Soft Actor-Critic Algorithms and Applications
2018cited by this paper
Attention is All you Need
2017cited by this paper
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
2017cited by this paper
Mastering the game of Go with deep neural networks and tree search
2016cited by this paper
Understanding Machine Learning - From Theory to Algorithms
2014cited by this paper
The Ecological Approach to Visual Perception: Classic Edition
2014cited by this paper
Probably Approximately Correct: Nature's Algorithms for Learning and Prospering in a Complex World
2013cited by this paper
Finite-Time Bounds for Fitted Value Iteration
2008cited by this paper
Bandit Based Monte-Carlo Planning
2006cited by this paper
Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search
2006cited by this paper
Tree-Based Batch Mode Reinforcement Learning
2005cited by this paper
A New Approach to Linear Filtering and Prediction Problems
2002cited by this paper
Reinforcement Learning: An Introduction
1998cited by this paper
A Bayesian Approach to Filtering Junk E-Mail
1998cited by this paper
On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9
1990cited by this paper
A theory of the learnable
1984cited by this paper
Adaptive control: The model reference approach
1981cited by this paper
Estimating causal effects of treatments in randomized and nonrandomized studies.
1974cited by this paper
Chervonenkis: On the uniform convergence of relative frequencies of events to their probabilities
1971cited by this paper
Statistical Inference for Probabilistic Functions of Finite State Markov Chains
1966cited by this paper
Dynamic Programming and Stochastic Control Processes
1958cited by this paper
Statistical Decision Functions
1951cited by this paper

CITED BY

Toward a science of prospective learning.
2025cites this paper