Tracking Slowly Moving Clairvoyant: Optimal Dynamic Regret of Online Learning with True and Noisy Gradient

Tianbao Yang, Lijun Zhang, Rong Jin, Jinfeng Yi

Published 2016 in International Conference on Machine Learning

ABSTRACT

This work focuses on dynamic regret of online convex optimization that compares the performance of online learning to a clairvoyant who knows the sequence of loss functions in advance and hence selects the minimizer of the loss function at each step. By assuming that the clairvoyant moves slowly (i.e., the minimizers change slowly), we present several improved variation-based upper bounds of the dynamic regret under the true and noisy gradient feedback, which are optimal in light of the presented lower bounds. The key to our analysis is to explore a regularity metric that measures the temporal changes in the clairvoyant's minimizers, to which we refer as path variation. Firstly, we present a general lower bound in terms of the path variation, and then show that under full information or gradient feedback we are able to achieve an optimal dynamic regret. Secondly, we present a lower bound with noisy gradient feedback and then show that we can achieve optimal dynamic regrets under a stochastic gradient feedback and two-point bandit feedback. Moreover, for a sequence of smooth loss functions that admit a small variation in the gradients, our dynamic regret under the two-point bandit feedback matches what is achieved with full information.
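
To make the two quantities in the abstract concrete, the following is a minimal sketch, not the authors' algorithm or experiments. It assumes a toy one-dimensional problem with quadratic losses f_t(w) = 0.5 * (w - c_t)^2, an unconstrained domain, a minimizer c_t that drifts linearly, and plain online gradient descent with a fixed step size; the function name `simulate` and the parameters `T`, `drift`, and `eta` are hypothetical. It accumulates the dynamic regret (learner's loss minus the clairvoyant's loss at each step) and the path variation (total movement of the minimizers).

```python
def simulate(T=200, drift=0.01, eta=0.5):
    """Toy run: losses f_t(w) = 0.5 * (w - c_t)**2 with a slowly drifting minimizer c_t.

    All modelling choices here (quadratic loss, linear drift, fixed step size,
    no projection) are illustrative assumptions, not taken from the paper.
    """
    w = 0.0
    minimizers = [drift * t for t in range(T)]  # the clairvoyant's choices c_t

    dynamic_regret = 0.0
    for c in minimizers:
        # Learner's loss f_t(w_t); the clairvoyant's loss f_t(c_t) is zero here.
        dynamic_regret += 0.5 * (w - c) ** 2
        grad = w - c          # true gradient feedback at the played point
        w = w - eta * grad    # online gradient descent update

    # Path variation: total distance travelled by the sequence of minimizers.
    path_variation = sum(abs(b - a) for a, b in zip(minimizers, minimizers[1:]))
    return dynamic_regret, path_variation


if __name__ == "__main__":
    for drift in (0.0, 0.01, 0.1):
        regret, variation = simulate(drift=drift)
        print(f"drift={drift}: dynamic regret={regret:.4f}, path variation={variation:.4f}")
```

In this toy setting a stationary minimizer gives zero path variation and zero dynamic regret, and a faster drift increases both, which is the qualitative relationship the variation-based bounds formalize.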

PUBLICATION RECORD

Publication year: 2016
Venue: International Conference on Machine Learning
Publication date: 2016-05-16
Fields of study: Mathematics, Computer Science
Identifiers: arXiv 1605.04638
External record: Semantic Scholar
Source metadata: Semantic Scholar

REFERENCES

Online Optimization: Competing with Dynamic Comparators
2015, cited by this paper
Regret bounded by gradual variation for online convex optimization
2014, cited by this paper
Optimization, Learning, and Games with Predictable Sequences
2013, cited by this paper
Beating Bandits in Gradually Evolving Worlds
2013, cited by this paper
Online Optimization in Dynamic Environments
2013, cited by this paper
Non-Stationary Stochastic Optimization
2013, influential reference
A new look at shifting regret
2012, cited by this paper
Unified Algorithms for Online Learning and Competitive Analysis
2012, cited by this paper
Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback
2010, influential reference
Online convex optimization in the bandit setting: gradient descent without a gradient
2004, cited by this paper
Tracking the Best Expert
1995, cited by this paper
Online Optimization with Gradual Variations (25th Annual Conference on Learning Theory)
year unknown, cited by this paper

CITED BY

Parameter-free Dynamic Regret: Time-varying Movement Costs, Delayed Feedback, and Memory
2026, cites this paper
Online time series prediction using feature adjustment
2025, cites this paper
Label Shift Meets Online Learning: Ensuring Consistent Adaptation with Universal Dynamic Regret
2025, cites this paper
Online Learning With Non-convex Losses: New Condition To Achieve Small Dynamic Regret
2025, cites this paper
On the Dynamic Regret of Following the Regularized Leader: Optimism with History Pruning
2025, cites this paper
One-Point Sampling for Distributed Bandit Convex Optimization With Time-Varying Constraints
2025, cites this paper
Adaptive Estimation and Learning under Temporal Distribution Shift
2025, cites this paper
Learning to Bid in Non-Stationary Repeated First-Price Auctions
2025, influential citation
Dynamic Regret Reduces to Kernelized Static Regret
2025, cites this paper
Nesterov Accelerated Gradient Tracking With Adam for Distributed Online Optimization
2025, cites this paper
Efficient Hyperparameter Search for Non-Stationary Model Training
2025, cites this paper
Distributed Dynamic Associative Memory via Online Convex Optimization
2025, cites this paper
Efficient Utility-Preserving Machine Unlearning with Implicit Gradient Surgery
2025, cites this paper
Universal Dynamic Regret and Constraint Violation Bounds for Constrained Online Convex Optimization
2025, cites this paper
Non-Stationary Bandit Convex Optimization: An Optimal Algorithm with Two-Point Feedback
2025, cites this paper
Revisiting Clustering of Neural Bandits: Selective Reinitialization for Mitigating Loss of Plasticity
2025, cites this paper
Online Optimization Under Randomly Corrupted Attacks
2024, influential citation
A Framework for Time-Varying Optimization via Derivative Estimation
2024, cites this paper
Universal Online Convex Optimization With Minimax Optimal Second-Order Dynamic Regret
2024, influential citation
Forgetting-Factor Regrets for Online Convex Optimization
2024, cites this paper
Quantum Algorithm for Sparse Online Learning with Truncated Gradient Descent
2024, cites this paper
Online convex optimization for constrained control of nonlinear systems
2024, cites this paper
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
2024, influential citation
Online Non-convex Learning in Dynamic Environments
2024, cites this paper
Online Nonconvex Bilevel Optimization with Bregman Divergences
2024, cites this paper
Online Convex Optimization with Memory and Limited Predictions
2024, cites this paper
Decentralized Online Riemannian Optimization with Dynamic Environments
2024, cites this paper
Efficient Non-stationary Online Learning by Wavelets with Applications to Online Distribution Shift Adaptation
2024, cites this paper
Online Learning of Partitions in Additively Separable Hedonic Games
2024, influential citation
Tracking Nonstationary Streaming Data via Exponentially Weighted Moving Average Stochastic Gradient Descent
2024, cites this paper
Non-stationary Online Convex Optimization with Arbitrary Delays
2024, cites this paper
Distributed Event-Triggered Bandit Convex Optimization With Time-Varying Constraints
2024, cites this paper
A conversion theorem and minimax optimality for continuum contextual bandits
2024, cites this paper
An Equivalence Between Static and Dynamic Regret Minimization
2024, cites this paper
Particle-based Online Bayesian Sampling
2023, influential citation
Dynamic Regret Bounds for Constrained Online Nonconvex Optimization Based on Polyak–Lojasiewicz Regions
2023, cites this paper
Learning Rate Schedules in the Presence of Distribution Shift
2023, influential citation
Learning-to-Learn to Guide Random Search: Derivative-Free Meta Blackbox Optimization on Manifold
2023, cites this paper
Efficient Online Learning with Memory via Frank-Wolfe Optimization: Algorithms with Bounded Dynamic Regret and Applications to Control
2023, cites this paper
Online (Non-)Convex Learning via Tempered Optimism
2023, influential citation
Unconstrained Dynamic Regret via Sparse Coding
2023, cites this paper
Adapting to Continuous Covariate Shift via Online Density Ratio Estimation
2023, cites this paper
Universal Online Optimization in Dynamic Environments via Uniclass Prediction
2023, cites this paper
Improved Dynamic Regret for Online Frank-Wolfe
2023, cites this paper
Online Constrained Meta-Learning: Provable Guarantees for Generalization
2023, cites this paper
A Stability Principle for Learning under Non-Stationarity
2023, cites this paper
Adaptive, Doubly Optimal No-Regret Learning in Strongly Monotone and Exp-Concave Games with Gradient Feedback
2023, cites this paper
Online Convex Optimization with Switching Cost and Delayed Gradients
2023, cites this paper
Online mixed discrete and continuous optimization: Algorithms, regret analysis and applications
2023, influential citation
Non-Convex Bilevel Optimization with Time-Varying Objective Functions
2023, cites this paper
Towards Fair Disentangled Online Learning for Changing Environments
2023, cites this paper
Online Label Shift: Optimal Dynamic Regret meets Practical Algorithms
2023, cites this paper
Non-stationary Delayed Online Convex Optimization: From Full-information to Bandit Setting
2023, cites this paper
Non-stationary Projection-free Online Learning with Dynamic and Adaptive Regret Guarantees
2023, influential citation
The Nonstationary Newsvendor with (and Without) Predictions
2023, cites this paper
Structured Dynamic Pricing: Optimal Regret in a Global Shrinkage Model
2023, cites this paper
MNL-Bandit in non-stationary environments
2023, cites this paper
Dynamic Regret of Online Markov Decision Processes
2022, cites this paper
First Order Online Optimisation Using Forward Gradients in Over-Parameterised Systems
2022, cites this paper
First order online optimisation using forward gradients under Polyak-Łojasiewicz condition
2022, cites this paper
On Dynamic Regret and Constraint Violations in Constrained Online Convex Optimization
2022, cites this paper
Meta-Learning in Games
2022, cites this paper
Dynamic Regret Bounds without Lipschitz Continuity: Online Convex Optimization with Multiple Mirror Descent Steps
2022, cites this paper
Dynamic regret of adaptive gradient methods for strongly convex problems
2022, cites this paper
Random Coordinate Descent for Resource Allocation in Open Multiagent Systems
2022, cites this paper
Optimal Tracking in Prediction with Expert Advice
2022, cites this paper
Provable Guarantees for Meta-Safe Reinforcement Learning
2022, cites this paper
Online Resource Optimization for Elastic Stream Processing with Regret Guarantee
2022, cites this paper
Online Bilevel Optimization: Regret Analysis of Online Alternating Gradient Methods
2022, cites this paper
Adapting to Online Label Shift with Provable Guarantees
2022, influential citation
Optimal Dynamic Regret in Proper Online Learning with Strongly Convex Losses and Beyond
2022, cites this paper
Online PAC-Bayes Learning
2022, cites this paper
No-Regret Learning in Time-Varying Zero-Sum Games
2022, cites this paper
Optimal Dynamic Regret in LQR Control
2022, cites this paper
Dynamic Regret of Online Mirror Descent for Relatively Smooth Convex Cost Functions
2022, cites this paper
Smoothed Online Convex Optimization Based on Discounted-Normal-Predictor
2022, cites this paper
A Survey of Decentralized Online Learning
2022, cites this paper
Second Order Path Variationals in Non-Stationary Online Learning
2022, influential citation
Stochastic Zeroth-Order Optimization under Nonstationarity and Nonconvexity
2022, cites this paper
A Survey on Distributed Online Optimization and Game
2022, cites this paper
Convergence of the Inexact Online Gradient and Proximal-Gradient Under the Polyak-Łojasiewicz Condition
2021, cites this paper
An Adaptive News-Driven Method for CVaR-sensitive Online Portfolio Selection in Non-Stationary Financial Markets
2021, cites this paper
Dynamic Regret Bounds for Online Nonconvex Optimization
2021, cites this paper
Improving Dynamic Regret in Distributed Online Mirror Descent Using Primal and Dual Information
2021, cites this paper
Adaptive Client Sampling in Federated Learning via Online Learning with Bandit Feedback
2021, cites this paper
Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization
2021, influential citation
Proximal Online Gradient Is Optimum for Dynamic Regret: A General Lower Bound
2021, cites this paper
Adaptive Importance Sampling for Finite-Sum Optimization and Sampling with Decreasing Step-Sizes
2021, cites this paper
Dynamic Online Learning via Frank-Wolfe Algorithm
2021, cites this paper
Online Stochastic Gradient Methods Under Sub-Weibull Noise and the Polyak-Łojasiewicz Condition
2021, cites this paper
An Optimal Reduction of TV-Denoising to Adaptive Online Learning
2021, cites this paper
A closer look at temporal variability in dynamic online learning
2021, cites this paper
Revisiting Smoothed Online Learning
2021, cites this paper
Time-Varying Optimization of Networked Systems With Human Preferences
2021, cites this paper
Optimal Dynamic Regret in Exp-Concave Online Learning
2021, cites this paper
Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization
2021, cites this paper
Projection-free Online Learning in Dynamic Environments
2021, cites this paper
Online Continual Adaptation with Active Self-Training
2021, cites this paper
Online Zeroth-order Optimisation on Hadamard Manifolds
2021, cites this paper
Adaptive Online Estimation of Piecewise Polynomial Trends
2020, cites this paper