Learning Policies for Contextual Submodular Prediction

S. Ross,Jiaji Zhou,Yisong Yue,Debadeepta Dey,J. Bagnell

Published 2013 in International Conference on Machine Learning

ABSTRACT

Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a set or list of options. Such lists are often evaluated using submodular reward functions that measure both quality and diversity. We propose a simple, efficient, and provably near-optimal approach to optimizing such prediction problems based on noregret learning. Our method leverages a surprising result from online submodular optimization: a single no-regret online learner can compete with an optimal sequence of predictions. Compared to previous work, which either learn a sequence of classifiers or rely on stronger assumptions such as realizability, we ensure both data-efficiency as well as performance guarantees in the fully agnostic setting. Experiments validate the efficiency and applicability of the approach on a wide range of problems including manipulator trajectory optimization, news recommendation and document summarization.

PUBLICATION RECORD

Publication year
2013
Venue
International Conference on Machine Learning
Publication date
2013-05-11
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1305.2532
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Online learning to diversify from implicit feedback
2012influential reference
The Multiplicative Weights Update Method: a Meta-Algorithm and Applications
2012cited by this paper
Contextual Sequence Prediction with Application to Control Library Optimization
2012influential reference
Agnostic System Identification for Model-Based Reinforcement Learning
2012cited by this paper
Multiple Choice Learning: Learning to Produce Multiple Structured Outputs
2012cited by this paper
Learning Mixtures of Submodular Shells with Application to Document Summarization
2012cited by this paper
Maximizing Non-Monotone Submodular Functions
2011influential reference
Learning message-passing inference machines for structured prediction
2011cited by this paper
Linear Submodular Bandits and their Application to Diversified Retrieval
2011influential reference
Learning Determinantal Point Processes
2011influential reference
A Class of Submodular Functions for Document Summarization
2011cited by this paper
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
2010cited by this paper
Multi-document Summarization via Budgeted Maximization of Submodular Functions
2010cited by this paper
Online Learning of Assignments
2009cited by this paper
CHOMP: Gradient optimization techniques for efficient motion planning
2009cited by this paper
Predicting diverse subsets using structural SVMs
2008cited by this paper
An Online Algorithm for Maximizing Submodular Functions
2008influential reference
Learning diverse rankings with multi-armed bandits
2008influential reference
Overview of DUC 2005
2005cited by this paper
Efficient algorithms for online decision problems
2005influential reference
Error limiting reductions between classification tasks
2005cited by this paper
A support vector method for multivariate performance measures
2005cited by this paper
ROUGE: A Package for Automatic Evaluation of Summaries
2004cited by this paper
Eecient Algorithms for Online Decision Problems
2003influential reference
The Nonstochastic Multiarmed Bandit Problem
2002influential reference
How to use expert advice
1997cited by this paper
The weighted majority algorithm
1989cited by this paper

CITED BY

Bayesian Optimization with Inexact Acquisition: Is Random Grid Search Sufficient?
2025cites this paper
Optimizing Partial Area Under the Top-k Curve: Theory and Practice
2022cites this paper
Learning to Rank under Evolving Consumer Reviews
2019cites this paper
Neural Network-Based Learning from Demonstration of an Autonomous Ground Robot
2019cites this paper
Adaptive Motion Planning
2018cites this paper
Learning to Learn for Small Sample Visual Recognition
2018cites this paper
Greedy Inference Algorithms for Structured and Neural Models
2018cites this paper
Smart And Autonomous Systems For Repair And Improvisation
2018cites this paper
Towards Generalization and Efficiency of Reinforcement Learning
2018cites this paper
First-principles–based reaction kinetics from reactive molecular dynamics simulations: Application to hydrogen peroxide decomposition
2018cites this paper
Learning to Search via Retrospective Imitation
2018cites this paper
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction
2017cites this paper
Recent advances in document summarization
2017cites this paper
Image classification with limited training data and class ambiguity
2017cites this paper
Learning Partial Policies to Speedup MDP Tree Search via Reduction to I.I.D. Learning
2017cites this paper
Learning from Natural Human Interactions for Assistive Robots
2016cites this paper
List prediction applied to motion planning
2016cites this paper
Analysis and Optimization of Loss Functions for Multiclass, Top-k, and Multilabel Classification
2016cites this paper
Smooth Interactive Submodular Set Cover
2015cites this paper
Deep Dependency Substructure-Based Learning for Multidocument Summarization
2015influential citation
Natural Language Direction Following for Robots in Unstructured Unknown Environments
2015cites this paper
An Invitation to Imitation
2015cites this paper
Model recommendation: Generating object detectors from few samples
2015cites this paper
Loss Functions for Top-k Error: Analysis and Insights
2015cites this paper
Learning preferences for manipulation tasks from online coactive feedback
2015cites this paper
Predicting Multiple Structured Visual Interpretations
2015cites this paper
SubmodBoxes: Near-Optimal Search for a Set of Diverse Object Proposals
2015cites this paper
Predicting Sets and Lists: Theory and Practice
2015cites this paper
Anytime Prediction: Efficient Ensemble Methods for Any Computational Budget
2014cites this paper
Online Submodular Maximization under a Matroid Constraint with Application to Learning Assignments
2014cites this paper
Visual chunking: A list prediction framework for region-based object detection
2014cites this paper
Reinforcement and Imitation Learning via Interactive No-Regret Learning
2014cites this paper
Interactive Learning for Sequential Decisions and Predictions
2013cites this paper
Knapsack Constrained Contextual Submodular List Prediction with Application to Multi-document Summarization
2013cites this paper