Explore, exploit, and explain: personalizing explainable recommendations with bandits

James McInerney,B. Lacker,Samantha Hansen,Karl Higley,Hugues Bouchard,Alois Gruson,Rishabh Mehrotra

Published 2018 in ACM Conference on Recommender Systems

ABSTRACT

The multi-armed bandit is an important framework for balancing exploration with exploitation in recommendation. Exploitation recommends content (e.g., products, movies, music playlists) with the highest predicted user engagement and has traditionally been the focus of recommender systems. Exploration recommends content with uncertain predicted user engagement for the purpose of gathering more information. The importance of exploration has been recognized in recent years, particularly in settings with new users, new items, non-stationary preferences and attributes. In parallel, explaining recommendations ("recsplanations") is crucial if users are to understand their recommendations. Existing work has looked at bandits and explanations independently. We provide the first method that combines both in a principled manner. In particular, our method is able to jointly (1) learn which explanations each user responds to; (2) learn the best content to recommend for each user; and (3) balance exploration with exploitation to deal with uncertainty. Experiments with historical log data and tests with live production traffic in a large-scale music recommendation service show a significant improvement in user engagement.

PUBLICATION RECORD

Publication year
2018
Venue
ACM Conference on Recommender Systems
Publication date
2018-09-27
Fields of study
Computer Science
Identifiers
DOI 10.1145/3240323.3240354
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Offline A/B Testing for Recommender Systems
2018cited by this paper
How algorithmic confounding in recommendation systems increases homogeneity and decreases utility
2017cited by this paper
User Preferences for Hybrid Explanations
2017cited by this paper
Off-policy evaluation for slate recommendation
2016cited by this paper
Deep Neural Networks for YouTube Recommendations
2016cited by this paper
Counterfactual Evaluation and Learning for Search, Recommendation and Ad Placement
2016cited by this paper
Improving Controllability and Predictability of Interactive Recommendation Interfaces for Exploratory Search
2015cited by this paper
Cascading Bandits: Learning to Rank in the Cascade Model
2015cited by this paper
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
2014cited by this paper
Matroid Bandits: Fast Combinatorial Optimization with Learning
2014cited by this paper
Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach
2013cited by this paper
TasteWeights: a visual interactive hybrid recommender system
2012cited by this paper
Doubly Robust Policy Evaluation and Learning
2011cited by this paper
A Taxonomy for Generating Explanations in Recommender Systems
2011cited by this paper
A contextual-bandit approach to personalized news article recommendation
2010cited by this paper
Factorization Machines
2010cited by this paper
Matrix Factorization Techniques for Recommender Systems
2009cited by this paper
Collaborative Filtering for Implicit Feedback Datasets
2008cited by this paper
Learning diverse rankings with multi-armed bandits
2008cited by this paper
A Survey of Explanations in Recommender Systems
2007cited by this paper
E-commerce intelligent agent: personalization travel support agent using Q Learning
2005cited by this paper
Reinforcement Learning: An Introduction
1998cited by this paper
Reinforcement Learning: : An Introduction
1998cited by this paper
Journal of Machine Learning Research? (????)??-?? Submitted?/??; Published?/?? An MDP-Based Recommender System ∗
year unknowncited by this paper

CITED BY

Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications
2026cites this paper
Deep Learning to Rank in Industrial Search Engines, Recommender Systems and Online Advertising: An Overview and New Perspectives
2026cites this paper
Comparing metabolic engineering scenarios using simulated design-build-test-learn-cycles
2026cites this paper
Explainable person–job recommendations: challenges, approaches, and comparative analysis
2025cites this paper
Feedback-Driven Gradual Discovery for Expanding Musical Preferences
2025cites this paper
Exploitation Over Exploration: Unmasking the Bias in Linear Bandit Recommender Offline Evaluation
2025cites this paper
Optimization of Epsilon-Greedy Exploration
2025cites this paper
Neural Contextual Bandits Under Delayed Feedback Constraints
2025cites this paper
Design and implementation of adaptive filtering-based recommendation systems for maximizing publisher-side revenue
2025cites this paper
Practical Federated Recommendation Model Learning Using ORAM with Controlled Privacy
2025cites this paper
Integrating Exploration into Case-Based Reasoning: A Strategy for Enhanced Knowledge Acquisition
2025cites this paper
Adapting LLMs for Personalized Evaluation of Explanations for Recommendations: A Meta-Learning Approach based on MAML
2025cites this paper
On the Analysis of Two-Stage Stochastic Bandit
2024cites this paper
Cart-State-Aware Discovery of E-Commerce Visitor Journeys with Process Mining
2024cites this paper
Explore versus repeat: insights from an online supermarket
2024cites this paper
Causal Feature Selection Method for Contextual Multi-Armed Bandits in Recommender System
2024cites this paper
Beyond Preferences in AI Alignment
2024cites this paper
Multi-Task Neural Linear Bandit for Exploration in Recommender Systems
2024cites this paper
Ranking Across Different Content Types: The Robust Beauty of Multinomial Blending
2024cites this paper
Empathic Responding for Digital Interpersonal Emotion Regulation via Content Recommendation
2024cites this paper
Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits
2024cites this paper
A Survey of Music Recommendation Systems
2024cites this paper
Aligning Explanations for Recommendation with Rating and Feature via Maximizing Mutual Information
2024cites this paper
Uncovering the Interaction Equation: Quantifying the Effect of User Interactions on Social Media Homepage Recommendations
2024cites this paper
Interactive preference analysis: A reinforcement learning framework
2024cites this paper
Does the Long Tail of Context Exist and Matter? The Case of Dialogue-based Recommender Systems
2024cites this paper
DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation
2024cites this paper
Dynamic Online Recommendation for Two-Sided Market with Bayesian Incentive Compatibility
2024cites this paper
Optimal Baseline Corrections for Off-Policy Contextual Bandits
2024cites this paper
Multi-Objective Recommendation via Multivariate Policy Learning
2024cites this paper
Analysis and Implications of Adopting AI and Machine Learning in Marketing, Servicing, and Communications Technology
2024cites this paper
Unleashing the Potential of Reinforcement Learning for Personalizing Behavioral Transformations with Digital Therapeutics: A Systematic Literature Review
2024cites this paper
Explainable data stream mining: Why the new models are better
2024cites this paper
Unlocking the 'Why' of Buying: Introducing a New Dataset and Benchmark for Purchase Reason and Post-Purchase Experience
2024cites this paper
Let's Get It Started: Fostering the Discoverability of New Releases on Deezer
2024cites this paper
Constrained contextual bandit algorithm for limited-budget recommendation system
2024cites this paper
Unpacking the exploration-exploitation tradeoff on Snapchat: The relationships between users' exploration-exploitation interests and server log data
2024cites this paper
Human Variability and the Explore-Exploit Trade-Off in Recommendation
2023cites this paper
Characterizing Impression-Aware Recommender Systems
2023cites this paper
Incorporating Impressions to Graph-Based Recommenders
2023cites this paper
Contextual Position Bias Estimation Using a Single Stochastic Logging Policy
2023cites this paper
Contextual Bandits for Hyper-Personalization based on User Behavior in Local Domain
2023cites this paper
Optimism Based Exploration in Large-Scale Recommender Systems
2023cites this paper
A survey on multi-objective recommender systems
2023cites this paper
Digitally nudging users to explore off-profile recommendations: here be dragons
2023cites this paper
Value of Exploration: Measurements, Findings and Algorithms
2023cites this paper
Interactive Content Diversity and User Exploration in Online Movie Recommenders: A Field Experiment
2023cites this paper
Leveling Up the Peloton Homescreen: A System and Algorithm for Dynamic Row Ranking
2023cites this paper
Quantum contextual bandits and recommender systems for quantum data
2023cites this paper
Beyond the comfort zone: digital nudges for off-profile recommendations
2023cites this paper
How Users Ride the Carousel: Exploring the Design of Multi-List Recommender Interfaces From a User Perspective
2023cites this paper
Self-determination through explanation: an ethical perspective on the implementation of the transparency requirements for recommender systems set by the Digital Services Act of the European Union
2023cites this paper
Impression-Aware Recommender Systems
2023cites this paper
Parallel Online Clustering of Bandits via Hedonic Game
2023cites this paper
Multi-list interfaces for recommender systems: survey and future directions
2023cites this paper
: A Personalized Music Recommendation Method Combining TextCNN and Attention
2023cites this paper
Impressions in Recommender Systems: Present and Future
2023cites this paper
Exploring Multi-Dimension User-Item Interactions With Attentional Knowledge Graph Neural Networks for Recommendation
2023cites this paper
BTSAMA: A Personalized Music Recommendation Method Combining TextCNN and Attention
2023cites this paper
Impatient Bandits: Optimizing Recommendations for the Long-Term Without Delay
2023cites this paper
Mod2Panel: A Design Framework for Model-Based Automated Generation of Interactive Panels
2023cites this paper
Understanding User Behavior in Carousel Recommendation Systems for Click Modeling and Learning to Rank
2023cites this paper
Evaluating Online Bandit Exploration In Large-Scale Recommender System
2023cites this paper
Interactive Personalization of Classifiers for Explainability using Multi-Objective Bayesian Optimization
2023cites this paper
A Contextual Bandit Approach for Network Service Selection
2023cites this paper
Recommendation of Mix-and-Match Clothing by Modeling Indirect Personal Compatibility
2023cites this paper
Long-Term Value of Exploration: Measurements, Findings and Algorithms
2023cites this paper
Leveraging Large Language Models in Conversational Recommender Systems
2023cites this paper
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits
2023cites this paper
Pessimistic Off-Policy Optimization for Learning to Rank
2022cites this paper
A survey on knowledge-aware news recommender systems
2022cites this paper
Scientific paper recommendation systems: a literature review of recent publications
2022cites this paper
Transparent and Explainable ML
2022cites this paper
Multi-Armed Bandits in Recommendation Systems: A survey of the state-of-the-art and future directions
2022cites this paper
Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds
2022cites this paper
TastePaths: Enabling Deeper Exploration and Understanding of Personal Preferences in Recommender Systems
2022cites this paper
“Knowing me, knowing you”: personalized explanations for a music recommender system
2022cites this paper
A Multi-Dimensional Conceptualization Framework for Personalized Explanations in Recommender Systems 11-23
2022influential citation
Modeling Position Bias Ranking for Streaming Media Services
2022cites this paper
The role of recommender systems in fostering consumers' long-term platform engagement
2022cites this paper
Social influence for societal interest: a pro-ethical framework for improving human decision making through multi-stakeholder recommender systems
2022cites this paper
Exploring post-hoc agnostic models for explainable cooking recipe recommendations
2022cites this paper
Machine learning and artificial intelligence use in marketing: a general taxonomy
2022cites this paper
Generating Recommendations with Post-Hoc Explanations for Citizen Science
2022cites this paper
Delayed Feedback in Generalised Linear Bandits Revisited
2022cites this paper
Predicting IPv4 services across all ports
2022cites this paper
Evaluating Recommender Systems: Survey and Framework
2022cites this paper
A Scalable Recommendation Engine for New Users and Items
2022cites this paper
The Long Tail of Context: Does it Exist and Matter?
2022cites this paper
A human-ML collaboration framework for improving video content reviews
2022cites this paper
Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model
2022cites this paper
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
2022cites this paper
Pessimistic Decision-Making for Recommender Systems
2022influential citation
Summarizing Sets of Related ML-Driven Recommendations for Improving File Management in Cloud Storage
2022cites this paper
Multi-Objective Recommendation: Overview and Challenges
2022cites this paper
Explainable software systems: from requirements analysis to system evaluation
2022cites this paper
A comparative study of item space visualizations for recommender systems
2022cites this paper
Exploitation and Exploration: Improving Search Precision on E-commerce Platforms
2021cites this paper
Fuzzy Tunes
2021cites this paper
Project 412Connect: Bridging Students and Communities
2021cites this paper