Differentiable Unbiased Online Learning to Rank

Published 2018 in International Conference on Information and Knowledge Management

ABSTRACT

Online Learning to Rank (OLTR) methods optimize rankers based on user interactions. State-of-the-art OLTR methods are built specifically for linear models. Their approaches do not extend well to non-linear models such as neural networks. We introduce an entirely novel approach to OLTR that constructs a weighted differentiable pairwise loss after each interaction: Pairwise Differentiable Gradient Descent (PDGD). PDGD breaks away from the traditional approach that relies on interleaving or multileaving and extensive sampling of models to estimate gradients. Instead, its gradient is based on inferring preferences between document pairs from user clicks and can optimize any differentiable model. We prove that the gradient of PDGD is unbiased w.r.t. user document pair preferences. Our experiments on the largest publicly available Learning to Rank (LTR) datasets show considerable and significant improvements under all levels of interaction noise. PDGD outperforms existing OLTR methods both in terms of learning speed as well as final convergence. Furthermore, unlike previous OLTR methods, PDGD also allows for non-linear models to be optimized effectively. Our results show that using a neural network leads to even better performance at convergence than a linear model. In summary, PDGD is an efficient and unbiased OLTR approach that provides a better user experience than previously possible.

PUBLICATION RECORD

Publication year
2018
Venue
International Conference on Information and Knowledge Management
Publication date
2018-09-22
Fields of study
Computer Science
Identifiers
DOI 10.1145/3269206.3271686 arXiv 1809.08415
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

On Application of Learning to Rank for E-Commerce Search
2017cited by this paper
Balancing Speed and Quality in Online Learning to Rank for Information Retrieval
2017influential reference
Online Learning to Rank in Stochastic Click Models
2017cited by this paper
Sensitive and Scalable Online Evaluation with Theoretical Guarantees
2017cited by this paper
A Theoretical Framework for Conversational Search
2017cited by this paper
Learning to Rank with Selection Bias in Personal Search
2016influential reference
Probabilistic Multileave Gradient Descent
2016cited by this paper
Fast Ranking with Additive Ensembles of Oblivious and Non-Oblivious Regression Trees
2016cited by this paper
Multileave Gradient Descent for Fast Online Learning to Rank
2016influential reference
Click Models for Web Search
2015cited by this paper
Online Rank Elicitation for Plackett-Luce: A Dueling Bandits Approach
2015cited by this paper
Probabilistic Multileave for Online Retrieval Evaluation
2015cited by this paper
An Introduction to Click Models for Web Search: SIGIR 2015 Tutorial
2015cited by this paper
Relative confidence sampling for efficient on-line ranker evaluation
2014cited by this paper
Online Exploration for Detecting Shifts in Fresh Intent
2014cited by this paper
Multileaved Comparisons for Fast Online Evaluation
2014cited by this paper
Global analytic solution of fully-observed variational Bayesian matrix factorization
2013cited by this paper
Ranked bandits in metric spaces: learning diverse rankings over large document collections
2013cited by this paper
How useful is social feedback for learning to rank YouTube videos?
2013cited by this paper
Learning to rank for recommender systems
2013cited by this paper
Introducing LETOR 4.0 Datasets
2013cited by this paper
Reusing historical interaction data for faster online learning to rank for IR
2013cited by this paper
Fast and reliable online learning to rank for information retrieval
2013cited by this paper
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
2012cited by this paper
Balancing Exploration and Exploitation in Learning to Rank Online
2011cited by this paper
A probabilistic method for inferring preferences from clicks
2011cited by this paper
Yahoo! Learning to Rank Challenge Overview
2010cited by this paper
From RankNet to LambdaRank to LambdaMART: An Overview
2010cited by this paper
Understanding the difficulty of training deep feedforward neural networks
2010cited by this paper
Keynote: The Web Changes Everything: Understanding and Supporting People in Dynamic Information Environments
2010cited by this paper
Test Collection Based Evaluation of Information Retrieval Systems
2010cited by this paper
Beyond position bias: examining result attractiveness as a source of presentation bias in clickthrough data
2010cited by this paper
Interactively optimizing information retrieval systems as a dueling bandits problem
2009influential reference
Evaluation of methods for relative comparison of retrieval systems based on clickthroughs
2009cited by this paper
Efficient multiple-click models in web search
2009influential reference
How does clickthrough data reflect retrieval quality?
2008cited by this paper
Learning diverse rankings with multi-armed bandits
2008cited by this paper
Million Query Track 2007 Overview
2007cited by this paper
Learning to rank: from pairwise approach to listwise approach
2007cited by this paper
LETOR: Benchmark Dataset for Research on Learning to Rank for Information Retrieval
2007cited by this paper
Optimizing search engines using clickthrough data
2002cited by this paper
Changes in relevance criteria and problem stages in task performance
2000cited by this paper

CITED BY

DSEBench: A Test Collection for Explainable Dataset Search with Examples
2025cites this paper
A Large-Scale Web Search Dataset for Federated Online Learning to Rank
2025cites this paper
Unlearning for Federated Online Learning to Rank: A Reproducibility Study
2025influential citation
CoDIME: A Counterfactual Approach for Dimension Importance Estimation through Click Logs
2025cites this paper
LT2R: Learning to Online Learning to Rank for Web Search
2024influential citation
Fairly Accurate: Optimizing Accuracy Parity in Fair Target-Group Detection
2024cites this paper
Optimizing Learning-to-Rank Models for Ex-Post Fair Relevance
2024cites this paper
Privacy Preserved Federated Learning for Online Ranking System (OLTR) for 6G Internet Technology
2024cites this paper
Unbiased Learning to Rank: On Recent Advances and Practical Applications
2024cites this paper
How to Forget Clients in Federated Online Learning to Rank?
2024influential citation
Learning-to-Rank with Nested Feedback
2024cites this paper
Investigating the Robustness of Counterfactual Learning to Rank Models: A Reproducibility Study
2024cites this paper
BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale
2024cites this paper
Meta-Learning to Rank for Sparsely Supervised Queries
2024cites this paper
TRAVERS: A Diversity-Based Dynamic Approach to Iterative Relevance Search over Knowledge Graphs
2023influential citation
Metric-agnostic Ranking Optimization
2023cites this paper
Contrasting Neural Click Models and Pointwise IPS Rankers
2023cites this paper
The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives
2023cites this paper
RAIFLE: Reconstruction Attacks on Interaction-based Federated Learning with Adversarial Data Manipulation
2023cites this paper
Recent Advancements in Unbiased Learning to Rank
2023cites this paper
RAIFLE: Reconstruction Attacks on Interaction-based Federated Learning with Active Data Manipulation
2023cites this paper
Towards Sequential Counterfactual Learning to Rank
2023cites this paper
Efficient Exploration and Exploitation for Sequential Music Recommendation
2023cites this paper
Optimizing Group-Fair Plackett-Luce Ranking Models for Relevance and Ex-Post Fairness
2023influential citation
An Analysis of Untargeted Poisoning Attack and Defense Methods for Federated Online Learning to Rank Systems
2023influential citation
Inference-time Stochastic Ranking with Risk Control
2023cites this paper
Adversarial Attacks on Online Learning to Rank with Stochastic Click Models
2023cites this paper
Mitigating Exploitation Bias in Learning to Rank with an Uncertainty-aware Empirical Bayes Approach
2023cites this paper
The Role of Relevance in Fair Ranking
2023cites this paper
Recent Advances in the Foundations and Applications of Unbiased Learning to Rank
2023cites this paper
On the Impact of Outlier Bias on User Clicks
2023cites this paper
Offline Evaluation of Ranked Lists using Parametric Estimation of Propensities
2022cites this paper
Reinforcement online learning to rank with unbiased reward shaping
2022influential citation
Learning Neural Ranking Models Online from Implicit User Feedback
2022influential citation
User Behavior Simulation for Search Result Re-ranking
2022cites this paper
External Evaluation of Ranking Models under Extreme Position-Bias
2022cites this paper
Do Lessons from Metric Learning Generalize to Image-Caption Retrieval?
2022cites this paper
Doubly-Robust Estimation for Unbiased Learning-to-Rank from Position-Biased Click Feedback
2022cites this paper
Implicit Feedback for Dense Passage Retrieval: A Counterfactual Approach
2022influential citation
Is Non-IID Data a Threat in Federated Online Learning to Rank?
2022cites this paper
Efficient Online Learning to Rank for Sequential Music Recommendation
2022cites this paper
Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling
2022cites this paper
ListMAP: Listwise learning to rank as maximum a posteriori estimation
2022cites this paper
Doubly Robust Estimation for Correcting Position Bias in Click Feedback for Unbiased Learning to Rank
2022cites this paper
Non-stationary Dueling Bandits for Online Learning to Rank
2022cites this paper
Can Clicks Be Both Labels and Features?: Unbiased Behavior Feature Collection and Uncertainty-aware Learning to Rank
2022cites this paper
Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback
2022influential citation
Learning to Rank for Test Case Prioritization
2022cites this paper
A General Framework for Pairwise Unbiased Learning to Rank
2022cites this paper
RUCIR21 at the NTCIR-16 ULTRE Task
2022cites this paper
Improving Effectiveness and Security in Federated Online Learning to Rank
2022cites this paper
Overview of the NTCIR-16 Unbiased Learning to Rank Evaluation (ULTRE) Task
2022cites this paper
Reinforcement Learning to Rank Using Coarse-grained Rewards
2022cites this paper
Addressing Cold Start in Product Search via Empirical Bayes
2022cites this paper
Are Neural Click Models Pointwise IPS Rankers?
2022cites this paper
ULTRE Framework: a Framework for Unbiased Learning to Rank Evaluation based on Simulation of User Behavior
2021cites this paper
Calibrating Explore-Exploit Trade-off for Fair Online Learning to Rank
2021cites this paper
Beyond Relevance Ranking: A General Graph Matching Framework for Utility-Oriented Learning to Rank
2021influential citation
Mixture-Based Correction for Position and Trust Bias in Counterfactual Learning to Rank
2021cites this paper
ULTRA: An Unbiased Learning To Rank Algorithm Toolbox
2021influential citation
How do Online Learning to Rank Methods Adapt to Changes of Intent?
2021influential citation
Interactive Information Retrieval with Bandit Feedback
2021cites this paper
Effective and Privacy-preserving Federated Online Learning to Rank
2021influential citation
Propensity-Independent Bias Recovery in Offline Learning-to-Rank Systems
2021cites this paper
Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness
2021influential citation
Unbiased Learning to Rank in Feeds Recommendation
2021cites this paper
Toward User Engagement Optimization in 2D Presentation
2021cites this paper
PairRank: Online Pairwise Learning to Rank by Divide-and-Conquer
2021influential citation
Robust Generalization and Safe Query-Specializationin Counterfactual Learning to Rank
2021cites this paper
Neural embedding-based specificity metrics for pre-retrieval query performance prediction
2020cites this paper
Neural Embedding-Based Metrics for Pre-retrieval Query Performance Prediction
2020cites this paper
Counterfactual Online Learning to Rank
2020influential citation
Learning from user interactions with rankings
2020influential citation
Unifying Online and Counterfactual Learning to Rank: A Novel Counterfactual Estimator that Effectively Utilizes Online Interventions
2020influential citation
Unbiased Learning to Rank: Counterfactual and Online Approaches
2020cites this paper
CPR: Collaborative Pairwise Ranking for Online List Recommendations
2020influential citation
Correcting for Selection Bias in Learning-to-rank Systems
2020cites this paper
Debiasing Learning to Rank Models with Generative Adversarial Networks
2020cites this paper
Entity Summarization with User Feedback
2020cites this paper
Cascade Model-based Propensity Estimation for Counterfactual Learning to Rank
2020cites this paper
Unbiased Learning to Rank
2020influential citation
Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model
2019cites this paper
Unbiased Learning to Rank: Counterfactual and Online Approaches
2019influential citation
Cascading Hybrid Bandits: Online Learning to Rank for Relevance and Diversity
2019cites this paper
Learning to Rank in Theory and Practice: From Gradient Boosting to Neural Networks and Unbiased Learning
2019cites this paper
To Model or to Intervene: A Comparison of Counterfactual and Online Learning to Rank from User Interactions
2019influential citation
Variance Reduction in Gradient Exploration for Online Learning to Rank
2019cites this paper
Optimizing Ranking Models in an Online Setting
2019influential citation
A Contextual-Bandit Approach to Online Learning to Rank for Relevance and Diversity
2019cites this paper
A Fast and Accurate Intervention-Aware Estimator
year unknowncites this paper
Information Processing and Management
year unknowncites this paper
UvA-DARE (Digital Academic Repository) A Contextual-Bandit Approach to Online Learning to Rank for Relevance and Diversity
year unknowncites this paper
An Open SERP Mining Infrastructure for the Archive Query Log
year unknowncites this paper