Misaligned by Design: Incentive Failures in Machine Learning
David Autor, Andrew Caplin, Daniel Martin, Philip Marx
Published 2025 in Social Science Research Network

ABSTRACT
The cost of error in many high-stakes settings is asymmetric: misdiagnosing pneumonia when absent is an inconvenience, but failing to detect it when present can be life-threatening. Because of this, artificial intelligence (AI) models used to assist such decisions are frequently trained with asymmetric loss functions that incorporate human decision-makers' trade-offs between false positives and false negatives. In two focal applications, we show that this standard alignment practice can backfire. In both cases, it would be better to train the machine learning model with a loss function that ignores the human's objective and then adjust predictions ex post according to that objective. We rationalize this result using an economic model of incentive design with endogenous information acquisition. The key insight from our theoretical framework is that machine classifiers perform not one but two incentivized tasks: choosing how to classify and learning how to classify. We show that while the adjustments engineers use correctly incentivize choosing, they can simultaneously reduce the incentives to learn. Our formal treatment of the problem reveals that methods embraced for their intuitive appeal can in fact misalign human and machine objectives in predictable ways.
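To make the contrast in the abstract concrete, the sketch below compares the two approaches on synthetic data: one classifier trained with a cost-weighted (asymmetric) loss, and one trained with a plain unweighted loss whose predicted probabilities are then thresholded ex post. With false-positive and false-negative costs C_FP and C_FN, the cost-minimizing ex-post rule flags a case whenever the predicted probability p satisfies p >= C_FP / (C_FP + C_FN), since p * C_FN >= (1 - p) * C_FP at that cutoff. This is a minimal illustration under stated assumptions, not the paper's implementation: the dataset, the 10:1 cost ratio, and the choice of scikit-learn's LogisticRegression are all hypothetical stand-ins.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical asymmetric costs: a false negative (e.g., missed pneumonia)
# is ten times as costly as a false positive.
C_FP, C_FN = 1.0, 10.0

# Synthetic stand-in data; the paper's focal applications are not reproduced here.
X, y = make_classification(n_samples=5000, weights=[0.9], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# (a) Common practice critiqued in the paper: bake the human's trade-off
# into training via a class-weighted (asymmetric) loss.
weighted = LogisticRegression(class_weight={0: C_FP, 1: C_FN}).fit(X_tr, y_tr)
pred_weighted = weighted.predict(X_te)

# (b) Alternative the abstract describes: train with an unweighted loss,
# then adjust ex post by thresholding predicted probabilities at
# C_FP / (C_FP + C_FN), the cost-minimizing cutoff.
plain = LogisticRegression().fit(X_tr, y_tr)
cutoff = C_FP / (C_FP + C_FN)
pred_expost = (plain.predict_proba(X_te)[:, 1] >= cutoff).astype(int)

def expected_cost(y_true, y_pred):
    """Average realized cost under the asymmetric cost structure."""
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    return (C_FP * fp + C_FN * fn) / len(y_true)

print("weighted-loss training:", expected_cost(y_te, pred_weighted))
print("ex-post adjustment:    ", expected_cost(y_te, pred_expost))
```

On any particular dataset either approach may come out ahead; the paper's theoretical point is that the weighted loss changes not only how the trained model classifies but what it is incentivized to learn, which is why the ex-post adjustment can dominate.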
PUBLICATION RECORD
- Publication year: 2025
- Venue: Social Science Research Network
- Publication date: 2025-11-01
- Fields of study: Computer Science, Economics
- Source metadata: Semantic Scholar