Unsupervised Recalibration

Published 2019 in arXiv.org

ABSTRACT

Unsupervised recalibration (URC) is a general way to improve the accuracy of an already trained probabilistic classification or regression model upon encountering new data while deployed in the field. URC does not require any ground truth associated with the new field data. URC merely observes the model's predictions and recognizes when the training set is not representative of field data, and then corrects to remove any introduced bias. URC can be particularly useful when applied separately to different subpopulations observed in the field that were not considered as features when training the machine learning model. This makes it possible to exploit subpopulation information without retraining the model or even having ground truth for some or all subpopulations available. Additionally, if these subpopulations are the object of study, URC serves to determine the correct ground truth distributions for them, where naive aggregation methods, like averaging the model's predictions, systematically underestimate their differences.
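
The abstract does not spell out the URC algorithm itself, so the following is only a minimal sketch of a related, standard technique: EM re-estimation of class priors under prior probability shift, using nothing but the model's predicted probabilities on unlabeled field data. It is not the paper's method. The function names, the synthetic two-class Gaussian data, and the 90/10 field class balance are all assumptions made for illustration. The sketch shows the effect the abstract describes: averaging the model's predictions pulls the estimated class balance toward the training balance, while the recalibrated estimate does not.

```python
import numpy as np

def em_prior_shift(probs, train_priors, n_iter=200, tol=1e-10):
    """Estimate field class priors from model outputs alone (no labels).

    probs        : (n, k) class probabilities predicted by the trained model
    train_priors : (k,) class frequencies of the training set
    Returns (estimated field priors, recalibrated probabilities).
    Assumes only the class priors changed between training and field data.
    """
    probs = np.asarray(probs, dtype=float)
    train_priors = np.asarray(train_priors, dtype=float)
    priors = train_priors.copy()
    for _ in range(n_iter):
        # E-step: reweight each prediction by the current prior ratio.
        weighted = probs * (priors / train_priors)
        post = weighted / weighted.sum(axis=1, keepdims=True)
        # M-step: the new prior estimate is the mean recalibrated prediction.
        new_priors = post.mean(axis=0)
        converged = np.abs(new_priors - priors).max() < tol
        priors = new_priors
        if converged:
            break
    weighted = probs * (priors / train_priors)
    post = weighted / weighted.sum(axis=1, keepdims=True)
    return priors, post

def simulate_field_predictions(n, field_prior_pos, seed=0):
    """Hypothetical field data: class 0 ~ N(-1, 1), class 1 ~ N(+1, 1).

    The 'model' is the exact posterior for a balanced training set,
    p(1 | x) = sigmoid(2x), so any bias comes from the prior shift alone.
    """
    rng = np.random.default_rng(seed)
    y = rng.random(n) < field_prior_pos
    x = rng.normal(np.where(y, 1.0, -1.0), 1.0)
    p1 = 1.0 / (1.0 + np.exp(-2.0 * x))
    return np.column_stack([1.0 - p1, p1])

if __name__ == "__main__":
    probs = simulate_field_predictions(20000, field_prior_pos=0.1)
    naive = probs.mean(axis=0)                      # naive aggregation
    est, recal = em_prior_shift(probs, [0.5, 0.5])  # label-free correction
    print("naive average of predictions:", naive)
    print("EM-estimated field priors:   ", est)
```

To use subpopulation information in the way the abstract suggests, the same function could be called separately on the predictions belonging to each subpopulation, giving one prior estimate and one set of recalibrated probabilities per group without retraining the model.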

PUBLICATION RECORD

Publication year: 2019
Venue: arXiv.org
Publication date: 2019-08-24
Fields of study: Mathematics, Computer Science
Identifiers: arXiv 1908.09157
Source metadata: Semantic Scholar

CITED BY

Bayesian Quantification with Black-Box Estimators
2023, cites this paper