Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks

Shreyas Padhy, Zachary Nado, Jie Jessie Ren, J. Liu, Jasper Snoek, Balaji Lakshminarayanan

Published 2020 in arXiv.org

ABSTRACT

Accurate estimation of predictive uncertainty in modern neural networks is critical for achieving well-calibrated predictions and detecting out-of-distribution (OOD) inputs. The most promising approaches have focused predominantly on improving model uncertainty (e.g., deep ensembles and Bayesian neural networks) and on post-processing techniques for OOD detection (e.g., ODIN and Mahalanobis distance). However, there has been relatively little investigation into how the parametrization of the probabilities in discriminative classifiers affects the uncertainty estimates. The dominant method, softmax cross-entropy, results in misleadingly high confidences on OOD data and under covariate shift. We investigate alternative ways of formulating probabilities using (1) a one-vs-all formulation to capture the notion of "none of the above", and (2) a distance-based logit representation to encode uncertainty as a function of distance to the training manifold. We show that one-vs-all formulations can improve calibration on image classification tasks while matching the predictive performance of softmax, without incurring any additional training or test-time complexity.
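The two formulations named in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' implementation: the class centers, the Euclidean distance, the helper names, and the factor of 2 that maps zero distance to probability 1 are all assumptions made for illustration. It contrasts softmax, which always normalizes to a total of 1, with independent per-class sigmoids that can all be low at once ("none of the above").

```python
import math


def softmax(logits):
    """Softmax probabilities; they always sum to 1 across classes."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def one_vs_all_loss(logits, label):
    """Sum of K independent binary cross-entropies, one per class."""
    loss = 0.0
    for k, z in enumerate(logits):
        target = 1.0 if k == label else 0.0
        loss += softplus(z) - target * z
    return loss


def distance_logits(embedding, centers):
    """Assumed form: logit_k = -(Euclidean distance to class center k)."""
    return [-math.dist(embedding, c) for c in centers]


def distance_ova_probs(embedding, centers):
    """Per-class sigmoid on distance logits.

    The logits are non-positive, so sigmoid alone is at most 0.5; the
    factor 2 (a choice made for this sketch) rescales that to at most 1.
    """
    return [2.0 * sigmoid(z) for z in distance_logits(embedding, centers)]


# Hypothetical 2-D class centers and two query points.
centers = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]
near_point = (0.1, 0.0)   # close to class 0
far_point = (50.0, 0.0)   # far from every center

softmax_far = softmax(distance_logits(far_point, centers))
ova_near = distance_ova_probs(near_point, centers)
ova_far = distance_ova_probs(far_point, centers)

print("softmax, far point:", softmax_far)
print("one-vs-all, near point:", ova_near)
print("one-vs-all, far point:", ova_far)
```

On the far point, softmax still assigns most of its mass to the nearest class because only relative logit differences matter, whereas every one-vs-all probability is close to zero.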

PUBLICATION RECORD

Publication year: 2020
Venue: arXiv.org
Publication date: 2020-07-10
Fields of study: Mathematics, Computer Science
Identifiers: arXiv 2007.05134
Source metadata: Semantic Scholar

REFERENCES

One Versus all for deep Neural Network Incertitude (OVNNI) quantification
2020, cited by this paper
Understand It in 5 Minutes!? Skimming Famous Papers: Jacob Devlin et al.: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2020, cited by this paper
Small and Practical BERT Models for Sequence Labeling
2019, cited by this paper
A survey on Image Data Augmentation for Deep Learning
2019, cited by this paper
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
2019, influential reference
Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
2019, influential reference
Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift
2019, cited by this paper
Analyzing the role of model uncertainty for electronic health records
2019, cited by this paper
Isotropic Maximization Loss and Entropic Score: Fast, Accurate, Scalable, Unexposed, Turnkey, and Native Neural Networks Out-of-Distribution Detection
2019, influential reference
The Intriguing Effects of Focal Loss on the Calibration of Deep Neural Networks
2019, cited by this paper
Distance-Based Learning from Errors for Confidence Calibration
2019, cited by this paper
An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction
2019, cited by this paper
Why ReLU Networks Yield High-Confidence Predictions Far Away From the Training Data and How to Mitigate the Problem
2018, influential reference
A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks
2018, cited by this paper
Simple, Distributed, and Accelerated Probabilistic Programming
2018, cited by this paper
Deep-RBF Networks Revisited: Robust Classification with Rejection
2018, cited by this paper
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks
2017, cited by this paper
Scaling SGD Batch Size to 32K for ImageNet Training
2017, cited by this paper
Dermatologist-level classification of skin cancer with deep neural networks
2017, cited by this paper
DOC: Deep Open Classification of Text Documents
2017, cited by this paper
On Calibration of Modern Neural Networks
2017, influential reference
Large Batch Training of Convolutional Networks
2017, cited by this paper
End to End Learning for Self-Driving Cars
2016, cited by this paper
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
2016, cited by this paper
Concrete Problems in AI Safety
2016, cited by this paper
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks
2016, cited by this paper
Wide Residual Networks
2016, cited by this paper
Deep Residual Learning for Image Recognition
2015, cited by this paper
Weight Uncertainty in Neural Networks
2015, cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014, influential reference
Reading Digits in Natural Images with Unsupervised Feature Learning
2011, cited by this paper
Bayesian Methods for Adaptive Models
2011, cited by this paper
Composite Binary Losses
2009, cited by this paper
Learning Multiple Layers of Features from Tiny Images
2009, cited by this paper
Multiclass and Binary SVM Classification: Implications for Training and Classification Users
2008, cited by this paper
Image Classification Using SVMs: One-against-One Vs One-against-All
2007, cited by this paper
Strictly Proper Scoring Rules, Prediction, and Estimation
2007, cited by this paper
One-against-all multi-class SVM classification using reliability measures
2005, cited by this paper
Predicting good probabilities with supervised learning
2005, cited by this paper
In Defense of One-Vs-All Classification
2004, cited by this paper
Multi-category Classification by Soft-Max Combination of Binary Classifiers
2003, cited by this paper
The Comparison and Evaluation of Forecasters
1983, cited by this paper

CITED BY

Ego4OOD: Rethinking Egocentric Video Domain Generalization via Covariate Shift Scoring
2026, cites this paper
Specialized Convolutional Neural Network Models for Echolocation-Based Perception
2025, cites this paper
Deep Joint Distribution Optimal Transport for Universal Domain Adaptation on Time Series
2025, cites this paper
Optimal Transport and Adaptive Thresholding for Universal Domain Adaptation on Time Series
2025, cites this paper
Boosting Universal Domain Adaptation in Remote Sensing With Dual-Classifiers Consistency Discrimination and Cross-Domain Feature Mixup
2025, cites this paper
Deuce: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning
2024, cites this paper
One-vs-All Semi-Automatic Labeling Tool for Semantic Segmentation in Autonomous Driving
2024, cites this paper
Transitional Uncertainty with Layered Intermediate Predictions
2024, cites this paper
USDet: Unknown Ship Detection Based on Remote Sensing Optical Images
2024, cites this paper
UR2M: Uncertainty and Resource-Aware Event Detection on Microcontrollers
2024, cites this paper
Stochastic Binary Network for Universal Domain Adaptation
2024, cites this paper
SURE: SUrvey REcipes for Building Reliable and Robust Deep Networks
2024, cites this paper
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge
2024, cites this paper
Embracing Unknown Step by Step: Towards Reliable Sparse Training in Real World
2024, cites this paper
Robust Semi-Supervised Learning by Wisely Leveraging Open-Set Data
2024, cites this paper
Improving CLIP Robustness with Knowledge Distillation and Self-Training
2023, cites this paper
Unified Classification and Rejection: A One-versus-all Framework
2023, cites this paper
A Simple and Explainable Method for Uncertainty Estimation using Attribute Prototype Networks
2023, cites this paper
Open Set Recognition of Radar Signals Based on Time-Frequency Fusion and OVA Network
2023, cites this paper
Multiclass Alignment of Confidence and Certainty for Network Calibration
2023, cites this paper
A universal transfer network for machinery fault diagnosis
2023, cites this paper
Low-Resource Named Entity Recognition: Can One-vs-All AUC Maximization Help?
2023, cites this paper
One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training
2022, cites this paper
One Versus All for Deep Neural Network for Uncertainty (OVNNI) Quantification
2022, cites this paper
Invariant representation driven neural classifier for anti-QCD jet tagging
2022, cites this paper
UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs
2022, cites this paper
Robust and Deployable Gesture Recognition for Smartwatches
2022, cites this paper
A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration
2022, cites this paper
Expanding Low-Density Latent Regions for Open-Set Object Detection
2022, cites this paper
A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness
2022, cites this paper
Distributional Gaussian Processes Layers for Out-of-Distribution Detection
2022, cites this paper
SLOVA: Uncertainty Estimation Using Single Label One-Vs-All Classifier
2022, cites this paper
Harnessing Out-Of-Distribution Examples via Augmenting Content and Style
2022, cites this paper
Towards adaptive unknown authentication for universal domain adaptation by classifier paradox
2022, cites this paper
Latent Discriminant deterministic Uncertainty
2022, cites this paper
CrossMatch: Cross-Classifier Consistency Regularization for Open-Set Single Domain Generalization
2022, cites this paper
Uncertainty-Induced Transferability Representation for Source-Free Unsupervised Domain Adaptation
2022, cites this paper
Towards Improving Calibration in Object Detection Under Domain Shift
2022, cites this paper
Post-hoc estimators for learning to defer to an expert
2022, cites this paper
Handling Label Uncertainty for Camera Incremental Person Re-Identification
2022, cites this paper
Distributional Gaussian Process Layers for Outlier Detection in Image Segmentation
2021, influential citation
OVANet: One-vs-All Network for Universal Domain Adaptation
2021, cites this paper
OpenMatch: Open-set Consistency Regularization for Semi-supervised Learning with Outliers
2021, cites this paper
Training on Test Data with Bayesian Adaptation for Covariate Shift
2021, cites this paper
Energy-Based Open-World Uncertainty Modeling for Confidence Calibration
2021, influential citation
Evaluation of Out-of-Distribution Detection Performance of Self-Supervised Learning in a Controllable Environment
2020, cites this paper
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
2020, cites this paper
Encoding the Latent Posterior of Bayesian Neural Networks for Uncertainty Quantification
2020, cites this paper
Scale Development: Illustrating the Potential of State-of-the-Art Natural Language Processing
year unknown, influential citation