Deep Individual Active Learning: Safeguarding against Out-of-Distribution Challenges in Neural Networks

Published 2024 in Entropy

ABSTRACT

Active learning (AL) is a paradigm for purposefully selecting training data so that a model reaches good performance with fewer annotated samples. Typical strategies assume that the training pool shares the same distribution as the test set, an assumption that does not always hold in privacy-sensitive applications where annotating user data is challenging. In this study, we operate within an individual setting and leverage an active learning criterion that selects data points for labeling by minimizing the min-max regret on a small unlabeled test set sample. Our key contribution is an efficient algorithm that addresses the computational complexity of approximating this criterion for neural networks. Our results show that, especially in the presence of out-of-distribution data, the proposed algorithm reduces the required training set size by up to 15.4%, 11%, and 35.1% for the CIFAR10, EMNIST, and MNIST datasets, respectively.
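As a rough illustration of the kind of criterion the abstract describes, the sketch below scores each candidate by the worst case, over its possible labels, of the average regret on the unlabeled test points, and picks the candidate with the smallest score. It is one possible reading, not the paper's algorithm: this record does not give the exact criterion or its neural-network approximation. The regret used here is the standard pNML log-normalizer, and the array `genie_probs` (with its layout) and the names `pnml_regret` and `select_candidate` are hypothetical. The per-hypothesis probabilities are assumed to be precomputed by refitting a model for each hypothesized label.

```python
import numpy as np


def pnml_regret(genie_probs):
    """pNML regret: log of the summed per-label "genie" probabilities.

    genie_probs[..., y] is the probability given to label y by a model
    refit with the test point hypothetically labeled y (assumed precomputed).
    """
    return np.log(np.sum(genie_probs, axis=-1))


def select_candidate(genie_probs):
    """Pick the candidate with the smallest worst-case mean test regret.

    Assumed layout of genie_probs (hypothetical):
        (n_candidates, n_candidate_labels, n_test_points, n_test_labels)
    """
    regret = pnml_regret(genie_probs)             # (C, Yc, T)
    mean_test_regret = regret.mean(axis=-1)       # average over test points
    worst_case = mean_test_regret.max(axis=-1)    # max over candidate labels
    return int(np.argmin(worst_case)), worst_case


# Toy usage: 2 candidates, 2 candidate labels, 1 test point, 2 test labels.
toy = np.array([
    [[[0.9, 0.9]], [[0.9, 0.9]]],   # candidate 0: high regret either way
    [[[0.5, 0.5]], [[0.6, 0.6]]],   # candidate 1: low regret either way
])
best, worst = select_candidate(toy)
```

In this toy input, labeling candidate 1 leaves less worst-case uncertainty about the test point than labeling candidate 0, so the min-max rule prefers it. Computing `genie_probs` exactly would require one refit per hypothesized label, which is the cost the abstract says the proposed algorithm is designed to avoid.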

PUBLICATION RECORD

Publication year
2024
Venue
Entropy
Publication date
2024-01-31
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.3390/e26020129; PMID 38392384; PMCID 10887833
Source metadata
Semantic Scholar, PubMed

REFERENCES

Prediction-Oriented Bayesian Active Learning
2023, cited by this paper
Active Learning for Individual Data via Minimal Stochastic Complexity
2022, cited by this paper
DeepAL: Deep Active Learning in Python
2021, cited by this paper
Contrastive Coding for Active Learning under Class Distribution Mismatch
2021, cited by this paper
TLDR: Deep Learning-Based Automated Privacy Policy Annotation with Key Policy Highlights
2021, cited by this paper
Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation
2021, cited by this paper
SIMILAR: Submodular Information Measures Based Active Learning In Realistic Scenarios
2021, cited by this paper
Laplace Redux - Effortless Bayesian Deep Learning
2021, cited by this paper
Universal Active Learning via Conditional Mutual Information Minimization
2021, cited by this paper
Distribution Free Uncertainty for the Minimum Norm Solution of Over-parameterized Linear Regression
2021, cited by this paper
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering
2021, cited by this paper
The Predictive Normalized Maximum Likelihood for Over-parameterized Linear Regression with Norm Constraint: Regret and Double Descent
2021, cited by this paper
Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision
2020, cited by this paper
Amortized Conditional Normalized Maximum Likelihood
2020, cited by this paper
Learning, compression, and leakage: Minimising classification error via meta-universal compression principles
2020, cited by this paper
A Survey of Deep Active Learning
2020, cited by this paper
A Simple Framework for Contrastive Learning of Visual Representations
2020, cited by this paper
A New Look at an Old Problem: A Universal Learning Approach to Linear Regression
2019, cited by this paper
A Simple Baseline for Bayesian Uncertainty in Deep Learning
2019, cited by this paper
Variational Adversarial Active Learning
2019, cited by this paper
Deep pNML: Predictive Normalized Maximum Likelihood for Deep Neural Networks
2019, cited by this paper
Learning Loss for Active Learning
2019, cited by this paper
BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning
2019, cited by this paper
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data
2019, cited by this paper
Adversarial NLI: A New Benchmark for Natural Language Understanding
2019, cited by this paper
Minimax Active Learning Via Minimal Model Capacity
2019, cited by this paper
MaxiMin Active Learning in Overparameterized Model Classes
2019, cited by this paper
Universal Batch Learning with Log-Loss
2018, influential reference
Active Learning for Convolutional Neural Networks: A Core-Set Approach
2017, cited by this paper
Cost-Effective Active Learning for Deep Image Classification
2017, cited by this paper
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
2017, cited by this paper
Advances in Variational Inference
2017, cited by this paper
EMNIST: Extending MNIST to handwritten letters
2017, influential reference
Deep Bayesian Active Learning with Image Data
2017, cited by this paper
Stochastic Variational Deep Kernel Learning
2016, cited by this paper
Variational Inference: A Review for Statisticians
2016, cited by this paper
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
2015, influential reference
An Almost Optimal PAC Algorithm
2015, cited by this paper
Stochastic variational inference
2012, cited by this paper
The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web]
2012, influential reference
Convolutional neural networks applied to house numbers digit classification
2012, cited by this paper
Bayesian Active Learning for Classification and Preference Learning
2011, cited by this paper
Information-Based Complexity, Feedback and Dynamics in Convex Programming
2010, cited by this paper
On Sequentially Normalized Maximum Likelihood Models
2008, cited by this paper
Conditional NML Universal Models
2007, cited by this paper
Universal Prediction
1998, influential reference
Information-Based Objective Functions for Active Data Selection
1992, cited by this paper
Experiments
1986, cited by this paper

CITED BY

Out-of-Distribution Learning with Human Feedback
2024, cites this paper