Learning From Noisy Singly-labeled Data

A. Khetan, Zachary Chase Lipton, Anima Anandkumar

Published 2017 in International Conference on Learning Representations

ABSTRACT

Supervised learning depends on annotated examples, which are taken to be the ground truth. But these labels often come from noisy crowdsourcing platforms, like Amazon Mechanical Turk. Practitioners typically collect multiple labels per example and aggregate the results to mitigate noise (the classic crowdsourcing problem). Given a fixed annotation budget and unlimited unlabeled data, redundant annotation comes at the expense of fewer labeled examples. This raises two fundamental questions: (1) How can we best learn from noisy workers? (2) How should we allocate our labeling budget to maximize the performance of a classifier? We propose a new algorithm for jointly modeling labels and worker quality from noisy crowdsourced data. The algorithm is an alternating minimization that proceeds in rounds: it estimates worker quality from disagreement with the current model, then updates the model by optimizing a loss function that accounts for the current estimate of worker quality. Unlike previous approaches, our algorithm can estimate worker quality even with only one annotation per example. We establish a generalization error bound for models learned with our algorithm and show theoretically that, when worker quality is above a threshold, it is better to label many examples once than to label fewer examples multiple times. Experiments conducted on both ImageNet (with simulated noisy workers) and MS-COCO (using the real crowdsourced labels) confirm our algorithm's benefits.
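
The alternating rounds described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes binary labels, a single quality score per worker under symmetric noise, a uniform class prior, and a plain logistic-regression model fitted with NumPy. The function names (fit_logistic, alternate), the initial quality guess of 0.8, and the clipping range are hypothetical choices made only for illustration.

```python
import numpy as np

def fit_logistic(X, targets, steps=300, lr=0.5):
    """Fit logistic regression to soft targets in [0, 1] by gradient descent."""
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-X @ w))
        w -= lr * X.T @ (p - targets) / len(targets)
    return w

def alternate(X, labels, worker_ids, n_workers, rounds=3):
    """Alternate between fitting the model and re-estimating worker quality."""
    quality = np.full(n_workers, 0.8)  # hypothetical initial guess
    for _ in range(rounds):
        q = quality[worker_ids]
        # Soft target: P(true label = 1 | worker label, worker quality),
        # assuming symmetric noise and a uniform class prior.
        targets = np.where(labels == 1, q, 1.0 - q)
        w = fit_logistic(X, targets)
        # Worker quality: fraction of that worker's labels the model agrees with.
        preds = (X @ w > 0).astype(int)
        agree = (preds == labels).astype(float)
        counts = np.bincount(worker_ids, minlength=n_workers)
        sums = np.bincount(worker_ids, weights=agree, minlength=n_workers)
        quality = np.clip(sums / np.maximum(counts, 1), 0.05, 0.95)
    return w, quality

# Simulated data: every example is labeled exactly once by a random worker.
rng = np.random.default_rng(0)
n, d, n_workers = 2000, 5, 10
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = (X @ w_true > 0).astype(int)
true_quality = np.array([0.95] * 5 + [0.55] * 5)  # 5 reliable, 5 noisy workers
worker_ids = rng.integers(0, n_workers, size=n)
correct = rng.random(n) < true_quality[worker_ids]
labels = np.where(correct, y, 1 - y)

w_hat, est_quality = alternate(X, labels, worker_ids, n_workers)
accuracy = np.mean((X @ w_hat > 0).astype(int) == y)
print("estimated worker quality:", np.round(est_quality, 2))
print("accuracy against true labels:", round(float(accuracy), 3))
```

Because each worker is scored against the model's predictions rather than against other workers, the sketch needs only one label per example, which is the setting the abstract highlights.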

PUBLICATION RECORD

Publication year: 2017
Venue: International Conference on Learning Representations
Publication date: 2017-12-13
Fields of study: Mathematics, Computer Science
Identifiers: arXiv 1712.04577
External record: Semantic Scholar
Source metadata: Semantic Scholar

CITED BY

Meta-learning representations for learning from multiple annotators (2026, cites this paper)
Whose truth is it anyway? An experiment on annotation bias in times of factual opinion polarization (2025, influential citation)
Benefits of Online Tilted Empirical Risk Minimization: A Case Study of Outlier Detection and Robust Regression (2025, cites this paper)
Wisdom of the Crowd, Without the Crowd: A Socratic LLM for Asynchronous Deliberation on Perspectivist Data (2025, cites this paper)
Sanitizing Manufacturing Dataset Labels Using Vision-Language Models (2025, cites this paper)
Query Design for Crowdsourced Clustering: Effect of Cognitive Overload and Contextual Bias (2025, cites this paper)
crowd-hpo: Realistic Hyperparameter Optimization and Benchmarking for Learning from Crowds with Noisy Labels (2025, cites this paper)
A survey on learning from data with label noise via deep neural networks (2025, cites this paper)
Realistic Evaluation of Deep Partial-Label Learning Algorithms (2025, cites this paper)
FedTilt: Towards Multi-Level Fairness-Preserving and Robust Federated Learning (2025, cites this paper)
Theoretical Analysis of Weak-to-Strong Generalization (2024, cites this paper)
Learning from Noisy Labels via Conditional Distributionally Robust Optimization (2024, influential citation)
Retrieval-Augmented Generation with Estimation of Source Reliability (2024, cites this paper)
Fine-tuning Vision Classifiers On A Budget (2024, cites this paper)
Intelligent Maritime Radar Target Detection With Partial Annotation via Progressive Learning (2024, cites this paper)
DualPSNet: A Discrepant-Annotations-Driven Framework for Medical Image Segmentation from Dual-Polygon Supervision (2024, cites this paper)
FedTrans: Client-Transparent Utility Estimation for Robust Federated Learning (2024, cites this paper)
dopanim: A Dataset of Doppelganger Animals with Noisy Annotations from Multiple Humans (2024, cites this paper)
Efficiently Training Neural Networks for Imperfect Information Games by Sampling Information Sets (2024, cites this paper)
Annot-Mix: Learning with Noisy Class Labels from Multiple Annotators via a Mixup Extension (2024, cites this paper)
MAGI: Multi-Annotated Explanation-Guided Learning (2023, cites this paper)
Learning to complement with multiple humans (2023, cites this paper)
Label Robust and Differentially Private Linear Regression: Computational and Statistical Efficiency (2023, cites this paper)
Quality aspects of annotated data (2023, cites this paper)
Learning to Complement with Multiple Humans (LECOMH): Integrating Multi-rater and Noisy-Label Learning into Human-AI Collaboration (2023, cites this paper)
Stochastic co-teaching for training neural networks with unknown levels of label noise (2023, cites this paper)
Drawing the Same Bounding Box Twice? Coping Noisy Annotations in Object Detection with Repeated Labels (2023, cites this paper)
Leveraging Inter-Rater Agreement for Classification in the Presence of Noisy Labels (2023, cites this paper)
Learning from Crowds with Annotation Reliability (2023, cites this paper)
STARS: Spatial-Temporal Active Re-sampling for Label-Efficient Learning from Noisy Annotations (2023, cites this paper)
Enhancing Noise-Robust Losses for Large-Scale Noisy Data Learning (2023, cites this paper)
Which Examples Should be Multiply Annotated? Active Learning When Annotators May Disagree (2023, cites this paper)
Transferring Annotator- and Instance-Dependent Transition Matrix for Learning From Crowds (2023, influential citation)
Deep Learning From Crowdsourced Labels: Coupled Cross-entropy Minimization, Identifiability, and Regularization (2023, influential citation)
Multi-annotator Deep Learning: A Probabilistic Framework for Classification (2023, cites this paper)
Learning from multiple annotators for medical image segmentation (2023, cites this paper)
Near Optimal Private and Robust Linear Regression (2023, cites this paper)
Learning from Crowds with Mutual Correction-Based Co-Training (2022, cites this paper)
Enhancing Robust Text Classification via Category Description (2022, cites this paper)
Improved Input Reprogramming for GAN Conditioning (2022, cites this paper)
CROWDLAB: Supervised learning to infer consensus labels and quality scores for data with multiple annotators (2022, cites this paper)
Sources of Noise in Dialogue and How to Deal with Them (2022, cites this paper)
Learning from Noisy Pairwise Similarity and Unlabeled Data (2022, cites this paper)
Utilizing supervised models to infer consensus labels and their quality from data with multiple annotators (2022, cites this paper)
A feasibility study to assess Mediterranean Diet adherence using an AI-powered system (2022, cites this paper)
Bayesian Weak Supervision via an Optimal Transport Approach (2022, cites this paper)
Trustable Co-Label Learning From Multiple Noisy Annotators (2022, cites this paper)
Learning from Imbalanced Crowdsourced Labeled Data (2022, influential citation)
Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal (2022, cites this paper)
Learning from Multiple Annotator Noisy Labels via Sample-wise Label Fusion (2022, influential citation)
Beyond confusion matrix: learning from multiple annotators with awareness of instance features (2022, influential citation)
HAEM: Obtaining Higher-Quality Classification Task Results with AI Workers (2022, influential citation)
Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees (2022, cites this paper)
Diminishing Empirical Risk Minimization for Unsupervised Anomaly Detection (2022, cites this paper)
The Fault in Our Data Stars: Studying Mitigation Techniques against Faulty Training Data in Machine Learning Applications (2022, cites this paper)
Label Augmentation with Reinforced Labeling for Weak Supervision (2022, cites this paper)
Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision (2022, cites this paper)
LABNET: A Collaborative Method for DNN Training and Label Aggregation (2022, cites this paper)
A Survey on Programmatic Weak Supervision (2022, cites this paper)
Data Consistency for Weakly Supervised Learning (2022, cites this paper)
Active label cleaning for improved dataset quality under resource constraints (2021, cites this paper)
Learning from Subjective Ratings Using Auto-Decoded Deep Latent Embeddings (2021, cites this paper)
On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances, and Million-AID (2021, cites this paper)
A General Framework for Adversarial Label Learning (2021, cites this paper)
Few-Shot Upsampling for Protest Size Detection (2021, cites this paper)
Generation and Analysis of Feature-Dependent Pseudo Noise for Training Deep Neural Networks (2021, cites this paper)
Lan: Learning to Augment Noise Tolerance for Self-report Survey Labels (2021, cites this paper)
Learning from Multiple Annotators by Incorporating Instance Features (2021, cites this paper)
End-to-End Weak Supervision (2021, cites this paper)
Label noise in segmentation networks: mitigation must deal with bias (2021, cites this paper)
kNet: A Deep kNN Network To Handle Label Noise (2021, cites this paper)
Noise label learning through label confidence statistical inference (2021, cites this paper)
Deep neural learning on weighted datasets utilizing label disagreement from crowdsourcing (2021, cites this paper)
A Realistic Simulation Framework for Learning with Label Noise (2021, cites this paper)
Towards Robust Object Detection: Bayesian RetinaNet for Homoscedastic Aleatoric Uncertainty Modeling (2021, cites this paper)
Active label cleaning: Improving dataset quality under resource constraints (2021, cites this paper)
An instance-dependent simulation framework for learning with label noise (2021, cites this paper)
Creating Training Sets via Weak Indirect Supervision (2021, cites this paper)
Learning with Noisy Labels by Targeted Relabeling (2021, cites this paper)
A Fine-Grained Analysis on Distribution Shift (2021, cites this paper)
A Machine Learning Framework Towards Transparency in Experts' Decision Quality (2021, influential citation)
Fraud Detection under Multi-Sourced Extremely Noisy Annotations (2021, cites this paper)
End-to-End Annotator Bias Approximation on Crowdsourced Single-Label Sentiment Analysis (2021, cites this paper)
Detecting Atrial Fibrillation in ICU Telemetry data with Weak Labels (2021, cites this paper)
Instance-adaptive training with noise-robust losses against noisy labels (2021, cites this paper)
A Worker-Task Specialization Model for Crowdsourcing: Efficient Inference and Fundamental Limits (2021, cites this paper)
Estimating Formulas for Model Performance Under Noisy Labels Using Symbolic Regression (2021, cites this paper)
Examining Effect of Label Redundancy for Machine Learning using Crowdsourcing (2021, cites this paper)
Bayesian Regression from Multiple Sources of Weak Supervision (2021, cites this paper)
Clean or Annotate: How to Spend a Limited Data Collection Budget (2021, cites this paper)
Learning Collections of Functions (2021, cites this paper)
FLEA: Provably Robust Fair Multisource Learning from Unreliable Training Data (2021, cites this paper)
Multi-Class Classification from Noisy-Similarity-Labeled Data (2020, cites this paper)
On Robustness of Neural Architecture Search Under Label Noise (2020, cites this paper)
Towards Countering Hate Speech Against Journalists on Social Media (2020, cites this paper)
The Resistance to Label Noise in K-NN and DNN Depends on its Concentration (2020, cites this paper)
Harnessing Side Information for Classification Under Label Noise (2020, cites this paper)
Collecting Entailment Data for Pretraining: New Protocols and Negative Results (2020, cites this paper)
A Non-intrusive Correction Algorithm for Classification Problems with Corrupted Data (2020, cites this paper)
Ontology-driven weak supervision for clinical entity classification in electronic health records (2020, cites this paper)