Optimized Gradient Clipping for Noisy Label Learning

Xichen Ye, Yifan Wu, Weizhong Zhang, Xiaoqiang Li, Yifan Chen, Cheng Jin

Published 2024 in AAAI Conference on Artificial Intelligence

ABSTRACT

Previous research has shown that constraining the gradient of the loss function with respect to model-predicted probabilities can enhance model robustness against noisy labels. These methods typically use validation data to select a single fixed threshold for gradient clipping that yields the desired robustness against noise. However, this common practice overlooks how the gradient distributions of clean and noisy-labeled samples change across training stages, which significantly limits the model's ability to adapt to the variable nature of gradients throughout training. To address this issue, we propose a simple yet effective approach called Optimized Gradient Clipping (OGC), which dynamically adjusts the clipping threshold based on the ratio of noise gradients to clean gradients after clipping, estimated by modeling the distributions of clean and noisy samples. This approach allows us to modify the clipping threshold at each training step, effectively controlling the influence of noise gradients. Additionally, we provide statistical analysis to certify the noise-tolerance ability of OGC. Our extensive experiments across various types of label noise, including symmetric, asymmetric, instance-dependent, and real-world noise, demonstrate the effectiveness of our approach.
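The threshold-selection idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes cross-entropy loss, whose gradient magnitude with respect to the predicted probability p of the labeled class is 1/p, and it takes the per-sample noise estimates as given rather than fitting a distribution model. The names `noisy_weight` (estimated probability that a sample is mislabeled), `eps` (target noise-to-clean gradient ratio), and `candidates` (a grid of thresholds to try) are all hypothetical.

```python
def clipped_grad(p, tau):
    # Cross-entropy -log(p) has gradient magnitude 1/p w.r.t. p; cap it at tau.
    return min(1.0 / p, tau)


def noise_to_clean_ratio(probs, noisy_weight, tau):
    # Split the clipped gradient mass into an estimated noisy part and an
    # estimated clean part using the (assumed, externally supplied) weights.
    # Assumes at least one sample has clean weight > 0, so the divisor is nonzero.
    noisy = sum(w * clipped_grad(p, tau) for p, w in zip(probs, noisy_weight))
    clean = sum((1.0 - w) * clipped_grad(p, tau) for p, w in zip(probs, noisy_weight))
    return noisy / clean


def choose_threshold(probs, noisy_weight, eps, candidates):
    # Pick the largest candidate threshold whose ratio stays within eps;
    # fall back to the smallest candidate if none qualifies (a design choice
    # of this sketch, not something stated in the abstract).
    feasible = [t for t in candidates
                if noise_to_clean_ratio(probs, noisy_weight, t) <= eps]
    return max(feasible) if feasible else min(candidates)


# Toy batch: four confident (clean) predictions and two low-probability (noisy) ones.
probs = [0.9, 0.8, 0.7, 0.6, 0.1, 0.05]
noisy_weight = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0]
tau = choose_threshold(probs, noisy_weight, eps=0.75,
                       candidates=[1.0, 1.5, 2.0, 5.0, 10.0, 20.0])
print(tau)
```

In this toy batch, raising the threshold lets the low-probability samples contribute ever larger gradients, so the ratio grows with the threshold; rerunning the selection at each training step is what would make the threshold dynamic rather than fixed.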

PUBLICATION RECORD

Publication year
2024
Venue
AAAI Conference on Artificial Intelligence
Publication date
2024-12-12
Fields of study
Computer Science
Identifiers
DOI: 10.48550/arXiv.2412.08941
arXiv: 2412.08941
Source metadata
Semantic Scholar

CITED BY

Noise-aware weight updating in AdaBoost for handling mislabeled data
2026, cites this paper
Reconstruction of Three-Dimensional Temperature and Salinity in the Equatorial Ocean with Deep-Learning
2025, cites this paper
Noise-free prototype guided representation calibration under label noise
2025, cites this paper
Robust Learning of Diffusion Models with Extremely Noisy Conditions
2025, cites this paper
Active Negative Loss: A Robust Framework for Learning with Noisy Labels
2024, cites this paper
An Integrated YOLO and Confident Learning Model for Enhancing Object Detection in RoboCon
year unknown, cites this paper