Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels

Published 2018 in Neural Information Processing Systems

ABSTRACT

Deep neural networks (DNNs) have achieved tremendous success in a variety of applications across many disciplines. Yet, their superior performance comes with the expensive cost of requiring correctly annotated large-scale datasets. Moreover, due to DNNs' rich capacity, errors in training labels can hamper performance. To combat this problem, mean absolute error (MAE) has recently been proposed as a noise-robust alternative to the commonly-used categorical cross entropy (CCE) loss. However, as we show in this paper, MAE can perform poorly with DNNs and challenging datasets. Here, we present a theoretically grounded set of noise-robust loss functions that can be seen as a generalization of MAE and CCE. Proposed loss functions can be readily applied with any existing DNN architecture and algorithm, while yielding good performance in a wide range of noisy label scenarios. We report results from experiments conducted with CIFAR-10, CIFAR-100 and FASHION-MNIST datasets and synthetically generated noisy labels.
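
The abstract does not spell out the loss family, so the following is only a minimal sketch. It assumes the family takes the form L_q(p, y) = (1 - p_y^q) / q with 0 < q <= 1, where p_y is the softmax probability assigned to the given label. The function names, the toy logits, and the default q = 0.7 are illustrative choices, not taken from this record. Under that assumption, q = 1 gives half the MAE between the one-hot label and the prediction, and q approaching 0 recovers CCE.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with max-subtraction for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def lq_loss(probs, labels, q=0.7, eps=1e-12):
    """Per-example (1 - p_y**q) / q; the form of the loss is an assumption."""
    p_y = np.clip(probs[np.arange(len(labels)), labels], eps, 1.0)
    return (1.0 - p_y ** q) / q


def cce_loss(probs, labels, eps=1e-12):
    """Per-example categorical cross entropy, -log p_y."""
    p_y = np.clip(probs[np.arange(len(labels)), labels], eps, 1.0)
    return -np.log(p_y)


def mae_loss(probs, labels):
    """Per-example L1 distance between the one-hot label and the prediction."""
    one_hot = np.eye(probs.shape[1])[labels]
    return np.abs(one_hot - probs).sum(axis=1)


# Hypothetical toy batch: 3 examples, 3 classes.
logits = np.array([[2.0, 0.5, -1.0], [0.1, 0.2, 3.0], [1.0, 1.0, 1.0]])
labels = np.array([0, 2, 1])
probs = softmax(logits)

# q = 1 should match MAE up to a factor of 1/2.
print(np.allclose(lq_loss(probs, labels, q=1.0), 0.5 * mae_loss(probs, labels)))
# A very small q should be close to CCE.
print(np.allclose(lq_loss(probs, labels, q=1e-6), cce_loss(probs, labels), atol=1e-4))
```

Intermediate values of q trade the fast convergence of CCE against the noise tolerance of MAE, which is the sense in which the abstract calls the family a generalization of both.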

PUBLICATION RECORD

Publication year: 2018
Venue: Neural Information Processing Systems
Publication date: 2018-05-20
Fields of study: Mathematics, Computer Science, Medicine
Identifiers: arXiv 1805.07836; PMID 39839708; PMCID PMC11747755
Source metadata: Semantic Scholar, PubMed

REFERENCES

Using Trusted Data to Train Deep Networks on Labels Corrupted by Severe Noise
2018, cited by this paper
Masking: A New Perspective of Noisy Supervision
2018, cited by this paper
Iterative Learning with Open-set Noisy Labels
2018, influential reference
Joint Optimization Framework for Learning with Noisy Labels
2018, cited by this paper
Learning to Reweight Examples for Robust Deep Learning
2018, cited by this paper
Learning from Noisy Large-Scale Datasets with Minimal Supervision
2017, cited by this paper
Learning From Noisy Singly-labeled Data
2017, cited by this paper
Robust Loss Functions under Label Noise for Deep Neural Networks
2017, cited by this paper
Toward Robustness against Label Noise in Training Deep Discriminative Neural Networks
2017, cited by this paper
A Closer Look at Memorization in Deep Networks
2017, cited by this paper
Learning from Noisy Labels with Distillation
2017, cited by this paper
Learning with Confident Examples: Rank Pruning for Robust Classification with Noisy Labels
2017, cited by this paper
MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels
2017, cited by this paper
MentorNet: Regularizing Very Deep Neural Networks on Corrupted Labels
2017, cited by this paper
Understanding deep learning requires rethinking generalization
2016, cited by this paper
Deep Networks with Stochastic Depth
2016, cited by this paper
Training deep neural-networks using a noise adaptation layer
2016, cited by this paper
Learning Deep Networks from Noisy Labels with Dropout Regularization
2016, cited by this paper
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach
2016, influential reference
Human-level control through deep reinforcement learning
2015, cited by this paper
Auxiliary Image Regularization for Deep CNNs with Noisy Labels
2015, cited by this paper
Learning with Symmetric Label Noise: The Importance of Being Unhinged
2015, cited by this paper
Deep Residual Learning for Image Recognition
2015, influential reference
Learning from massive noisy labeled data for image classification
2015, cited by this paper
Learning Deconvolution Network for Semantic Segmentation
2015, cited by this paper
Classification in the Presence of Label Noise: A Survey
2014, cited by this paper
Training Convolutional Networks with Noisy Labels
2014, cited by this paper
Making risk minimization tolerant to label noise
2014, cited by this paper
Learning from Noisy Labels with Deep Neural Networks
2014, cited by this paper
Training Deep Neural Networks on Noisy Labels with Bootstrapping
2014, cited by this paper
Adam: A Method for Stochastic Optimization
2014, cited by this paper
Learning with Noisy Labels
2013, cited by this paper
ImageNet classification with deep convolutional neural networks
2012, cited by this paper
Support Vector Machines with the Ramp Loss and the Hard Margin Loss
2011, cited by this paper
Noise Tolerance Under Risk Minimization
2011, cited by this paper
Self-Paced Learning for Latent Variable Models
2010, cited by this paper
Maximum Lq-likelihood estimation.
2010, cited by this paper
ImageNet: A large-scale hierarchical image database
2009, cited by this paper
Learning Multiple Layers of Features from Tiny Images
2009, cited by this paper
Visualizing Data using t-SNE
2008, cited by this paper
On the Design of Loss Functions for Classification: theory, robustness to outliers, and SavageBoost
2008, cited by this paper
Nonlinear Programming Theory and Algorithms
2007, cited by this paper
An Analysis of Transformations
1964, cited by this paper
Classification with Noisy Labels by Importance Reweighting (IEEE Transactions on Pattern Analysis and Machine Intelligence)
year unknown, cited by this paper

CITED BY

Real-time roadworks detection and high definition (HD) map updates for autonomous vehicles
2026, cites this paper
PreMulBVD: A pretraining-based multi-modal binary vulnerability detection framework
2026, cites this paper
Correntropy meets cross-entropy: A robust loss against noisy labels
2026, cites this paper
Combating Noisy Labels through Fostering Self- and Neighbor-Consistency
2026, cites this paper
Identifying and Correcting Label Noise for Robust GNNs via Influence Contradiction
2026, cites this paper
A novel enhanced neural network for anomaly detection in the IoT environment
2026, cites this paper
Log-Polynomial Optimization
2026, cites this paper
S2CaT: Spatial–Spectral-Based CNN and Transformer Network for Hyperspectral Image Classification
2026, cites this paper
Toward Efficient Identification of Retinal Diseases: A Lightweight Convolutional Neural Network‐Based Approach Using Optical Coherence Tomography
2026, cites this paper
Optimal Transport Filtering for Robust Cross-Modal Retrieval with Open-Set Noisy Labels
2026, cites this paper
Robust domain adaptation using gram optimal transport for high variance environments
2026, cites this paper
KRAM: Knowledge-driven robust training against label noise for medication recommendation
2026, cites this paper
Robust colonoscopy polyp segmentation using dynamic-Nu T-Loss with multi-scale and uncertainty-aware adaptation
2026, cites this paper
Wharfree-Net: A Hybrid and Interpretable System for Left Atrial-to-Aortic Ratio Estimation in Veterinary Echocardiography
2026, cites this paper
Statistical and machine learning approaches integrated with CFD for effective thermal conductivity prediction of NEPCMs in natural convection between horizontal concentric cylinders
2026, cites this paper
ISRLNN: A software defect prediction method based on instance similarity reverse loss
2026, cites this paper
UAV-based quantitative crack measurement for bridges integrating four-point laser metric calibration and mamba segmentation
2026, cites this paper
Generalizing Abstention for Noise-Robust Learning in Medical Image Segmentation
2026, influential citation
Fine-grained Classification of A Million Life Trajectories from Wikipedia
2026, cites this paper
Gradients Must Earn Their Influence: Unifying SFT with Generalized Entropic Objectives
2026, cites this paper
Supervised contrastive learning-based adaptive multi-scale time–frequency network for motor imagery decoding
2026, cites this paper
Unsupervised Semantic Segmentation in Synchrotron Computed Tomography with Self-Correcting Pseudo Labels
2026, cites this paper
Image classification and recognition with mobile inverted bottleneck convolution fusion CBAM attention
2026, cites this paper
Enhancing Fairness Without Demographic Labels via Identifying and Mitigating Potential Biases
2026, cites this paper
Learning with less: A survey of deep learning in medical imaging under varying supervision levels
2026, cites this paper
NCSAM Noise-Compensated Sharpness-Aware Minimization for Noisy Label Learning
2026, cites this paper
Multimodal LiDAR Semantic Segmentation Based on Adaptive Collaborative Learning (基于自适应协同学习的多模态激光雷达语义分割)
2026, cites this paper
An Uncertainty-Aware Continual Learning Framework for Fault Diagnosis of Rotating Machinery With Homogeneous-Heterogeneous Faults
2026, cites this paper
DCDLNet: A label-noise tolerant classification algorithm for polsar images based on dual-band consistency and difference
2026, cites this paper
SABLM-VD: Vulnerability detection with a semantic-aware binary language model
2026, cites this paper
LGPNet: A dual-branch parallel network for cervical cell image classification
2026, cites this paper
Noisy-label-adaptive SegFormer parameter optimization for high-enthalpy flow PLIF image segmentation
2026, cites this paper
Clip-based road-marking detection with LLM-guided driving prompts
2026, cites this paper
Multiview Collaborative Learning to Handle Noisy Label for Building Extraction From Remote Sensing Image
2026, cites this paper
A Sequential Framework for Multi Class Grading of Diabetic Retinopathy Using Nelder-Mead Optimisation Algorithm and Evaluation with Adjusted Focal Loss
2026, cites this paper
Noise-Adaptive Regularization for Robust Multi-Label Remote Sensing Image Classification
2026, cites this paper
DrowsyDG-Phys: Generalizable driver drowsiness estimation in conditional automated vehicles using physiological signals.
2026, cites this paper
Deep Learning-Based Faulty Component Diagnosis of Transmission Channels in ATE Affected by Thermal Degradation
2026, cites this paper
ReMOND: A reinforcement learning-based multimodal wearable framework for real-time monitoring of respiratory infectious diseases
2026, cites this paper
The Label Horizon Paradox: Rethinking Supervision Targets in Financial Forecasting
2026, cites this paper
Label-wise reliability-aware classifier for robust chest X-ray multi-label classification
2026, cites this paper
HasLoss: a novel Hassanat distance-based loss functions for binary classification.
2026, cites this paper
Deep learning for crime analytics: A prioritization system for user reports from safety apps
2026, cites this paper
Multi-Scale Attention Unet: An Innovative Approach for Segmentation of Optic Disc and Optic Cup in Early Detection of Retinopathy
2026, cites this paper
Towards Multimodal Domain Generalization with Few Labels
2026, cites this paper
Advances In Adaptive Machine Learning Algorithms for Enhanced Security In IoT Networks
2026, cites this paper
From Calibration to Refinement: Seeking Certainty via Probabilistic Evidence Propagation for Noisy-Label Person Re-Identification
2026, cites this paper
FUMESNet: Exploring Frequency-Based Transformer and Improving Skip Connection for Hyperspectral Methane Plume Segmentation
2026, cites this paper
RA-Nav: A Risk-Aware Navigation System Based on Semantic Segmentation for Aerial Robots in Unpredictable Environments
2026, cites this paper
Healthcare applications of 0-1 neural networks in prescriptive problems with observational data
2026, cites this paper
Federated Learning with Dynamics-aware Loss for Label Noise
2026, cites this paper
Reliable Mislabel Detection for Video Capsule Endoscopy Data
2026, cites this paper
Hierarchical vision-language model with comprehensive language description for video anomaly detection
2026, cites this paper
An Empirical Study on Noisy Data and LLM Pretraining Loss Divergence
2026, cites this paper
PRISM: Personalized Recommendation via Information Synergy Module
2026, cites this paper
TA-TransUNet: An Improved Deep Learning Network Model for Water Body Extraction From Remote Sensing Images
2026, cites this paper
Class-Aware Multi-Granularity Co-Diffusion Models for Learning With Noisy Labels on Imbalanced Datasets
2026, cites this paper
Robust learning under label noise via logit-based filtering and ranking-aware relabeling
2026, cites this paper
SG-DGLF: A similarity-guided dual-graph learning framework
2026, cites this paper
HR-MM Segformer: Enhancing land use and land cover semantic segmentation through transformer-based multisource remote sensing feature fusion
2026, cites this paper
Enhancing multimodal emotion recognition with dynamic fuzzy membership and attention fusion
2026, cites this paper
Mapping seismic risk of existing highway bridges at a regional scale using Artificial Neural Networks
2026, cites this paper
Correcting Noisy Multilabel Predictions: Modeling Label Noise through Latent Space Shifts
2025, cites this paper
Exploiting rank-based filter pruning for real-time UAV tracking
2025, cites this paper
A survey on learning with noisy labels in Natural Language Processing: How to train models with label noise
2025, cites this paper
FAST: A pioneering unlearning framework integrating fine-tuning, adverse training, and student–teacher methods
2025, cites this paper
Domain-adaptive matching bridges synthetic and in vivo neural dynamics for neural circuit connectivity inference
2025, cites this paper
SyNet: A Synergistic Network for 3D Object Detection Through Geometric-Semantic-Based Multi-Interaction Fusion
2025, cites this paper
CAS-TJ: Channel attention shuffle and temporal jigsaw for audio classification
2025, cites this paper
Corrupted but Not Broken: Understanding and Mitigating the Negative Impacts of Corrupted Data in Visual Instruction Tuning
2025, cites this paper
From Isolates to Families: Using Neural Networks for Automated Language Affiliation
2025, cites this paper
FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control
2025, cites this paper
NoRD: A framework for noise-resilient self-distillation through relative supervision
2025, cites this paper
LLM-Powered Preference Elicitation in Combinatorial Assignment
2025, cites this paper
Deep Learning for Wound Tissue Segmentation: A Comprehensive Evaluation using A Novel Dataset
2025, cites this paper
Meta-Data-Guided Robust Deep Neural Network Classification with Noisy Label
2025, cites this paper
An ensemble deep learning framework for multi-class LncRNA subcellular localization with innovative encoding strategy
2025, cites this paper
Enhancing Sample Selection Against Label Noise by Cutting Mislabeled Easy Examples
2025, cites this paper
Coordinating Communication and Computing for Wireless VR in Open Radio Access Networks
2025, cites this paper
Diffusing DeBias: a Recipe for Turning a Bug into a Feature
2025, cites this paper
Multi-level and multi-scale cross attention network of wavelet packet transform for supersonic inlet unstart prediction
2025, cites this paper
A hybrid CNN-LSTM model for involuntary fall detection using wrist-worn sensors
2025, cites this paper
CASC-AI: Consensus-aware Self-corrective Learning for Noise Cell Segmentation
2025, influential citation
Enhancing Visual Reasoning With LLM-Powered Knowledge Graphs for Visual Question Localized-Answering in Robotic Surgery
2025, cites this paper
Sandbox: safeguarded multi-label learning through safe optimal transport
2025, cites this paper
A deep learning-based parametric inversion for forecasting water-filled bodies position using electromagnetic method
2025, cites this paper
Wildfire fuels mapping through artificial intelligence-based methods: A review
2025, cites this paper
Sample Selection via Contrastive Fragmentation for Noisy Label Regression
2025, cites this paper
Intent-Bert and Universal Context Encoders: A Framework for Workload and Sensor Agnostic Human Intention Prediction
2025, cites this paper
Breaking the Blindfold: Deep Learning-based Blind Side-channel Analysis
2025, cites this paper
Point-annotation supervision for robust 3D pulmonary infection segmentation by CT-based cascading deep learning
2025, cites this paper
A Comparison of Spherical Neural Networks for Surround-View Fisheye Image Semantic Segmentation
2025, cites this paper
Self-Correlation Network With Triple Contrastive Learning for Hyperspectral Image Classification With Noisy Labels
2025, cites this paper
Unraveling the Mysteries of Label Noise in Source-Free Domain Adaptation: Theory and Practice
2025, influential citation
Preference VLM: Leveraging VLMs for Scalable Preference-Based Reinforcement Learning
2025, cites this paper
A Match Made in Heaven? AI-driven Matching of Vulnerabilities and Security Unit Tests
2025, cites this paper
Enhancing whole slide image classification through label denoising in a multi-instance learning framework
2025, cites this paper
Extended Invariant Risk Minimization for Machine Fault Diagnosis With Label Noise and Data Shift
2025, cites this paper
BAN: A Universal Paradigm for Cross-Scene Classification Under Noisy Annotations From RGB and Hyperspectral Remote Sensing Images
2025, cites this paper
Evaluation of Deep Learning Techniques in PV Farm Cyber Attacks Detection
2025, cites this paper