Reconciling Feature-Reuse and Overfitting in DenseNet with Specialized Dropout

Kun Wan,Boyuan Feng,Lingwei Xie,Yufei Ding

Published 2018 in IEEE International Conference on Tools with Artificial Intelligence

ABSTRACT

Recently convolutional neural networks (CNNs) achieve great accuracy in visual recognition tasks. DenseNets become one of the most popular CNN models due to its effectiveness in the feature-reuse. However, like other CNN models, DenseNets also face the overfitting problem if not more severe. Existing dropout methods can be applied but not effective. In particular, the property of the feature-reuse in DenseNets will be impeded, and the dropout effect will be weakened by the spatial correlation inside feature maps. To address these problems, we craft the design of a specialized dropout method from three aspects, the dropout location, the dropout granularity, and the dropout probability. The insights attained here could potentially be applied as a general approach for boosting the accuracy of other CNN models with similar shortcut connections. Experimental results show that DenseNets with our specialized dropout method yield better accuracies compared to vanilla DenseNets and state-of-the-art CNN models, and such accuracy boost increases with the model depth.

PUBLICATION RECORD

Publication year
2018
Venue
IEEE International Conference on Tools with Artificial Intelligence
Publication date
2018-09-28
Fields of study
Computer Science
Identifiers
DOI 10.1109/ICTAI.2019.00110 arXiv 1810.00091
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Multi-Scale Dense Convolutional Networks for Efficient Prediction
2017cited by this paper
Multi-Scale Dense Networks for Resource Efficient Image Classification
2017cited by this paper
Residual Networks of Residual Networks: Multilevel Residual Networks
2016cited by this paper
Swapout: Learning an ensemble of deep architectures
2016cited by this paper
Identity Mappings in Deep Residual Networks
2016cited by this paper
Deep Networks with Stochastic Depth
2016cited by this paper
Densely Connected Convolutional Networks
2016cited by this paper
Wide Residual Networks
2016cited by this paper
Deep Residual Learning for Image Recognition
2015influential reference
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
2015cited by this paper
Training Very Deep Networks
2015cited by this paper
Dropout: a simple way to prevent neural networks from overfitting
2014influential reference
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014cited by this paper
Deeply-Supervised Nets
2014cited by this paper
FitNets: Hints for Thin Deep Nets
2014cited by this paper
Going deeper with convolutions
2014cited by this paper
Efficient object localization using Convolutional Networks
2014cited by this paper
Regularization of Neural Networks using DropConnect
2013cited by this paper
Improving neural networks by preventing co-adaptation of feature detectors
2012cited by this paper
ImageNet classification with deep convolutional neural networks
2012cited by this paper
Deep Sparse Rectifier Neural Networks
2011cited by this paper
Understanding the difficulty of training deep feedforward neural networks
2010cited by this paper
On the power of small-depth threshold circuits
1990cited by this paper
Computational limitations of small-depth circuits
1987cited by this paper

CITED BY

Balling levels detection in laser powder bed fusion using nonlinear genetic algorithm optimized convolutional neural network
2025cites this paper
Multi-label Image Classification for Ocular Disease Diagnosis Using K-fold Cross-Validation on the ODIR-5K Dataset
2024cites this paper
Autonomous Vision-Guided Two-Arm Collaborative Microassembly Using Learned Manipulation Model
2024cites this paper
Artificial intelligence and dental age estimation: development and validation of an automated stage allocation technique on all mandibular tooth types in panoramic radiographs
2024cites this paper
Semi-supervised lung adenocarcinoma histopathology image classification based on multi-teacher knowledge distillation
2024cites this paper
A Deep Learning-based Grasp Pose Estimation Approach for Large-Size Deformable Objects in Clutter
2024cites this paper
Enhancing signal-to-noise ratio in real-time LED-based photoacoustic imaging: A comparative study of CNN-based deep learning architectures
2024cites this paper
Automatic hip osteoarthritis grading with uncertainty estimation from computed tomography using digitally-reconstructed radiographs
2023cites this paper
Eye Know You Too: A DenseNet Architecture for End-to-end Eye Movement Biometrics
2022cites this paper
Eye Know You Too: Toward Viable End-to-End Eye Movement Biometrics for User Authentication
2022cites this paper
Sparsely Connected DenseNet for Malaria Parasite Detection
2021cites this paper
Detecting abnormalities in X-Ray images using Neural Networks
2020influential citation