Visualizing and Understanding Convolutional Networks

Published 2013 in European Conference on Computer Vision

ABSTRACT

Large Convolutional Network models have recently demonstrated impressive classification performance on the ImageNet benchmark (Krizhevsky et al. [18]). However, there is no clear understanding of why they perform so well, or how they might be improved. In this paper we explore both issues. We introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier. Used in a diagnostic role, these visualizations allow us to find model architectures that outperform Krizhevsky et al. on the ImageNet classification benchmark. We also perform an ablation study to discover the performance contribution from different model layers. We show our ImageNet model generalizes well to other datasets: when the softmax classifier is retrained, it convincingly beats the current state-of-the-art results on the Caltech-101 and Caltech-256 datasets.

PUBLICATION RECORD

Publication year: 2013
Venue: European Conference on Computer Vision
Publication date: 2013-11-12
Fields of study: Computer Science
Identifiers: DOI 10.1007/978-3-319-10590-1_53; arXiv 1311.2901
Source metadata: Semantic Scholar

REFERENCES

Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks
2014, cited by this paper
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition
2013, cited by this paper
Multipath Sparse Coding Using Hierarchical Matching Pursuit
2013, influential reference
Improving Histograms of Oriented Gradients for Pedestrian Detection
2013, cited by this paper
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
2013, cited by this paper
Some Improvements on Deep Convolutional Neural Network Based Image Classification
2013, cited by this paper
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
2013, cited by this paper
Improving neural networks by preventing co-adaptation of feature detectors
2012, influential reference
Multi-column deep neural networks for image classification
2012, cited by this paper
ImageNet classification with deep convolutional neural networks
2012, cited by this paper
Multi-column deep neural network for traffic sign classification
2012, cited by this paper
Adaptive deconvolutional networks for mid and high level feature learning
2011, influential reference
Efficient learning of sparse, distributed, convolutional feature representations for object recognition
2011, cited by this paper
Unbiased look at dataset bias
2011, cited by this paper
Tiled convolutional neural networks
2010, cited by this paper
Linear spatial pyramid matching using sparse coding for image classification
2009, cited by this paper
Visualizing Higher-Layer Features of a Deep Network
2009, cited by this paper
Co-occurrence Histograms of Oriented Gradients for Pedestrian Detection
2009, cited by this paper
ImageNet: A large-scale hierarchical image database
2009, cited by this paper
What is the best multi-stage architecture for object recognition?
2009, cited by this paper
Extracting and composing robust features with denoising autoencoders
2008, cited by this paper
Greedy Layer-Wise Training of Deep Networks
2006, cited by this paper
On the Analysis and Interpretation of Inhomogeneous Quadratic Forms as Receptive Fields
2006, cited by this paper
A Fast Learning Algorithm for Deep Belief Nets
2006, cited by this paper
One-shot learning of object categories
2006, influential reference
Backpropagation Applied to Handwritten Zip Code Recognition
1989, influential reference

CITED BY

HP-GAN: Harnessing pretrained networks for GAN improvement with FakeTwins and discriminator consistency.
2026, cites this paper
Ultrafast laser filamentation classification and analysis via neural networks
2026, cites this paper
Convolutional neural networks and volcano plots for screening and predicting two-dimensional single-atom catalysts in CO2 reduction
2026, cites this paper
Parameter efficient vs full fine-tuning for building children’s myopia prediction models
2026, cites this paper
GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs
2026, cites this paper
A packer identification method based on section-entropy plot
2026, cites this paper
DA-GAN: Dual-Attention GAN for Underwater Image Enhancement With Contrast and Color Correction
2026, cites this paper
MM-Net: Facial Expression Recognition Based on Multi-level and Multi-scale Attention Mechanisms
2026, cites this paper
TUL-IB: Enhancing Explainability in Trajectory User Linking with Information Bottleneck
2026, cites this paper
Data poisoning-based backdoor attacks against supervised learning rules of Spiking Neural Networks
2026, cites this paper
Learning a Generative Meta-Model of LLM Activations
2026, cites this paper
Exploring SAIG Methods for an Objective Evaluation of XAI
2026, cites this paper
Kill it with FIRE: On Leveraging Latent Space Directions for Runtime Backdoor Mitigation in Deep Neural Networks
2026, cites this paper
Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models
2026, cites this paper
Modularity is the Bedrock of Natural and Artificial Intelligence
2026, influential citation
RyNet: Multi-Level Attention for Rich and Complementary Feature Representation in CNNs
2026, cites this paper
Learning to Reason: Temporal Saliency Distillation for Interpretable Knowledge Transfer
2026, cites this paper
Deep Learning-Based Skin Care Detection with Multi-method Explainability: Grad-CAM, Lime, and Occlusion Sensitivity
2026, cites this paper
Enhancing deep learning interpretability for hand-crafted feature-guided histologic image classification via weak-to-strong generalization
2026, cites this paper
ProToken: Token-Level Attribution for Federated Large Language Models
2026, cites this paper
Similarity of Processing Steps in Vision Model Representations
2026, cites this paper
Explainable AI-Driven Quality and Condition Monitoring in Smart Manufacturing
2026, cites this paper
Spatial-frequency domain-aware network for hyperspectral multiclass change detection with subpixel guidance
2026, cites this paper
Facial expression recognition for emotional state identification using deep convolutional neural network
2026, cites this paper
Halt the Hallucination: Decoupling Signal and Semantic OOD Detection Based on Cascaded Early Rejection
2026, cites this paper
MUFASA: A Multi-Layer Framework for Slot Attention
2026, cites this paper
Feature salience -- not task-informativeness -- drives machine learning model explanations
2026, influential citation
Information Abstraction for Data Transmission Networks based on Large Language Models
2026, cites this paper
Tailoring Patient-Specific Cranial Implants for Bone Reconstruction via End-to-End Deep Learning Image-to-Print Approach
2026, cites this paper
Early-warning the compact-to-dendritic transition via spatiotemporal learning of two-dimensional growth images
2026, cites this paper
Deep Learning-Based Precoder Design for Network Massive MIMO Transmission
2026, cites this paper
Time–Frequency characterization of microearthquakes based on Convolutional Neural Networks and explainability models
2026, cites this paper
Novel decoupling algorithm based on transfer learning for multi-axis force sensor
2026, cites this paper
Developing fully convolutional networks with permittivity-based class mapping for tunnel lining defects detection in ground penetration radar scan data
2026, cites this paper
Local Layer-wise Differential Privacy in Federated Learning
2026, cites this paper
Parallelizing Node-Level Explainability in Graph Neural Networks
2026, cites this paper
A Bio-Inspired Method for Investigating and Boosting the Performance of Deep Neural Networks
2026, cites this paper
Insights on the Working Principles of a CNN for Forest Height Regression From Single-Pass InSAR Data
2026, cites this paper
YOLO11-WLBS: an efficient model for pavement defect detection
2026, cites this paper
Advanced Training Algorithms in Sigma-Delta Spiking YOLO for Energy-Efficient Object Detection on Neuromorphic Hardware
2026, cites this paper
Shapley estimated explanation: A fast post-hoc attribution method for interpreting intelligent mechanical fault diagnosis
2026, cites this paper
Fluxamba: Topology-Aware Anisotropic State Space Models for Geological Lineament Segmentation in Multi-Source Remote Sensing
2026, cites this paper
Multi-scale feature fusion for cross-modality person re-identification: the MSJLNet approach
2026, cites this paper
Multi-level functional network-based PD identification via graph deep learning
2026, cites this paper
Distributionally Robust Classification for Multi-source Unsupervised Domain Adaptation
2026, cites this paper
Interpretable Detector Secure Against Stealthy False Power Consumption Attacks
2026, cites this paper
Hybrid vision-language models for improved transparency in healthcare processes: The retinal diagnosis use case
2026, cites this paper
Investigating the Robustness of Subtask Distillation under Spurious Correlation
2026, cites this paper
A dimensional structure based knowledge distillation method for cross-modal learning.
2026, cites this paper
Review of CNN-Based Approaches for Preprocessing, Segmentation and Classification of Knee Osteoarthritis
2026, cites this paper
Cross-Modal Redundancy and the Geometry of Vision-Language Embeddings
2026, cites this paper
Efficient-LVSM: Faster, Cheaper, and Better Large View Synthesis Model via Decoupled Co-Refinement Attention
2026, cites this paper
Surveillance Facial Image Quality Assessment: A Multi-dimensional Dataset and Lightweight Model
2026, cites this paper
An interpretable machine learning framework with data-informed imaging biomarkers for diagnosis and prediction of Alzheimer's disease.
2026, cites this paper
Quantifying Explanation Quality in Graph Neural Networks using Out-of-Distribution Generalization
2026, cites this paper
The effect of whitening on explanation performance
2026, cites this paper
Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization
2026, cites this paper
LARV: Data-Free Layer-wise Adaptive Rescaling Veneer for Model Merging
2026, cites this paper
Competency Bank-Based Underwater Image Enhancement Framework
2026, cites this paper
Sample-Free Safety Assessment of Neural Network Controllers via Taylor Methods
2026, cites this paper
MATdiff: Mask-aware transformer with diffusion model for large-mask image inpainting
2026, cites this paper
Explainable AI: Context-aware layer-wise integrated gradients for explaining transformer models
2026, cites this paper
DOI: A Systematic Framework for Incremental Identification of Specific Emitters in Open-Set Scenarios
2026, cites this paper
Deep Learning‐Based Tracking of Subduction Zones in Mantle Convection Models
2026, cites this paper
Regression in Earth Observation: Are vision–language models up to the challenge?
2026, cites this paper
Generative AI-Driven Metaverse: The Promises and Challenges of AI-Generated Content
2026, cites this paper
Adaptive correlation learning for cross-modal hashing
2026, cites this paper
High precision classification method for black tea: Deep learning combined with two-dimensional correlation spectroscopy
2026, cites this paper
DCDLNet: A label-noise tolerant classification algorithm for polsar images based on dual-band consistency and difference
2026, cites this paper
Spatiotemporal Attention With Conditional Feature Modulation for Satellite-Based Solar Irradiance Prediction
2026, cites this paper
Learning from Historical Activations in Graph Neural Networks
2026, cites this paper
Explore the Ideology of Deep Learning in ENSO Forecasts
2026, cites this paper
Do LLMs Encode Functional Importance of Reasoning Tokens?
2026, influential citation
Identifying Good and Bad Neurons for Task-Level Controllable LLMs
2026, cites this paper
Bi-Orthogonal Factor Decomposition for Vision Transformers
2026, cites this paper
Hidden Monotonicity: Explaining Deep Neural Networks via their DC Decomposition
2026, cites this paper
Beyond the final layer: Attentive multilayer fusion for vision transformers
2026, cites this paper
Adaptive Label Error Detection: A Bayesian Approach to Mislabeled Data Detection
2026, cites this paper
CEREM: A segment-wise attention network for chinese highly aggregated semantic extraction
2026, cites this paper
A hybrid approach for facial parsing using transfer learning
2026, cites this paper
A Lightweight Frozen Multi-Convolution Dual-Branch Network for Efficient sEMG-Based Gesture Recognition
2026, cites this paper
Consistent explainable image quality assessment for medical imaging
2026, cites this paper
Dualformer: Time-Frequency Dual Domain Learning for Long-term Time Series Forecasting
2026, cites this paper
Bridging the Black Box: A Survey on Mechanistic Interpretability in AI
2026, cites this paper
The reasoning-like capabilities of large language models across different languages: Insights from representational similarity analysis
2026, cites this paper
Co-Designing Digital Humans for Online Learning: A Framework for Human-AI Pedagogical Integration
2026, cites this paper
Improved Strawberry Disease Classification under Class Imbalance through In-Backbone Latent Diffusion
2026, cites this paper
A deep learning-based computational pipeline predicts developmental outcome in retinal organoids
2026, cites this paper
Parameter identification based on statistical and neural network approaches for the vegetation-water model
2026, cites this paper
FedCPP: A hybrid proactive-passive defense framework for backdoor attack mitigation in federated learning
2026, cites this paper
Do Pathology Foundation Models Encode Disease Progression? A Pseudotime Analysis of Visual Representations
2026, cites this paper
Deep Models, Shallow Alignment: Uncovering the Granularity Mismatch in Neural Decoding
2026, cites this paper
A Bayesian Backed DeepLabV3+ Customized Hybrid-Depth Framework for Brain Tumor Segmentation and Classification
2026, cites this paper
Flexible perception of face attributes under naturalistic visual constraints
2026, cites this paper
A systematic review on human action detection and classification architectures using deep learning methodology
2026, cites this paper
Metaheuristic Hyperparameter Optimization and Explainable Deep Learning for Baggage Threat Detection
2026, cites this paper
Catalyst: Out-of-Distribution Detection via Elastic Scaling
2026, cites this paper
Model Specific Task Similarity for Vision Language Model Selection via Layer Conductance
2026, cites this paper
Erase at the Core: Representation Unlearning for Machine Unlearning
2026, cites this paper
Comparative Evaluation of Inception V3 and ResNet 50 for Pneumonia Prediction
2026, cites this paper