On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation

Sebastian Bach,Alexander Binder,G. Montavon,Frederick Klauschen,K. Müller,W. Samek

Published 2015 in PLoS ONE

ABSTRACT

Understanding and interpreting classification decisions of automated image classification systems is of high value in many applications, as it allows to verify the reasoning of the system and provides additional information to the human expert. Although machine learning methods are solving very successfully a plethora of tasks, they have in most cases the disadvantage of acting as a black box, not providing any information about what made them arrive at a particular decision. This work proposes a general solution to the problem of understanding classification decisions by pixel-wise decomposition of nonlinear classifiers. We introduce a methodology that allows to visualize the contributions of single pixels to predictions for kernel-based classifiers over Bag of Words features and for multilayered neural networks. These pixel contributions can be visualized as heatmaps and are provided to a human expert who can intuitively not only verify the validity of the classification decision, but also focus further analysis on regions of potential interest. We evaluate our method for classifiers trained on PASCAL VOC 2009 images, synthetic image data containing geometric shapes, the MNIST handwritten digits data set and for the pre-trained ImageNet model available as part of the Caffe open source package.

PUBLICATION RECORD

Publication year
2015
Venue
PLoS ONE
Publication date
2015-07-10
Fields of study
Medicine, Computer Science
Identifiers
DOI 10.1371/journal.pone.0130140 PMID 26161953 PMCID 4498753
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar, PubMed

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

IEEE Computer Society
2019cited by this paper
Visual Causal Feature Learning
2014cited by this paper
Explaining and Harnessing Adversarial Examples
2014cited by this paper
Author manuscript, published in "International Journal of Computer Vision (2013)" International Journal of Computer Vision manuscript No. (will be inserted by the editor) Image Classification with the Fisher Vector: Theory and Practice
2013cited by this paper
Visualizing and Understanding Convolutional Networks
2013cited by this paper
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
2013influential reference
Intriguing properties of neural networks
2013cited by this paper
Taxonomic Prediction with Tree-Structured Covariances
2013cited by this paper
Enhanced representation and multi-task learning for image annotation
2013cited by this paper
ImageNet classification with deep convolutional neural networks
2012cited by this paper
Semantic Kernel Forests from Multiple Taxonomies
2012cited by this paper
On Taxonomies for Multi-class Image Categorization
2012cited by this paper
Visualization of Nonlinear Classification Models in Neuroimaging - Signed Sensitivity Maps
2012cited by this paper
What has my classifier learned? Visualizing the classification rules of bag-of-feature model by support region detection
2012cited by this paper
The CLEF 2011 Photo Annotation and Concept-based Retrieval Tasks
2011cited by this paper
In defense of soft-assignment coding
2011cited by this paper
The Visual Extent of an Object
2011cited by this paper
Adaptive deconvolutional networks for mid and high level feature learning
2011cited by this paper
l p -Norm Multiple Kernel Learning
2011cited by this paper
UvA-DARE ( Digital Academic Repository ) The visual extent of an object : suppose we know the object locations
2011cited by this paper
Building high-level features using large scale unsupervised learning
2011cited by this paper
Insights from Classifying Visual Concepts with Multiple Kernel Learning
2011cited by this paper
Visualization of nonlinear kernel models in neuroimaging by sensitivity maps
2011cited by this paper
Visual Interpretation of Kernel‐Based Prediction Models
2011cited by this paper
The Visual Extent of an Object
2011cited by this paper
lp-Norm Multiple Kernel Learning
2011cited by this paper
ImageCLEF, Experimental Evaluation in Visual Information Retrieval
2010cited by this paper
Evaluating Color Descriptors for Object and Scene Recognition
2010cited by this paper
Visual Word Ambiguity
2010cited by this paper
Locality-constrained Linear Coding for image classification
2010cited by this paper
Convolutional networks and applications in vision
2010cited by this paper
ImageCLEF, Experimental Evaluation in Visual Information Retrieval
2010cited by this paper
Improving the Fisher Kernel for Large-Scale Image Classification
2010influential reference
Object Recognition from Polarimetric SAR Images
2010cited by this paper
Radar Remote Sensing of Urban Areas
2010cited by this paper
How to Explain Individual Classification Decisions
2009cited by this paper
Heterogeneous feature machines for visual recognition
2009cited by this paper
Efficient and Accurate Lp-Norm Multiple Kernel Learning
2009cited by this paper
Nonlinear Learning using Local Coordinate Coding
2009cited by this paper
Multiple kernels for object detection
2009cited by this paper
Linear spatial pyramid matching using sparse coding for image classification
2009cited by this paper
Visualizing Higher-Layer Features of a Deep Network
2009cited by this paper
Why is Real-World Visual Object Recognition Hard?
2008cited by this paper
Kernel Codebooks for Scene Categorization
2008cited by this paper
Randomized Clustering Forests for Image Classification
2008cited by this paper
The PASCAL Visual Object Classes Challenge
2006influential reference
Self-supervised Monocular Road Detection in Desert Terrain
2006cited by this paper
The Pascal Visual Object Classes Challenge 2006 ( VOC 2006 ) Results
2006influential reference
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study
2006cited by this paper
The mnist database of handwritten digits
2005influential reference
A Bayesian hierarchical model for learning natural scene categories
2005cited by this paper
Distinctive Image Features from Scale-Invariant Keypoints
2004cited by this paper
An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data
2004cited by this paper
Multiple kernel learning, conic duality, and the SMO algorithm
2004cited by this paper
Review and comparison of methods to study the contribution of variables in artificial neural network models
2003cited by this paper
Visual categorization with bags of keypoints
2002cited by this paper
Neural Networks: Tricks of the Trade
2002cited by this paper
Learning the Kernel Matrix with Semidefinite Programming
2002cited by this paper
Rapid object detection using a boosted cascade of simple features
2001cited by this paper
Understanding Neural Networks via Rule Extraction
1995cited by this paper
Classification of cervical cell nuclei using morphological segmentation and textural feature extraction
1994cited by this paper
Neural Networks for Pattern Recognition
1993cited by this paper
Learning representations by back-propagating errors
1986cited by this paper
Learning representations by back-propagation errors, nature
1986cited by this paper

CITED BY

Explainable AI: Context-aware layer-wise integrated gradients for explaining transformer models
2026influential citation
Action Shapley: A Training Data Selection Metric for World Model in Reinforcement Learning
2026cites this paper
Alzheimer’s disease prediction via an explainable CNN using genetic algorithm and SHAP values
2026cites this paper
Competency Bank-Based Underwater Image Enhancement Framework
2026cites this paper
Machine Learning in Epidemiology
2026cites this paper
Feature-Aware Test Generation for Deep Learning Models
2026cites this paper
A neuron-level interpretation of reservoir computing by its perturbation-based memory capacity
2026cites this paper
AI-powered Biomedical Imaging: Recent Achievements and Challenges
2026cites this paper
Stylized Explanations: Enhancing Neural Networks Visual Interpretations Using Neural Style Transfer
2026cites this paper
AI-driven discovery and engineering of human endogenous nanocage proteins for mRNA delivery
2026influential citation
What Helps---and What Hurts: Bidirectional Explanations for Vision Transformers
2026cites this paper
Refining explainability in chest X-ray diagnostics with lesion-aware hybrid transformer and local similarity of integrated re-normalized attention map
2026cites this paper
Investigating the Utility of Explainable Artificial Intelligence for Neuroimaging‐Based Dementia Diagnosis and Prognosis
2026cites this paper
Towards Visually Explaining Statistical Tests with Applications in Biomedical Imaging
2026cites this paper
Time Series-Based Explainable Model for Lithium-Ion Battery State of Health Prediction
2026cites this paper
SeaTraNet: A local-global feature fusion network for abnormal behavior recognition of single trawler
2026cites this paper
Mechanistic Interpretability of ReLU Neural Networks Through Piecewise-Affine Mapping
2026cites this paper
Adversarial Evasion Attacks on Computer Vision using SHAP Values
2026cites this paper
Bi-Attention HateXplain : Taking into account the sequential aspect of data during explainability in a multi-task context
2026cites this paper
Towards Transparent Time Series Analysis: Exploring Methods and Enhancing Interpretability
2026cites this paper
DAVE: Distribution-aware Attribution via ViT Gradient Decomposition
2026cites this paper
XSPLAIN: XAI-enabling Splat-based Prototype Learning for Attribute-aware INterpretability
2026cites this paper
Fair feature attribution for multi-output prediction: a Shapley-based perspective
2026cites this paper
ROKA: Robust Knowledge Unlearning against Adversaries
2026cites this paper
EvoDropX: Evolutionary Optimization of Feature Corruption Sequences for Faithful Explanations of Transformer Models
2026cites this paper
Multimodal artificial intelligence for enhanced skin cancer diagnosis and prognosis.
2026cites this paper
The effect of whitening on explanation performance
2026cites this paper
Multimodal fusion and explainability of artificial intelligence models in Alzheimer’s Disease detection
2026cites this paper
Memory Retrieval in Transformers: Insights from The Encoding Specificity Principle
2026cites this paper
[From black box to white box - The limits of transparency in artificial intelligence IA used in healthcare].
2026cites this paper
From black-box to white-box: Interpretable deep reinforcement learning with Kolmogorov-Arnold networks for autonomous driving
2026cites this paper
A contribution by gradient explainability method for 1D-CNNs on ultrasonic data
2026cites this paper
NDE 4.0: The confluence of cutting-edge nondestructive inspection practices, data fusion techniques, artificial intelligence, and cyber-physical systems for effective evaluation of materials and structures
2026cites this paper
Adaptive Layer-Wise Personalized Federated Deep Reinforcement Learning for Heterogeneous Edge Caching
2026cites this paper
Explainable artificial intelligence (XAI) in medical imaging: a systematic review of techniques, applications, and challenges
2026cites this paper
TradePool: A Novel Interpretable Framework for Quantifying Atomic Attribution Values in Molecular Property Prediction.
2026cites this paper
Transformer Is Inherently a Causal Learner
2026cites this paper
Hidden Monotonicity: Explaining Deep Neural Networks via their DC Decomposition
2026influential citation
Component-wise independent adaptive learning and local optimization for long-term forecasting
2026cites this paper
Learning to Explain: Supervised Token Attribution from Transformer Attention Patterns
2026influential citation
A Monosemantic Attribution Framework for Stable Interpretability in Clinical Neuroscience Large Language Models
2026cites this paper
The Confusion is Real: GRAPHIC - A Network Science Approach to Confusion Matrices in Deep Learning
2026cites this paper
Hybrid vision-language models for improved transparency in healthcare processes: The retinal diagnosis use case
2026cites this paper
Auditing Sybil: Explaining Deep Lung Cancer Risk Prediction Through Generative Interventional Attributions
2026cites this paper
Exploring SAIG Methods for an Objective Evaluation of XAI
2026cites this paper
Statistical Inference and Learning for Shapley Additive Explanations (SHAP)
2026cites this paper
The value of artificial intelligence combined with multimodal data analysis in tumor immunotherapy and targeted therapy.
2026cites this paper
Sparseness-Optimized Feature Importance for Time Series Classification
2026cites this paper
Visualization methods for explainable medical imaging diagnosis: A survey
2026cites this paper
Explaining explainability: A comprehensive survey on explainable artificial intelligence and relevant industry applications
2026cites this paper
Enhancing Physics-Informed Neural Networks with Domain-aware Fourier Features: Towards Improved Performance and Interpretable Results
2026cites this paper
Axiomatic On-Manifold Shapley via Optimal Generative Flows
2026cites this paper
Explainable Artificial Intelligence (XAI) for EEG Analysis: A Survey on Recent Trends and Advancements
2026cites this paper
A Survey on Explainable AI for Semantic Communication: Architecture, Challenges, and Future Opportunities
2026cites this paper
X-SYS: A Reference Architecture for Interactive Explanation Systems
2026cites this paper
Feature salience -- not task-informativeness -- drives machine learning model explanations
2026influential citation
ShapBPT: Image Feature Attributions Using Data-Aware Binary Partition Trees
2026cites this paper
Explainable Artificial Intelligence for Deepfake Detection: Pipeline, Open Source and Comparisons
2026cites this paper
Uncertainty explanation of artificial intelligence models by SHAP
2026cites this paper
Large-Scale Deep Learning-Based Hourly Storm Surge Modeling: Application Across the U.S. Gulf and East Coasts
2026cites this paper
Saturation and pressure change estimation from time-lapse seismic data using vision transformers
2026cites this paper
TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors
2026cites this paper
A Tutorial on Data-Driven Quality-of-Experience Modeling With Explainable Artificial Intelligence
2026cites this paper
Explainable Artificial Intelligence Enhance Image Semantic Communication System in 6G-IoT
2026cites this paper
Clarity in DEM cementation predictions: Integrating automated machine learning and interpretability analysis of shear wave velocity
2026cites this paper
LogHGAD: A hypergraph-based log anomaly detection method for AIoT systems
2026cites this paper
Review of fault diagnosis for rotating machinery: Prior knowledge integration in data-driven methods benefits model interpretability and generalizability
2026cites this paper
Explainable multimodal brain imaging through a multiple-branch neural network
2026cites this paper
SynthraXCoreNet: An Interpretable, Well-Calibrated Six-CNN Ensemble for Dermoscopic Skin-Lesion Classification
2026cites this paper
AI in soil moisture remote sensing
2026cites this paper
Explore the Ideology of Deep Learning in ENSO Forecasts
2026cites this paper
A visualization-driven decision support system for selecting feature attribution methods
2026cites this paper
Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph
2026cites this paper
Counterfactual Explanation-Based Cryptocurrency Price Prediction
2026cites this paper
Uncovering spatial process heterogeneity from graph-based deep spatial regression
2026cites this paper
Distilling Lightweight Domain Experts from Large ML Models by Identifying Relevant Subspaces
2026cites this paper
Hybrid Method to Explain Predictions of Stacking Ensemble Model
2026cites this paper
SCALPEL: Selective Capability Ablation via Low-rank Parameter Editing for Large Language Model Interpretability Analysis
2026cites this paper
Spatial Sensitive Grad-CAM++: Towards High-Quality Visual Explanations for Object Detectors via Weighted Combination of Gradient Maps
2026cites this paper
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
2025influential citation
Disentangling Visual Transformers: Patch-Level Interpretability for Image Classification
2025cites this paper
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
2025influential citation
Explainable AI enhanced transformer based UNet for medical images segmentation using gradient weighted class activation map
2025cites this paper
Explainability and uncertainty: Two sides of the same coin for enhancing the interpretability of deep learning models in healthcare
2025cites this paper
Explainable paroxysmal atrial fibrillation diagnosis using an artificial intelligence-enabled electrocardiogram
2025cites this paper
NeurFlow: Interpreting Neural Networks through Neuron Groups and Functional Interactions
2025cites this paper
Response coupling with an auxiliary neural signal for enhancing brain signal detection
2025cites this paper
QRLaXAI: quantum representation learning and explainable AI
2025cites this paper
XDATE: eXplainable Deep belief network-based Auto-encoder with exTended Garson Algorithm
2025cites this paper
CNN Interpretability with Multivector Tucker Saliency Maps for Self-Supervised Models
2025cites this paper
Transforming Architectural Digitisation: Advancements in AI-Driven 3D Reality-Based Modelling
2025cites this paper
Interpretable Text Embeddings and Text Similarity Explanation: A Survey
2025cites this paper
Fidex and FidexGlo: From Local Explanations to Global Explanations of Deep Models
2025cites this paper
Current methods in explainable artificial intelligence and future prospects for integrative physiology
2025cites this paper
AI-Assisted Decision Making with Human Learning
2025cites this paper
Gradient Co-occurrence Analysis for Detecting Unsafe Prompts in Large Language Models
2025cites this paper
B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability
2025cites this paper
The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Safety Analysis
2025influential citation
DANI-NET: A Physics-Aware Deep Learning Framework for Change Detection Using Repeat-Pass InSAR
2025cites this paper
Evaluating the interpretability of prototype networks for medical image analysis
2025cites this paper