Interpretable Explanations of Black Boxes by Meaningful Perturbation

Published 2017 in IEEE International Conference on Computer Vision

ABSTRACT

As machine learning algorithms are increasingly applied to high impact yet high risk tasks, such as medical diagnosis or autonomous driving, it is critical that researchers can explain how such algorithms arrived at their predictions. In recent years, a number of image saliency methods have been developed to summarize where highly complex neural networks “look” in an image for evidence for their predictions. However, these techniques are limited by their heuristic nature and architectural constraints. In this paper, we make two main contributions: First, we propose a general framework for learning different kinds of explanations for any black box algorithm. Second, we specialise the framework to find the part of an image most responsible for a classifier decision. Unlike previous works, our method is model-agnostic and testable because it is grounded in explicit and interpretable image perturbations.

PUBLICATION RECORD

Publication year
2017
Venue
IEEE International Conference on Computer Vision
Publication date
2017-04-11
Fields of study
Mathematics, Computer Science
Identifiers
DOI 10.1109/ICCV.2017.371 arXiv 1704.03296
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Adversarial examples in the physical world
2016influential reference
I Have Seen Enough: Transferring Parts Across Categories
2016cited by this paper
A model explanation system
2016cited by this paper
“Why Should I Trust You?”: Explaining the Predictions of Any Classifier
2016influential reference
Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization
2016cited by this paper
Top-Down Neural Attention by Excitation Backprop
2016influential reference
Salient Deconvolutional Networks
2016influential reference
Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization
2016cited by this paper
Visualizing Deep Convolutional Neural Networks Using Natural Pre-images
2015cited by this paper
On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
2015cited by this paper
Learning Deep Features for Discriminative Localization
2015influential reference
Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks
2015cited by this paper
Microsoft COCO: Common Objects in Context
2014influential reference
Going deeper with convolutions
2014influential reference
Deep neural networks are easily fooled: High confidence predictions for unrecognizable images
2014cited by this paper
ImageNet Large Scale Visual Recognition Challenge
2014cited by this paper
Adam: A Method for Stochastic Optimization
2014cited by this paper
Understanding deep image representations by inverting them
2014cited by this paper
Object Detectors Emerge in Deep Scene CNNs
2014cited by this paper
Striving for Simplicity: The All Convolutional Net
2014influential reference
Visualizing and Understanding Convolutional Networks
2013influential reference
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
2013influential reference
ImageNet classification with deep convolutional neural networks
2012cited by this paper

CITED BY

A Lightweight Hybrid Encoder-Decoder Framework for Multiple Degree of Freedom Muscle Force Estimation
2026cites this paper
Towards Benchmarking AI Explainability
2026cites this paper
A Framework for Evaluating Faithfulness in Explainable AI for Machine Anomalous Sound Detection Using Frequency-Band Perturbation
2026cites this paper
Local-to-Global Logical Explanations for Deep Vision Models
2026cites this paper
Identifying Good and Bad Neurons for Task-Level Controllable LLMs
2026cites this paper
Sparseness-Optimized Feature Importance for Time Series Classification
2026cites this paper
Multimodal interpretable image recognition network via language-guided global-local collaboratively alignment
2026cites this paper
Sparseness-optimized feature importance with prior knowledge and reinforcement learning-powered optimization
2026cites this paper
Exploring transparency in pathological image analysis: A comprehensive review of explainable artificial intelligence (XAI) techniques
2026cites this paper
Evaluating the Ability of Explanations to Disambiguate Models in a Rashomon Set
2026cites this paper
DD-CAM: Minimal Sufficient Explanations for Vision Models Using Delta Debugging
2026cites this paper
Bi-Orthogonal Factor Decomposition for Vision Transformers
2026cites this paper
PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs
2026cites this paper
Memory Retrieval in Transformers: Insights from The Encoding Specificity Principle
2026cites this paper
A Vision-Based Explainable Deep Learning Approach for Multi-Class Drone Detection and Recognition
2026cites this paper
Follow the Forest Trail: Distillation by Gradient Boosting Models to Enhance Symbolic Regression Performance
2026cites this paper
Infinite Self-Attention
2026cites this paper
Directional Reasoning Trajectory Change (DRTC): Identifying Critical Trace Segments in Reasoning Models
2026cites this paper
Hidden Monotonicity: Explaining Deep Neural Networks via their DC Decomposition
2026cites this paper
Explore the Ideology of Deep Learning in ENSO Forecasts
2026cites this paper
Concept-Based Explanation for Deep Vision Models: A Comprehensive Survey on Techniques, Taxonomy, Applications, and Recent Advances
2026cites this paper
Explainable AI – Based Study of the Interactions between Remote Sensing and Ground-Truth Climate Variables and Lake Chad’s Level Fluctuations
2026cites this paper
Learning to Seek Evidence: A Verifiable Reasoning Agent with Causal Faithfulness Analysis
2025cites this paper
Clarifying the Opacity of Neural Networks
2025cites this paper
An accurate pixel-Level explainable approach for CNNs and its application
2025cites this paper
Generating Part-Based Global Explanations Via Correspondence
2025cites this paper
Cross-scale soil moisture content monitoring of winter wheat by integrating UAV and sentinel-1/2 data
2025cites this paper
Photorealistic Inpainting for Perturbation-based Explanations in Ecological Monitoring
2025cites this paper
Fake News and Offensive Content Detection in Malayalam Using Machine Learning, Deep Learning, and Transformer Based Methods With XAI
2025cites this paper
Extremal Contours: Gradient-driven contours for compact visual attribution
2025cites this paper
Value bounds and Convergence Analysis for Averages of LRP attributions
2025cites this paper
Looking in the mirror: A faithful counterfactual explanation method for interpreting deep image classification models
2025cites this paper
Explainable AI Does not Provide Reason Explanations
2025cites this paper
A comprehensive analysis of perturbation methods in explainable AI feature attribution validation for neural time series classifiers
2025cites this paper
On the notion of missingness for path attribution explainability methods in medical settings: Guiding the selection of medically meaningful baselines
2025cites this paper
Priority Guided Explanation for Knowledge Tracing with Dual Ranking and Similarity Consistency
2025cites this paper
FXG Score-CAM: Comprehensive Score-Weighted Visual Explanations for Convolutional Neural Networks
2025cites this paper
Pixel-level Certified Explanations via Randomized Smoothing
2025cites this paper
Explaining Large Language Models with gSMILE
2025cites this paper
A comprehensive survey of imputation methods in medical missing data analysis
2025cites this paper
TELL-ME: Toward Personalized Explanations of Large Language Models
2025cites this paper
Enhancing Bottleneck Concept Learning in Image Classification
2025cites this paper
Explainsegnet: Interpretable Segmentation for Alzheimer's Diagnosis
2025cites this paper
Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
2025cites this paper
A frequency mask and decoupling max-logit based XAI method to explain DNN for fault diagnosis
2025cites this paper
TriGuard: Testing Model Safety with Attribution Entropy, Verification, and Drift
2025cites this paper
CProtoNet: A conceptual prototype network based on conceptual similarity
2025cites this paper
ScoreCAM++: Gated Score-Weighted Visual Explanations for CNNs
2025cites this paper
Interpreting convolutional neural network explainability for head-and-neck cancer radiotherapy organ-at-risk segmentation
2025cites this paper
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
2025cites this paper
Beyond Output Faithfulness: Learning Attributions that Preserve Computational Pathways
2025cites this paper
Database Views as Explanations for Relational Deep Learning
2025influential citation
Transparency of medical artificial intelligence systems
2025cites this paper
Towards Explainable Image Classification
2025cites this paper
Deep graph neural networks for spatiotemporal forecasting of sub-seasonal sea ice: A case study in Hudson Bay
2025cites this paper
xAI-CV: An Overview of Explainable Artificial Intelligence in Computer Vision
2025cites this paper
Explaining and interpreting hyperdimensional computing classifiers on tabular data
2025cites this paper
Explainable Artificial Intelligence in Drug Discovery: Bridging Predictive Power and Mechanistic Insight
2025cites this paper
FACE: Faithful Automatic Concept Extraction
2025cites this paper
Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations
2025cites this paper
Malware Detection with AI: A Comprehensive Review of Trends and Challenges with Future Directions
2025cites this paper
Explainable AI for Clinical Decision Support Systems: Literature Review, Key Gaps, and Research Synthesis
2025cites this paper
Improving local interpretable classifier explanations exploiting self-generated semantic features
2025cites this paper
State-of-the-Art in Responsible, Explainable, and Fair AI for Medical Image Analysis
2025cites this paper
DDL: Effective and Comprehensible Interpretation Framework for Diverse Deepfake Detectors
2025cites this paper
Stealthy Query-Efficient OpaqueAttack Against Interpretable Deep Learning
2025cites this paper
PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications
2025cites this paper
Understanding the Black Box: A Deep Empirical Dive into Shapley Value Approximations for Tabular Data
2025cites this paper
Pruning the Paradox: How CLIP's Most Informative Heads Enhance Performance While Amplifying Bias
2025cites this paper
XMutant: XAI-based Fuzzing for Deep Learning Systems
2025cites this paper
Interpretable Novel Target Discovery through Open-Set Domain Adaptation
2025cites this paper
Walking the Web of Concept-Class Relationships in Incrementally Trained Interpretable Models
2025cites this paper
Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP
2025cites this paper
Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence
2025cites this paper
Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients
2025cites this paper
Attention, Please! PixelSHAP Reveals What Vision-Language Models Actually Focus On
2025cites this paper
Tangentially Aligned Integrated Gradients for User-Friendly Explanations
2025cites this paper
Transparency in AI for emergency management: building trust and accountability
2025cites this paper
Evaluating the Impact of AI-Generated Visual Explanations on Decision-Making for Image Matching
2025cites this paper
Delta Marches to autonomously learn histopathology rules by generative latent space traversals
2025cites this paper
Rethinking Transferable Adversarial Attacks With Double Adversarial Neuron Attribution
2025cites this paper
Anomaly Detection Using Computer Vision: A Comparative Analysis of Class Distinction and Performance Metrics
2025cites this paper
Interactivity x Explainability: Toward Understanding How Interactivity Can Improve Computer Vision Explanations
2025cites this paper
A Meaningful Perturbation Metric for Evaluating Explainability Methods
2025influential citation
POMELO: Black-Box Feature Attribution with Full-Input, In-Distribution Perturbations
2025cites this paper
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition
2025cites this paper
Privacy Risks and Preservation Methods in Explainable Artificial Intelligence: A Scoping Review
2025cites this paper
Explainable AI the Latest Advancements and New Trends
2025cites this paper
Out-of-Distribution Detection via Channelwise Feature Aggregation in Neural Network-Based Receivers
2025cites this paper
A Functionally-Grounded Benchmark Framework for XAI Methods: Insights and Foundations from a Systematic Literature Review
2025cites this paper
Efficient Preimage Approximation for Neural Network Certification
2025cites this paper
Do Protein Transformers Have Biological Intelligence?
2025cites this paper
Why Do Class-Dependent Evaluation Effects Occur with Time Series Feature Attributions? A Synthetic Data Investigation
2025cites this paper
Rethinking Explainability in the Era of Multimodal AI
2025cites this paper
Interpretable Learning Method Based on Causal Interactive Attention
2025cites this paper
Advancements in deep learning and explainable artificial intelligence for enhanced medical image analysis: A comprehensive survey and future directions
2025cites this paper
A Comprehensive Review of Explainable Artificial Intelligence (XAI) in Computer Vision
2025cites this paper
Stochastic Parameter Decomposition
2025cites this paper
Concept-Based Mechanistic Interpretability Using Structured Knowledge Graphs
2025cites this paper
Class-Dependent Perturbation Effects in Evaluating Time Series Attributions
2025cites this paper