An unexpected unity among methods for interpreting model predictions

Published 2016 in arXiv.org

ABSTRACT

Understanding why a model made a certain prediction is crucial in many data science fields. Interpretable predictions engender appropriate trust and provide insight into how the model may be improved. However, with large modern datasets the best accuracy is often achieved by complex models even experts struggle to interpret, which creates a tension between accuracy and interpretability. Recently, several methods have been proposed for interpreting predictions from complex models by estimating the importance of input features. Here, we present how a model-agnostic additive representation of the importance of input features unifies current methods. This representation is optimal, in the sense that it is the only set of additive values that satisfies important properties. We show how we can leverage these properties to create novel visual explanations of model predictions. The thread of unity that this representation weaves through the literature indicates that there are common principles to be learned about the interpretation of model predictions that apply in many scenarios.

PUBLICATION RECORD

Publication year
2016
Venue
arXiv.org
Publication date
2016-11-22
Fields of study
Mathematics, Computer Science
Identifiers
arXiv 1611.07478
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

Not Just a Black Box: Learning Important Features Through Propagating Activation Differences
2016cited by this paper
“Why Should I Trust You?”: Explaining the Predictions of Any Classifier
2016influential reference
On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
2015cited by this paper
Explaining prediction models and individual predictions with feature contributions
2014cited by this paper
Analysis of regression in game theory approach
2001cited by this paper
The Shapley Value
1994cited by this paper
Monotonic solutions of cooperative games
1985cited by this paper

CITED BY

Adversarial Evasion Attacks on Computer Vision using SHAP Values
2026cites this paper
Detecting Fake News with XAI: A Look at SHAP for Making Machine Learning Models More Explainable
2025cites this paper
GNSS Data Mining for Train Positioning Test Case Generation
2025cites this paper
AI-Augmented LLMs Achieve Therapist-Level Responses in Motivational Interviewing
2025cites this paper
Prediction of permeability in a tight sandstone reservoir using a gated network stacking model driven by data and physical models
2024cites this paper
ShaRP: Explaining Rankings and Preferences with Shapley Values
2024cites this paper
Investigating hydrological processes using explainable deep-learning models
2024cites this paper
Explainable artificial intelligence.
2024cites this paper
Redefining pandemic preparedness: Multidisciplinary insights from the CERP modelling workshop in infectious diseases, workshop report
2024cites this paper
Model-Agnostic Interpretation Framework in Machine Learning: A Comparative Study in NBA Sports
2024cites this paper
A simple approach for local and global variable importance in nonlinear regression models
2023cites this paper
The Network Structure of Open Innovation and the Creativity in the Semiconductor Manufacturing Equipment Industry
2023cites this paper
Forecasting rare earth stock prices with machine learning
2023cites this paper
Enhancing wireline formation testing with explainable machine learning: Predicting effective and non-effective stations
2023cites this paper
"When Can I Trust It?" Contextualising Explainability Methods for Classifiers
2023cites this paper
A spectrum of explainable and interpretable machine learning approaches for genomic studies
2023cites this paper
Soil suitability classification for crop selection in precision agriculture using GBRT-based hybrid DNN surrogate models
2023cites this paper
Explaining Deep Learning Models for Credit Scoring with SHAP: A Case Study Using Open Banking Data
2023cites this paper
Multi-output machine learning models for kinetic data evaluation : A Fischer–Tropsch synthesis case study
2022cites this paper
Augmenting interpretable models with large language models during training
2022cites this paper
Mapping Knowledge Representations to Concepts: A Review and New Perspectives
2022influential citation
Optimization of the remediation of oil-drilling cuttings by combining artificial neural networks with knowledge-based models
2022cites this paper
Easy to Decide, Hard to Agree: Reducing Disagreements Between Saliency Methods
2022cites this paper
Machine learning for a sustainable energy future
2022cites this paper
Emb-GAM: an Interpretable and Efficient Predictor using Pre-trained Language Models
2022cites this paper
Comparing Baseline Shapley and Integrated Gradients for Local Explanation: Some Additional Insights
2022cites this paper
Detection of ADHD based on Eye Movements during Natural Viewing
2022cites this paper
Entropy-based discrimination between translated Chinese and original Chinese using data mining techniques
2022cites this paper
Staff Working Paper No. 947 Software validation and artificial intelligence in finance – a primer
2021cites this paper
AI4Water v1.0: An open source python package for modeling hydrological time series using data-driven methods
2021cites this paper
Explainable Artificial Intelligence Approaches: A Survey
2021cites this paper
Explainable autoencoder-based representation learning for gene expression data
2021cites this paper
A novel model usability evaluation framework (MUsE) for explainable artificial intelligence
2021influential citation
Decorrelated Variable Importance
2021cites this paper
Diesel passenger vehicle shares influenced COVID-19 changes in urban nitrogen dioxide pollution
2021cites this paper
Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model
2021cites this paper
Supporting digital content marketing and messaging through topic modelling and decision trees
2021cites this paper
Model-Agnostic Methods for XAI
2021cites this paper
Interpretable Machine Learning for Meteorological Data
2021cites this paper
Research on Explainable Artificial Intelligence Techniques: An User Perspective
2021cites this paper
Comprehensible Convolutional Neural Networks via Guided Concept Learning
2021cites this paper
Information-theoretic Evolution of Model Agnostic Global Explanations
2021cites this paper
Applying interpretable machine learning to classify tree and utility pole related crash injury types
2021cites this paper
A Novel Method for COVID-19 Diagnosis Using Artificial Intelligence in Chest X-ray Images
2021cites this paper
Learning Semantically Meaningful Features for Interpretable Classifications
2021cites this paper
A qualitative research framework for the design of user-centered displays of explanations for machine learning model predictions in healthcare
2020cites this paper
Right for the Wrong Scientific Reasons: Revising Deep Networks by Interacting with their Explanations
2020cites this paper
Big-Data Science in Porous Materials: Materials Genomics and Machine Learning
2020cites this paper
Interpret Neural Networks by Extracting Critical Subnetworks
2020cites this paper
Explainable Predictive Process Monitoring
2020influential citation
Domain-Specific, Semi-Supervised Transfer Learning for Medical Imaging
2020cites this paper
Making deep neural networks right for the right scientific reasons by interacting with their explanations
2020cites this paper
MeLIME: Meaningful Local Explanation for Machine Learning Models
2020cites this paper
Understanding Multi-Vehicle Collision Patterns on Freeways—A Machine Learning Approach
2020cites this paper
Latent Modeling of the Human Epigenome
2020cites this paper
Why model why? Assessing the strengths and limitations of LIME
2020influential citation
Automated biomarker candidate discovery in imaging mass spectrometry data through spatially localized Shapley additive explanations
2020cites this paper
Intrinsic Meaning of Shapley Values in Regression
2020cites this paper
Practical Aspects of Hydraulic Fracturing Design Optimization using Machine Learning on Field Data: Digital Database, Algorithms and Planning the Field Tests (Russian)
2020cites this paper
Fibers of Failure: Classifying Errors in Predictive Processes
2020cites this paper
Interpretable machine learning
2020cites this paper
Interpretation of Stability Assessment Machine Learning Models Based on Shapley Value
2019cites this paper
MODEL AGNOSTIC GLOBALLY INTERPRETABLE EXPLANATIONS
2019cites this paper
The Vulnerabilities of Graph Convolutional Networks: Stronger Attacks and Defensive Techniques
2019cites this paper
Recovering Pairwise Interactions Using Neural Networks
2019cites this paper
A comparative study for interpreting deep learning prediction of the Parkinson's disease diagnosis from SPECT imaging
2019cites this paper
Technical Report: Partial Dependence through Stratification
2019cites this paper
Multiclass Disease Classification from Microbial Whole-Community Metagenomes
2019cites this paper
Optimizing for Interpretability in Deep Neural Networks with Tree Regularization
2019cites this paper
Local Interpretation Methods to Machine Learning Using the Domain of the Feature Space
2019cites this paper
How to make more from exposure data? An integrated machine learning pipeline to predict pathogen exposure
2019cites this paper
A Stratification Approach to Partial Dependence for Codependent Variables
2019cites this paper
Global Aggregations of Local Explanations for Black Box models
2019cites this paper
Adversarial Examples for Graph Data: Deep Insights into Attack and Defense
2019cites this paper
Extracting Incentives from Black-Box Decisions
2019cites this paper
Explanatory Interactive Machine Learning
2019influential citation
Neural network interpretation of the Parkinson's disease diagnosis from SPECT imaging
2019cites this paper
Adversarial Examples on Graph Data: Deep Insights into Attack and Defense
2019cites this paper
Local vs. Global Interpretability of Machine Learning Models in Type 2 Diabetes Mellitus Screening
2019cites this paper
Toward Faithful Explanatory Active Learning with Self-explainable Neural Nets ?
2019cites this paper
Fibres of Failure: Classifying errors in predictive processes
2018cites this paper
Artificial Intelligence for Decision Support in Command and Control Systems
2018cites this paper
Response to anonymous referee #1
2018cites this paper
Fuzzy logic interpretation of quadratic networks
2018cites this paper
NOMAD 2018 Kaggle Competition: Solving Materials Science Challenges Through Crowd Sourcing
2018cites this paper
Multi-scale deep tensor factorization learns a latent representation of the human epigenome
2018cites this paper
Expert-in-the-Loop Supervised Learning for Computer Security Detection Systems. (Apprentissage supervisé et systèmes de détection : une approche de bout-en-bout impliquant les experts en sécurité)
2018cites this paper
A Social Science-based Approach to Explanations for (Game) AI
2018cites this paper
Interpretable Machine Learning with Applications in Neuroscience
2018cites this paper
Training Machine Learning Models by Regularizing their Explanations
2018cites this paper
Stakeholders in Explainable AI
2018cites this paper
Interpret Neural Networks by Identifying Critical Data Routing Paths
2018cites this paper
Model Explanation and Interpretation Concepts for Stimulating Advanced Human-Machine Interaction with "Expert-in-the-Loop"
2018cites this paper
Fuzzy Logic Interpretation of Artificial Neural Networks
2018cites this paper
Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences
2018cites this paper
Contrastive Explanations with Local Foil Trees
2018cites this paper
"Why Should I Trust Interactive Learners?" Explaining Interactive Queries of Classifiers to Users
2018influential citation
The Design and Validation of an Intuitive Confidence Measure
2018cites this paper
Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs
2018cites this paper
Visualizing the Feature Importance for Black Box Models
2018cites this paper