Tangentially Aligned Integrated Gradients for User-Friendly Explanations
Lachlan Simpson, Federico Costanza, Kyle Millar, A. Cheng, Cheng-Chew Lim, Hong-Gunn Chew
Published 2025 in Irish Conference on Artificial Intelligence and Cognitive Science
ABSTRACT
Integrated gradients is a prevalent attribution method in machine learning for addressing the black-box problem of neural networks. The explanations it produces depend on the choice of base-point, which is not a priori obvious and can lead to drastically different explanations. There is a longstanding hypothesis that data lie on a low-dimensional Riemannian manifold. On such a manifold, the quality of an explanation can be measured by the extent to which the explanation for a point lies in its tangent space. In this work, we propose choosing the base-point to maximise the tangential alignment of the explanation. We formalise the notion of tangential alignment and provide theoretical conditions under which a base-point choice yields explanations lying in the tangent space. We demonstrate how to approximate the optimal base-point on several well-known image classification datasets, and we compare the optimal base-point with common base-point choices and three gradient-based explainability methods.
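The two ingredients of the abstract can be sketched concretely. Below is a minimal, hedged illustration (not the authors' code): a Riemann-sum approximation of integrated gradients with respect to a chosen base-point, and a tangential-alignment score measuring how much of the attribution vector lies in an assumed-known tangent space of the data manifold. All function names, the midpoint-rule step count, and the toy tangent basis are illustrative assumptions.

```python
import numpy as np

def integrated_gradients(grad_f, x, base, steps=64):
    """Midpoint-rule approximation of
    IG_i(x) = (x_i - base_i) * \int_0^1 [df/dx_i](base + t (x - base)) dt."""
    ts = (np.arange(steps) + 0.5) / steps           # midpoints of [0, 1]
    path = base + ts[:, None] * (x - base)          # straight path from base to x
    avg_grad = np.mean([grad_f(p) for p in path], axis=0)
    return (x - base) * avg_grad

def tangential_alignment(attr, tangent_basis):
    """Fraction of the attribution's norm captured by its orthogonal
    projection onto span(tangent_basis); columns assumed orthonormal."""
    proj = tangent_basis @ (tangent_basis.T @ attr)
    return np.linalg.norm(proj) / np.linalg.norm(attr)

# Toy check: for a linear model f(x) = w . x, IG recovers w * (x - base).
w = np.array([1.0, 2.0, 3.0])
grad_f = lambda p: w                                # constant gradient
x, base = np.ones(3), np.zeros(3)
attr = integrated_gradients(grad_f, x, base)        # -> w * (x - base)
T = np.eye(3)[:, :2]                                # toy tangent space: first two axes
score = tangential_alignment(attr, T)
```

Under the paper's proposal, one would search over candidate base-points and keep the one maximising `score`; this sketch only fixes a single base-point for illustration.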
PUBLICATION RECORD
- Publication year
2025
- Venue
Irish Conference on Artificial Intelligence and Cognitive Science
- Publication date
2025-03-11
- Fields of study
Mathematics, Computer Science
- Source metadata
Semantic Scholar