Integrated Gradients (IG) as well as its variants are well-known techniques for interpreting the decisions of deep neural networks. While IG-based approaches attain state-of-the-art performance, they often integrate noise into their explanation saliency maps, which reduce their interpretability. To minimize the noise, we examine the source of the noise analytically and propose a new approach to reduce the explanation noise based on our analytical findings. We propose the Important Direction Gradient Integration (IDGI) framework, which can be easily incorporated into any IG-based method that uses the Reimann Integration for integrated gradient computation. Extensive experiments with three IG-based methods show that IDGI improves them drastically on numerous interpretability metrics.
IDGI: A Framework to Eliminate Explanation Noise from Integrated Gradients
The Important Direction Gradient Integration (IDGI) framework reduces noise in explanation saliency maps of IG-based deep neural network interpretation methods.
- Year
- 2023
- Venue
- CVPR 2023 1
- Authors
- 3
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2303.14242ARXIV-DEFAULT
- TL;DR
- Semantic Scholar