Using Stratified Sampling to Improve LIME Image Explanations

Published in AAAI-24 - 38th AAAI Conference on Artificial Intelligence, 2024

We investigate the use of a stratified sampling approach for LIME Image, a popular model-agnostic explainable AI method for computer vision tasks, in order to reduce the artifacts generated by typical Monte Carlo sampling. Such artifacts are due to the undersampling of the dependent variable in the synthetic neighborhood around the image being explained, which may result in inadequate explanations due to the impossibility of fitting a linear regressor on the sampled data. We then highlight a connection with the Shapley theory, where similar arguments about undersampling and sample relevance were suggested in the past. We derive all the formulas and adjustment factors required for an unbiased stratified sampling estimator. Experiments show the efficacy of the proposed approach.

Paper contribution

In this paper we:

investigate the distribution of the dependent variable in the sampled synthetic neighborhood of LIME Image, identifying in the undersampling a cause that results in inadequate explanations;
delve into the causes of the synthetic neighborhood inadequacy, recognizing a link with the Shapley theory;
reformulate the synthetic neighborhood generation using an unbiased stratified sampling strategy;
provide empirical proofs of the advantage of using stratified sampling for LIME Image on a popular dataset.

Method Availability

The method is available under: https://github.com/rashidrao-pk/lime_stratified
Examples and full results given at https://github.com/rashidrao-pk/lime-stratified-examples
Links: Python Package, GitHub Codes, Paper PDF Arxiv, PDF on Proceedings of AAAI-24

Datasets and Models

Dataset: ImageNet
Model: Resnet-50

How LIME_Image Works

Keywords

XAI · LIME · Stratified Sampling . ML: Transparent, Interpretable, Explainable ML, RU: Stochastic Optimization, SO: Sampling/Simulation-based Search

Authors

Sr. No.	Author Name	Affiliation	Google Scholar
1.	Muhammad Rashid	University of Torino, Dept. of Computer Science, Torino, Italy	Muhammad Rashid
2.	Elvio G. Amparore	University of Torino, Dept. of Computer Science, Torino, Italy	Elvio G. Amparore
3.	Enrico Ferrari	Rulex Innovation Labs, Rulex Inc., Genova, Italy	Enrico Ferrari
4.	Damiano Verda	Rulex Innovation Labs, Rulex Inc., Genova, Italy	Damiano Verda

Recommended citation: Rashid,Muhammad et al. (2024). "." Proceedings of the AAAI Conference on Artificial Intelligence. 38(13).
Download Paper | Download Slides

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Muhammad Rashid