Statistical Significance of Feature Importance Rankings
Proceedings of the Forty-first Conference on Uncertainty in Artificial Intelligence, PMLR 286:1476-1496, 2025.
Abstract
Feature importance scores are ubiquitous tools for understanding the predictions of machine learning models. However, many popular attribution methods suffer from high instability due to random sampling. Leveraging novel ideas from hypothesis testing, we devise techniques that ensure the most important features are correct with high-probability guarantees. These are capable of assessing both the set of $K$ top-ranked features and the order of its elements. Given local or global importance scores, we demonstrate how to retrospectively verify the stability of the highest ranks. We then introduce two efficient sampling algorithms that identify the $K$ most important features, perhaps in order, with probability at least $1-\alpha$. The theoretical justification for these procedures is validated empirically on SHAP and LIME.
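The abstract describes sampling algorithms that return the top-$K$ features with a $1-\alpha$ correctness guarantee. The following is a minimal illustrative sketch (not the paper's algorithm): it sequentially draws noisy importance estimates and stops once Hoeffding confidence intervals, with a union bound over features and rounds, separate the top-$K$ features from the rest. The function `sample_feature` and all parameter names are hypothetical stand-ins for an attribution method's per-feature sampler.

```python
import math
import random

def top_k_with_guarantee(sample_feature, n_features, k, alpha,
                         batch=50, max_rounds=200):
    """Illustrative sketch only: identify the k features with the largest
    mean importance, with error probability at most alpha.

    Assumes sample_feature(j) returns an unbiased estimate in [0, 1] of
    feature j's importance (e.g., one Monte Carlo attribution sample).
    Uses Hoeffding confidence intervals with a union bound over all
    features and all sampling rounds.
    """
    sums = [0.0] * n_features
    counts = [0] * n_features
    # Confidence radius valid simultaneously for every (feature, round) pair.
    def radius(n):
        return math.sqrt(math.log(2 * n_features * max_rounds / alpha) / (2 * n))

    for _ in range(max_rounds):
        for j in range(n_features):
            for _ in range(batch):
                sums[j] += sample_feature(j)
            counts[j] += batch
        means = [sums[j] / counts[j] for j in range(n_features)]
        order = sorted(range(n_features), key=lambda j: means[j], reverse=True)
        top, rest = order[:k], order[k:]
        # Stop when every candidate's lower bound clears every other
        # feature's upper bound, so the top-k set is certified.
        if min(means[j] - radius(counts[j]) for j in top) > \
           max(means[j] + radius(counts[j]) for j in rest):
            return top  # sorted by estimated importance
    return None  # budget exhausted without a certified answer

# Toy demo with simulated Bernoulli importance samples.
if __name__ == "__main__":
    random.seed(0)
    true_means = [0.9, 0.7, 0.5, 0.3, 0.1]
    sampler = lambda j: float(random.random() < true_means[j])
    print(top_k_with_guarantee(sampler, n_features=5, k=2, alpha=0.05))
```

In this toy run the certified set should match the two features with the largest true means; features whose means are close together simply require more samples before the intervals separate, which is what makes retrospective stability checks of the highest ranks meaningful.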