Gradient-Informed Neural Network Statistical Robustness Estimation
Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR 206:323-334, 2023.
Abstract
Deep neural networks are, to some extent, robust to random corruptions of their inputs. This global sense of safety is not sufficient in critical applications, where probabilities of failure must be assessed accurately. Previous works have applied known statistical methods from the field of rare event analysis to classification, but they treat classifiers as black-box models and ignore gradient information, which is readily available for deep learning models through automatic differentiation. We propose a new and highly efficient estimator of failure probabilities dedicated to neural networks: it leverages the fast computation of the model's gradients through back-propagation.
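To make the notion of a failure probability under random input corruption concrete, the following minimal sketch compares a black-box estimate with a gradient-based one on a toy problem. It is not the estimator proposed in the paper. It assumes a hypothetical linear score function in place of a network, isotropic Gaussian noise of standard deviation `sigma`, and a classical first-order approximation from reliability analysis as the gradient-informed estimate; all names (`score`, `crude_monte_carlo`, `first_order_estimate`) are illustrative.

```python
import math
import random


def score(x, w, b):
    """Toy margin standing in for a network (assumption): failure when <= 0."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def crude_monte_carlo(x0, w, b, sigma, n, rng):
    """Black-box estimate: fraction of noisy copies of x0 that fail."""
    failures = 0
    for _ in range(n):
        x = [xi + sigma * rng.gauss(0.0, 1.0) for xi in x0]
        if score(x, w, b) <= 0.0:
            failures += 1
    return failures / n


def first_order_estimate(x0, w, b, sigma):
    """Gradient-based estimate from a linearization of the score at x0.

    For this linear toy score the gradient with respect to x is w, so the
    linearization is exact. For a real network the gradient would come from
    back-propagation and the result would only be an approximation.
    """
    grad_norm = math.sqrt(sum(wi * wi for wi in w))
    beta = score(x0, w, b) / (sigma * grad_norm)  # distance to failure in noise units
    return 0.5 * math.erfc(beta / math.sqrt(2.0))  # standard normal CDF at -beta


# Hypothetical toy values chosen for illustration only.
w, b = [1.0, -2.0, 0.5], 0.5
x0 = [1.0, 0.0, 1.0]
sigma = 0.5

p_mc = crude_monte_carlo(x0, w, b, sigma, n=20000, rng=random.Random(0))
p_fo = first_order_estimate(x0, w, b, sigma)
print(f"crude Monte Carlo: {p_mc:.4f}, first-order: {p_fo:.4f}")
```

The crude Monte Carlo estimate needs on the order of 1/p samples to observe even a single failure, which is why black-box sampling becomes expensive when failures are rare, whereas the gradient-based estimate here costs one score and one gradient evaluation.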