Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute

Tong Wang

Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute

Tong Wang

Proceedings of the 36th International Conference on Machine Learning, PMLR 97:6505-6514, 2019.

Abstract

This work addresses the situation where a black-box model with good predictive performance is chosen over its interpretable competitors, and we show interpretability is still achievable in this case. Our solution is to find an interpretable substitute on a subset of data where the black-box model is overkill or nearly overkill while leaving the rest to the black-box. This transparency is obtained at minimal cost or no cost of the predictive performance. Under this framework, we develop a Hybrid Rule Sets (HyRS) model that uses decision rules to capture the subspace of data where the rules are as accurate or almost as accurate as the black-box provided. To train a HyRS, we devise an efficient search algorithm that iteratively finds the optimal model and exploits theoretically grounded strategies to reduce computation. Our framework is agnostic to the black-box during training. Experiments on structured and text data show that HyRS obtains an effective trade-off between transparency and interpretability.

Cite this Paper

BibTeX

@InProceedings{pmlr-v97-wang19a,
  title = 	 {Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute},
  author =       {Wang, Tong},
  booktitle = 	 {Proceedings of the 36th International Conference on Machine Learning},
  pages = 	 {6505--6514},
  year = 	 {2019},
  editor = 	 {Chaudhuri, Kamalika and Salakhutdinov, Ruslan},
  volume = 	 {97},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {09--15 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v97/wang19a/wang19a.pdf},
  url = 	 {https://proceedings.mlr.press/v97/wang19a.html},
  abstract = 	 {This work addresses the situation where a black-box model with good predictive performance is chosen over its interpretable competitors, and we show interpretability is still achievable in this case. Our solution is to find an interpretable substitute on a subset of data where the black-box model is overkill or nearly overkill while leaving the rest to the black-box. This transparency is obtained at minimal cost or no cost of the predictive performance. Under this framework, we develop a Hybrid Rule Sets (HyRS) model that uses decision rules to capture the subspace of data where the rules are as accurate or almost as accurate as the black-box provided. To train a HyRS, we devise an efficient search algorithm that iteratively finds the optimal model and exploits theoretically grounded strategies to reduce computation. Our framework is agnostic to the black-box during training. Experiments on structured and text data show that HyRS obtains an effective trade-off between transparency and interpretability.}
}

Endnote

%0 Conference Paper
%T Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute
%A Tong Wang
%B Proceedings of the 36th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2019
%E Kamalika Chaudhuri
%E Ruslan Salakhutdinov	
%F pmlr-v97-wang19a
%I PMLR
%P 6505--6514
%U https://proceedings.mlr.press/v97/wang19a.html
%V 97
%X This work addresses the situation where a black-box model with good predictive performance is chosen over its interpretable competitors, and we show interpretability is still achievable in this case. Our solution is to find an interpretable substitute on a subset of data where the black-box model is overkill or nearly overkill while leaving the rest to the black-box. This transparency is obtained at minimal cost or no cost of the predictive performance. Under this framework, we develop a Hybrid Rule Sets (HyRS) model that uses decision rules to capture the subspace of data where the rules are as accurate or almost as accurate as the black-box provided. To train a HyRS, we devise an efficient search algorithm that iteratively finds the optimal model and exploits theoretically grounded strategies to reduce computation. Our framework is agnostic to the black-box during training. Experiments on structured and text data show that HyRS obtains an effective trade-off between transparency and interpretability.

APA

Wang, T.. (2019). Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute. Proceedings of the 36th International Conference on Machine Learning, in Proceedings of Machine Learning Research 97:6505-6514 Available from https://proceedings.mlr.press/v97/wang19a.html.

Gaining Free or Low-Cost Interpretability with Interpretable Partial Substitute

Abstract

Cite this Paper

Related Material