Tunable Plug-In Rules with Reduced Posterior Certainty Loss in Imbalanced Datasets
Proceedings of the First International Workshop on Learning with Imbalanced Domains: Theory and Applications, PMLR 74:116-128, 2017.
Abstract
Classifiers have difficulty recognizing under-represented minority classes in imbalanced datasets because they focus on minimizing the overall misclassification error, which introduces predictive biases against those classes. Post-processing plug-in rules are a popular remedy for class imbalance, but they often distort the certainty of the base classifier's posteriors even when those posteriors already yield correct classifications. This shortcoming makes them ill-suited to scoring tasks, where informative posterior scores are required for human interpretation. To address this, we propose the ILoss metric to measure the impact of imbalance-aware classifiers on the certainty of posterior distributions. We then generalize post-processing plug-in rules into an easily tunable framework and show theoretically that this framework tends to improve performance balance. Finally, we experimentally show that appropriate use of our framework can reduce ILoss while yielding performance similar to existing plug-in rules for binary problems with respect to common imbalance-aware measures.
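As a rough illustration of the kind of post-processing the abstract refers to (not the paper's specific framework), a standard plug-in rule rethresholds the base classifier's posterior probabilities rather than retraining the model. The function name and threshold value below are hypothetical, chosen only for the sketch:

```python
import numpy as np

def plugin_rule(posteriors, threshold):
    """Classify as positive (minority) when P(y=1 | x) meets the threshold.

    The standard Bayes rule uses threshold 0.5; imbalance-aware plug-in
    rules shift the threshold (e.g. toward the minority-class prior) so
    that more minority examples are recovered, without touching the base
    classifier itself.
    """
    return (posteriors >= threshold).astype(int)

# Posteriors P(y=1 | x) from some base classifier (illustrative values).
posteriors = np.array([0.2, 0.45, 0.6, 0.05])

print(plugin_rule(posteriors, 0.5))  # standard rule      -> [0 0 1 0]
print(plugin_rule(posteriors, 0.1))  # shifted threshold  -> [1 1 1 0]
```

Note that such a rule changes only the decision, not the reported posterior itself; the certainty loss the abstract discusses arises when the posteriors are additionally reweighted to match the shifted decision boundary.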