AUCμ: A Performance Metric for Multi-Class Machine Learning Models

Ross Kleiman, David Page
Proceedings of the 36th International Conference on Machine Learning, PMLR 97:3439-3447, 2019.

Abstract

The area under the receiver operating characteristic curve (AUC) is arguably the most common metric in machine learning for assessing the quality of a two-class classification model. As the number and complexity of machine learning applications grows, so too does the need for measures that can gracefully extend to classification models trained for more than two classes. Prior work in this area has proven computationally intractable and/or inconsistent with known properties of AUC, and thus there is still a need for an improved multi-class efficacy metric. We provide in this work a multi-class extension of AUC that we call AUC{\textmu} that is derived from first principles of the binary class AUC. AUC{\textmu} has similar computational complexity to AUC and maintains the properties of AUC critical to its interpretation and use.

Cite this Paper


BibTeX
@InProceedings{pmlr-v97-kleiman19a, title = {{AUC}{\textmu}: A Performance Metric for Multi-Class Machine Learning Models}, author = {Kleiman, Ross and Page, David}, booktitle = {Proceedings of the 36th International Conference on Machine Learning}, pages = {3439--3447}, year = {2019}, editor = {Chaudhuri, Kamalika and Salakhutdinov, Ruslan}, volume = {97}, series = {Proceedings of Machine Learning Research}, month = {09--15 Jun}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v97/kleiman19a/kleiman19a.pdf}, url = {https://proceedings.mlr.press/v97/kleiman19a.html}, abstract = {The area under the receiver operating characteristic curve (AUC) is arguably the most common metric in machine learning for assessing the quality of a two-class classification model. As the number and complexity of machine learning applications grows, so too does the need for measures that can gracefully extend to classification models trained for more than two classes. Prior work in this area has proven computationally intractable and/or inconsistent with known properties of AUC, and thus there is still a need for an improved multi-class efficacy metric. We provide in this work a multi-class extension of AUC that we call AUC{\textmu} that is derived from first principles of the binary class AUC. AUC{\textmu} has similar computational complexity to AUC and maintains the properties of AUC critical to its interpretation and use.} }
Endnote
%0 Conference Paper %T AUCμ: A Performance Metric for Multi-Class Machine Learning Models %A Ross Kleiman %A David Page %B Proceedings of the 36th International Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2019 %E Kamalika Chaudhuri %E Ruslan Salakhutdinov %F pmlr-v97-kleiman19a %I PMLR %P 3439--3447 %U https://proceedings.mlr.press/v97/kleiman19a.html %V 97 %X The area under the receiver operating characteristic curve (AUC) is arguably the most common metric in machine learning for assessing the quality of a two-class classification model. As the number and complexity of machine learning applications grows, so too does the need for measures that can gracefully extend to classification models trained for more than two classes. Prior work in this area has proven computationally intractable and/or inconsistent with known properties of AUC, and thus there is still a need for an improved multi-class efficacy metric. We provide in this work a multi-class extension of AUC that we call AUC{\textmu} that is derived from first principles of the binary class AUC. AUC{\textmu} has similar computational complexity to AUC and maintains the properties of AUC critical to its interpretation and use.
APA
Kleiman, R. & Page, D.. (2019). AUCμ: A Performance Metric for Multi-Class Machine Learning Models. Proceedings of the 36th International Conference on Machine Learning, in Proceedings of Machine Learning Research 97:3439-3447 Available from https://proceedings.mlr.press/v97/kleiman19a.html.

Related Material