Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Emmanouil Antonios Platanios; Avinava Dubey; Tom Mitchell

Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Emmanouil Antonios Platanios, Avinava Dubey, Tom Mitchell

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:1416-1425, 2016.

Abstract

We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions in both estimating accuracies, and combining multiple classifier outputs.

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-platanios16,
  title = 	 {Estimating Accuracy from Unlabeled Data: A Bayesian Approach},
  author = 	 {Platanios, Emmanouil Antonios and Dubey, Avinava and Mitchell, Tom},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {1416--1425},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/platanios16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/platanios16.html},
  abstract = 	 {We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions in both estimating accuracies, and combining multiple classifier outputs.}
}

Endnote

%0 Conference Paper
%T Estimating Accuracy from Unlabeled Data: A Bayesian Approach
%A Emmanouil Antonios Platanios
%A Avinava Dubey
%A Tom Mitchell
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger	
%F pmlr-v48-platanios16
%I PMLR
%P 1416--1425
%U https://proceedings.mlr.press/v48/platanios16.html
%V 48
%X We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions in both estimating accuracies, and combining multiple classifier outputs.

RIS


TY  - CPAPER
TI  - Estimating Accuracy from Unlabeled Data: A Bayesian Approach
AU  - Emmanouil Antonios Platanios
AU  - Avinava Dubey
AU  - Tom Mitchell
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger	
ID  - pmlr-v48-platanios16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 1416
EP  - 1425
L1  - http://proceedings.mlr.press/v48/platanios16.pdf
UR  - https://proceedings.mlr.press/v48/platanios16.html
AB  - We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions in both estimating accuracies, and combining multiple classifier outputs.
ER  -

APA


Platanios, E.A., Dubey, A. & Mitchell, T.. (2016). Estimating Accuracy from Unlabeled Data: A Bayesian Approach. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:1416-1425 Available from https://proceedings.mlr.press/v48/platanios16.html.

Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Abstract

Cite this Paper

Related Material