Active Learning as Non-Convex Optimization

Andrew Guillory, Erick Chastain, Jeff Bilmes
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, PMLR 5:201-208, 2009.

Abstract

We propose a new view of active learning algorithms as optimization. We show that many online active learning algorithms can be viewed as stochastic gradient descent on non-convex objective functions. Variations of some of these algorithms and objective functions have been previously proposed without noting this connection. We also point out a connection between the standard min-margin offline active learning algorithm and non-convex losses. Finally, we discuss and show empirically how viewing active learning as non-convex loss minimization helps explain two previously observed phenomena: on some data sets certain active learning algorithms achieve lower generalization error than passive learning algorithms, while on other data sets many active learning algorithms are prone to local minima.
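As a rough illustration of the stochastic-gradient view described above (a minimal sketch under assumed details, not the paper's exact algorithms or objective functions), the Python snippet below runs an online margin-based active learner on toy data and reads it as SGD on a non-convex per-example loss: the loss is perceptron-like inside the low-confidence band |w.x| < threshold and flat outside it, so confident points are neither queried nor used for updates. The threshold, step size, toy data, and function names are all illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)
THRESHOLD = 1.0  # confidence band: |w.x| below this triggers a label query (assumed value)
ETA = 0.1        # step size (assumed value)

def active_sgd_step(w, x, y):
    """One round of a margin-based online active learner, read as an SGD step.

    Implied per-example loss (illustrative, not the paper's exact objective):
    perceptron-like inside the band |w.x| < THRESHOLD, constant outside it.
    The constant plateaus are what make the implied objective non-convex.
    """
    score = np.dot(w, x)
    if abs(score) >= THRESHOLD:
        # Confident region: flat part of the loss, zero gradient, no label query.
        return w, False
    # Low-confidence region: this is where the label y would actually be requested.
    if y * score <= 0:
        w = w + ETA * y * x  # stochastic (sub)gradient step on the non-flat part
    return w, True

# Toy stream: two 2-D Gaussian blobs, labeled by the sign of the first coordinate.
X = rng.normal(size=(500, 2)) + rng.choice([-2.0, 2.0], size=(500, 1))
labels = np.sign(X[:, 0])

w = np.zeros(2)
queries = 0
for x_t, y_t in zip(X, labels):
    w, queried = active_sgd_step(w, x_t, y_t)
    queries += int(queried)

print(f"queried {queries} of {len(X)} labels, learned w = {w}")

In this reading, a point the current hypothesis misclassifies confidently contributes no gradient and is never queried, which is one way such a learner can stall in a poor local minimum, consistent with the behavior discussed in the abstract.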

Cite this Paper


BibTeX
@InProceedings{pmlr-v5-guillory09a,
  title     = {Active Learning as Non-Convex Optimization},
  author    = {Guillory, Andrew and Chastain, Erick and Bilmes, Jeff},
  booktitle = {Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics},
  pages     = {201--208},
  year      = {2009},
  editor    = {van Dyk, David and Welling, Max},
  volume    = {5},
  series    = {Proceedings of Machine Learning Research},
  address   = {Hilton Clearwater Beach Resort, Clearwater Beach, Florida USA},
  month     = {16--18 Apr},
  publisher = {PMLR},
  pdf       = {http://proceedings.mlr.press/v5/guillory09a/guillory09a.pdf},
  url       = {https://proceedings.mlr.press/v5/guillory09a.html},
  abstract  = {We propose a new view of active learning algorithms as optimization. We show that many online active learning algorithms can be viewed as stochastic gradient descent on non-convex objective functions. Variations of some of these algorithms and objective functions have been previously proposed without noting this connection. We also point out a connection between the standard min-margin offline active learning algorithm and non-convex losses. Finally, we discuss and show empirically how viewing active learning as non-convex loss minimization helps explain two previously observed phenomena: certain active learning algorithms achieve better generalization error than passive learning algorithms on certain data sets and on other data sets many active learning algorithms are prone to local minima.}
}
Endnote
%0 Conference Paper
%T Active Learning as Non-Convex Optimization
%A Andrew Guillory
%A Erick Chastain
%A Jeff Bilmes
%B Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2009
%E David van Dyk
%E Max Welling
%F pmlr-v5-guillory09a
%I PMLR
%P 201--208
%U https://proceedings.mlr.press/v5/guillory09a.html
%V 5
%X We propose a new view of active learning algorithms as optimization. We show that many online active learning algorithms can be viewed as stochastic gradient descent on non-convex objective functions. Variations of some of these algorithms and objective functions have been previously proposed without noting this connection. We also point out a connection between the standard min-margin offline active learning algorithm and non-convex losses. Finally, we discuss and show empirically how viewing active learning as non-convex loss minimization helps explain two previously observed phenomena: certain active learning algorithms achieve better generalization error than passive learning algorithms on certain data sets and on other data sets many active learning algorithms are prone to local minima.
RIS
TY - CPAPER
TI - Active Learning as Non-Convex Optimization
AU - Andrew Guillory
AU - Erick Chastain
AU - Jeff Bilmes
BT - Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics
DA - 2009/04/15
ED - David van Dyk
ED - Max Welling
ID - pmlr-v5-guillory09a
PB - PMLR
DP - Proceedings of Machine Learning Research
VL - 5
SP - 201
EP - 208
L1 - http://proceedings.mlr.press/v5/guillory09a/guillory09a.pdf
UR - https://proceedings.mlr.press/v5/guillory09a.html
AB - We propose a new view of active learning algorithms as optimization. We show that many online active learning algorithms can be viewed as stochastic gradient descent on non-convex objective functions. Variations of some of these algorithms and objective functions have been previously proposed without noting this connection. We also point out a connection between the standard min-margin offline active learning algorithm and non-convex losses. Finally, we discuss and show empirically how viewing active learning as non-convex loss minimization helps explain two previously observed phenomena: certain active learning algorithms achieve better generalization error than passive learning algorithms on certain data sets and on other data sets many active learning algorithms are prone to local minima.
ER -
APA
Guillory, A., Chastain, E. & Bilmes, J. (2009). Active Learning as Non-Convex Optimization. Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 5:201-208. Available from https://proceedings.mlr.press/v5/guillory09a.html.
