Multi-Label Classification with Unlabeled Data: An Inductive Approach

Le Wu; Min-Ling Zhang

Multi-Label Classification with Unlabeled Data: An Inductive Approach

Le Wu, Min-Ling Zhang

Proceedings of the 5th Asian Conference on Machine Learning, PMLR 29:197-212, 2013.

Abstract

The problem of multi-label classification has attracted great interests in the last decade. Multi-label classification refers to the problems where an example that is represented by a \emphsingle instance can be assigned to \emphmore than one category. Until now, most of the researches on multi-label classification have focused on supervised settings whose assumption is that large amount of labeled training data is available. Unfortunately, labeling training example is expensive and time-consuming, especially when it has more than one label. However, in many cases abundant unlabeled data is easy to obtain. Current attempts toward exploiting unlabeled data for multi-label classification work under the \emphtransductive setting, which aim at making predictions on existing unlabeled data while can not generalize to new unseen data. In this paper, the problem of \emphinductive semi-supervised multi-label classification is studied, where a new approach named \textsliMLCU, i.e. \emphinductive Multi-Label Classification with Unlabeled data, is proposed. We formulate the inductive semi-supervised multi-label learning as an optimization problem of learning linear models and ConCave Convex Procedure \textsl(CCCP) is applied to optimize the non-convex optimization problem. Empirical studies on twelve diversified real-word multi-label learning tasks clearly validate the superiority of \textsliMLCU against the other well-established multi-label learning approaches.

Cite this Paper

BibTeX

@InProceedings{pmlr-v29-Wu13,
  title = 	 {Multi-Label Classification with Unlabeled Data: An Inductive Approach},
  author = 	 {Wu, Le and Zhang, Min-Ling},
  booktitle = 	 {Proceedings of the 5th Asian Conference on Machine Learning},
  pages = 	 {197--212},
  year = 	 {2013},
  editor = 	 {Ong, Cheng Soon and Ho, Tu Bao},
  volume = 	 {29},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Australian National University, Canberra, Australia},
  month = 	 {13--15 Nov},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v29/Wu13.pdf},
  url = 	 {https://proceedings.mlr.press/v29/Wu13.html},
  abstract = 	 {The problem of multi-label classification has attracted great interests in the last decade. Multi-label classification refers to the problems where an example that is represented by a \emphsingle instance can be assigned to \emphmore than one category. Until now, most of the researches on multi-label classification have focused on supervised settings whose assumption is that large amount of labeled training data is available. Unfortunately, labeling training example is expensive and time-consuming, especially when it has more than one label. However, in many cases abundant unlabeled data is easy to obtain. Current attempts toward exploiting unlabeled data for multi-label classification work under the \emphtransductive setting, which aim at making predictions on existing unlabeled data while can not generalize to new unseen data. In this paper, the problem of \emphinductive semi-supervised multi-label classification is studied, where a new approach named \textsliMLCU, i.e. \emphinductive Multi-Label Classification with Unlabeled data, is proposed. We formulate the inductive semi-supervised multi-label learning as an optimization problem of learning linear models and ConCave Convex Procedure \textsl(CCCP) is applied to optimize the non-convex optimization problem. Empirical studies on twelve diversified real-word multi-label learning tasks clearly validate the superiority of \textsliMLCU against the other well-established multi-label learning approaches.}
}

Endnote

%0 Conference Paper
%T Multi-Label Classification with Unlabeled Data: An Inductive Approach
%A Le Wu
%A Min-Ling Zhang
%B Proceedings of the 5th Asian Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2013
%E Cheng Soon Ong
%E Tu Bao Ho	
%F pmlr-v29-Wu13
%I PMLR
%P 197--212
%U https://proceedings.mlr.press/v29/Wu13.html
%V 29
%X The problem of multi-label classification has attracted great interests in the last decade. Multi-label classification refers to the problems where an example that is represented by a \emphsingle instance can be assigned to \emphmore than one category. Until now, most of the researches on multi-label classification have focused on supervised settings whose assumption is that large amount of labeled training data is available. Unfortunately, labeling training example is expensive and time-consuming, especially when it has more than one label. However, in many cases abundant unlabeled data is easy to obtain. Current attempts toward exploiting unlabeled data for multi-label classification work under the \emphtransductive setting, which aim at making predictions on existing unlabeled data while can not generalize to new unseen data. In this paper, the problem of \emphinductive semi-supervised multi-label classification is studied, where a new approach named \textsliMLCU, i.e. \emphinductive Multi-Label Classification with Unlabeled data, is proposed. We formulate the inductive semi-supervised multi-label learning as an optimization problem of learning linear models and ConCave Convex Procedure \textsl(CCCP) is applied to optimize the non-convex optimization problem. Empirical studies on twelve diversified real-word multi-label learning tasks clearly validate the superiority of \textsliMLCU against the other well-established multi-label learning approaches.

RIS

TY  - CPAPER
TI  - Multi-Label Classification with Unlabeled Data: An Inductive Approach
AU  - Le Wu
AU  - Min-Ling Zhang
BT  - Proceedings of the 5th Asian Conference on Machine Learning
DA  - 2013/10/21
ED  - Cheng Soon Ong
ED  - Tu Bao Ho	
ID  - pmlr-v29-Wu13
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 29
SP  - 197
EP  - 212
L1  - http://proceedings.mlr.press/v29/Wu13.pdf
UR  - https://proceedings.mlr.press/v29/Wu13.html
AB  - The problem of multi-label classification has attracted great interests in the last decade. Multi-label classification refers to the problems where an example that is represented by a \emphsingle instance can be assigned to \emphmore than one category. Until now, most of the researches on multi-label classification have focused on supervised settings whose assumption is that large amount of labeled training data is available. Unfortunately, labeling training example is expensive and time-consuming, especially when it has more than one label. However, in many cases abundant unlabeled data is easy to obtain. Current attempts toward exploiting unlabeled data for multi-label classification work under the \emphtransductive setting, which aim at making predictions on existing unlabeled data while can not generalize to new unseen data. In this paper, the problem of \emphinductive semi-supervised multi-label classification is studied, where a new approach named \textsliMLCU, i.e. \emphinductive Multi-Label Classification with Unlabeled data, is proposed. We formulate the inductive semi-supervised multi-label learning as an optimization problem of learning linear models and ConCave Convex Procedure \textsl(CCCP) is applied to optimize the non-convex optimization problem. Empirical studies on twelve diversified real-word multi-label learning tasks clearly validate the superiority of \textsliMLCU against the other well-established multi-label learning approaches.
ER  -

APA

Wu, L. & Zhang, M.. (2013). Multi-Label Classification with Unlabeled Data: An Inductive Approach. Proceedings of the 5th Asian Conference on Machine Learning, in Proceedings of Machine Learning Research 29:197-212 Available from https://proceedings.mlr.press/v29/Wu13.html.

Related Material

Download PDF