Large-Margin Softmax Loss for Convolutional Neural Networks

Weiyang Liu; Yandong Wen; Zhiding Yu; Meng Yang

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:507-516, 2016.

Abstract

Cross-entropy loss together with softmax is arguably one of the most commonly used supervision components in convolutional neural networks (CNNs). Despite its simplicity, popularity and excellent performance, the component does not explicitly encourage discriminative learning of features. In this paper, we propose a generalized large-margin softmax (L-Softmax) loss which explicitly encourages intra-class compactness and inter-class separability between learned features. Moreover, L-Softmax can not only adjust the desired margin but also avoid overfitting. We also show that the L-Softmax loss can be optimized by typical stochastic gradient descent. Extensive experiments on four benchmark datasets demonstrate that the deeply learned features with L-Softmax loss become more discriminative, hence significantly boosting the performance on a variety of visual classification and verification tasks.
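
The abstract does not state the loss formula, so the following is only a minimal sketch of how an adjustable angular margin can be added to softmax cross-entropy. It assumes the target-class logit ||W_y|| ||x|| cos(theta) is replaced by ||W_y|| ||x|| psi(theta), where psi(theta) = (-1)^k cos(m * theta) - 2k on [k * pi / m, (k + 1) * pi / m], and that bias terms are omitted. The names psi, l_softmax_loss, and the integer margin parameter m are illustrative choices, not identifiers taken from this page.

```python
import math


def psi(theta, m):
    """Decreasing stand-in for cos(theta) on [0, pi].

    psi(theta) = (-1)^k * cos(m * theta) - 2k for theta in
    [k * pi / m, (k + 1) * pi / m]; with m = 1 it equals cos(theta).
    """
    k = min(int(theta * m / math.pi), m - 1)
    return (-1) ** k * math.cos(m * theta) - 2 * k


def l_softmax_loss(x, weights, label, m=2):
    """Margin-augmented cross-entropy for one sample (assumed form, no bias)."""
    x_norm = math.sqrt(sum(v * v for v in x))
    logits = []
    for j, w in enumerate(weights):
        w_norm = math.sqrt(sum(v * v for v in w))
        dot = sum(a * b for a, b in zip(w, x))
        if j == label and w_norm > 0 and x_norm > 0:
            # Only the target class gets the harder, margin-scaled angle.
            cos_t = max(-1.0, min(1.0, dot / (w_norm * x_norm)))
            logits.append(w_norm * x_norm * psi(math.acos(cos_t), m))
        else:
            logits.append(dot)
    # Numerically stable log-sum-exp over all class logits.
    top = max(logits)
    log_sum = top + math.log(sum(math.exp(z - top) for z in logits))
    return log_sum - logits[label]


x = [1.0, 1.0]
weights = [[1.0, 0.0], [0.0, 1.0]]
plain = l_softmax_loss(x, weights, label=0, m=1)   # m = 1: ordinary softmax cross-entropy
margin = l_softmax_loss(x, weights, label=0, m=2)  # m = 2: larger loss for the same sample
```

In this sketch a larger m penalizes the same sample more heavily, so the feature has to sit at a smaller angle to its class weight vector to reach the same loss. This is one way to read the "adjustable margin" described above.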

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-liud16,
  title = 	 {Large-Margin Softmax Loss for Convolutional Neural Networks},
  author = 	 {Liu, Weiyang and Wen, Yandong and Yu, Zhiding and Yang, Meng},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {507--516},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/liud16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/liud16.html},
  abstract = 	 {Cross-entropy loss together with softmax is arguably one of the most commonly used supervision components in convolutional neural networks (CNNs). Despite its simplicity, popularity and excellent performance, the component does not explicitly encourage discriminative learning of features. In this paper, we propose a generalized large-margin softmax (L-Softmax) loss which explicitly encourages intra-class compactness and inter-class separability between learned features. Moreover, L-Softmax can not only adjust the desired margin but also avoid overfitting. We also show that the L-Softmax loss can be optimized by typical stochastic gradient descent. Extensive experiments on four benchmark datasets demonstrate that the deeply learned features with L-Softmax loss become more discriminative, hence significantly boosting the performance on a variety of visual classification and verification tasks.}
}

Endnote

%0 Conference Paper
%T Large-Margin Softmax Loss for Convolutional Neural Networks
%A Weiyang Liu
%A Yandong Wen
%A Zhiding Yu
%A Meng Yang
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger
%F pmlr-v48-liud16
%I PMLR
%P 507--516
%U https://proceedings.mlr.press/v48/liud16.html
%V 48
%X Cross-entropy loss together with softmax is arguably one of the most commonly used supervision components in convolutional neural networks (CNNs). Despite its simplicity, popularity and excellent performance, the component does not explicitly encourage discriminative learning of features. In this paper, we propose a generalized large-margin softmax (L-Softmax) loss which explicitly encourages intra-class compactness and inter-class separability between learned features. Moreover, L-Softmax can not only adjust the desired margin but also avoid overfitting. We also show that the L-Softmax loss can be optimized by typical stochastic gradient descent. Extensive experiments on four benchmark datasets demonstrate that the deeply learned features with L-Softmax loss become more discriminative, hence significantly boosting the performance on a variety of visual classification and verification tasks.

RIS


TY  - CPAPER
TI  - Large-Margin Softmax Loss for Convolutional Neural Networks
AU  - Weiyang Liu
AU  - Yandong Wen
AU  - Zhiding Yu
AU  - Meng Yang
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger
ID  - pmlr-v48-liud16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 507
EP  - 516
L1  - http://proceedings.mlr.press/v48/liud16.pdf
UR  - https://proceedings.mlr.press/v48/liud16.html
AB  - Cross-entropy loss together with softmax is arguably one of the most commonly used supervision components in convolutional neural networks (CNNs). Despite its simplicity, popularity and excellent performance, the component does not explicitly encourage discriminative learning of features. In this paper, we propose a generalized large-margin softmax (L-Softmax) loss which explicitly encourages intra-class compactness and inter-class separability between learned features. Moreover, L-Softmax can not only adjust the desired margin but also avoid overfitting. We also show that the L-Softmax loss can be optimized by typical stochastic gradient descent. Extensive experiments on four benchmark datasets demonstrate that the deeply learned features with L-Softmax loss become more discriminative, hence significantly boosting the performance on a variety of visual classification and verification tasks.
ER  -

APA


Liu, W., Wen, Y., Yu, Z. & Yang, M. (2016). Large-Margin Softmax Loss for Convolutional Neural Networks. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:507-516. Available from https://proceedings.mlr.press/v48/liud16.html.
