Convexified Convolutional Neural Networks

Yuchen Zhang, Percy Liang, Martin J. Wainwright
Proceedings of the 34th International Conference on Machine Learning, PMLR 70:4044-4053, 2017.

Abstract

We describe the class of convexified convolutional neural networks (CCNNs), which capture the parameter sharing of convolutional neural networks in a convex manner. By representing the nonlinear convolutional filters as vectors in a reproducing kernel Hilbert space, we can collect the CNN parameters into a low-rank matrix, and relaxing the rank constraint yields a convex optimization problem. For learning two-layer convolutional neural networks, we prove that the generalization error obtained by a convexified CNN converges to that of the best possible CNN. For learning deeper networks, we train CCNNs in a layer-wise manner. Empirically, CCNNs achieve competitive or better performance than CNNs trained by backpropagation, SVMs, fully-connected neural networks, stacked denoising auto-encoders, and other baseline methods.
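As a rough illustration of the convex relaxation sketched in the abstract, the NumPy snippet below lifts image patches into an approximate RKHS via random Fourier features, collects the per-class weights into a single matrix, and replaces the low-rank constraint with a nuclear-norm proximal step (singular-value soft-thresholding). This is a minimal sketch under stated assumptions: the function names, the random-feature approximation, the multiclass logistic loss, and the solver settings are illustrative choices, not the paper's exact construction.

# Minimal sketch of the convex relaxation behind CCNNs (illustrative, not the
# paper's exact pipeline): kernelized patch features + a nuclear-norm-relaxed
# low-rank parameter matrix, trained by proximal gradient descent.
import numpy as np

def rbf_random_features(patches, dim=128, gamma=1.0, seed=0):
    """Random Fourier features approximating an RBF-kernel RKHS embedding.
    patches: (..., d) array of flattened image patches."""
    rng = np.random.default_rng(seed)
    d = patches.shape[-1]
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(d, dim))
    b = rng.uniform(0.0, 2.0 * np.pi, size=dim)
    return np.sqrt(2.0 / dim) * np.cos(patches @ W + b)

def svt(A, tau):
    """Singular-value soft-thresholding: proximal operator of tau * nuclear norm."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vt

def train_ccnn_layer(Z, y, n_classes, lam=1e-2, lr=0.5, iters=300):
    """Z: (n, P, D) RKHS features for the P patches of each of n images.
    y: (n,) integer labels. Returns A of shape (K, P, D); the score of class k
    on image i is the Frobenius inner product <A[k], Z[i]>, which is linear in
    A, so the training objective below is convex."""
    n, P, D = Z.shape
    A = np.zeros((n_classes, P, D))
    Y = np.eye(n_classes)[y]                          # one-hot labels
    for _ in range(iters):
        scores = np.einsum('npd,kpd->nk', Z, A)       # linear model in A
        probs = np.exp(scores - scores.max(axis=1, keepdims=True))
        probs /= probs.sum(axis=1, keepdims=True)
        grad = np.einsum('nk,npd->kpd', probs - Y, Z) / n   # logistic-loss gradient
        A -= lr * grad
        # relax rank(A) <= r to a nuclear-norm penalty on the flattened matrix
        A = svt(A.reshape(n_classes * P, D), lr * lam).reshape(n_classes, P, D)
    return A

In this low-rank view, approximate convolutional filters and output weights could then be read off from the top singular vectors of the learned matrix, though the exact recovery step in the paper may differ from this sketch.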

Cite this Paper


BibTeX
@InProceedings{pmlr-v70-zhang17f,
  title     = {Convexified Convolutional Neural Networks},
  author    = {Yuchen Zhang and Percy Liang and Martin J. Wainwright},
  booktitle = {Proceedings of the 34th International Conference on Machine Learning},
  pages     = {4044--4053},
  year      = {2017},
  editor    = {Precup, Doina and Teh, Yee Whye},
  volume    = {70},
  series    = {Proceedings of Machine Learning Research},
  month     = {06--11 Aug},
  publisher = {PMLR},
  pdf       = {http://proceedings.mlr.press/v70/zhang17f/zhang17f.pdf},
  url       = {https://proceedings.mlr.press/v70/zhang17f.html},
  abstract  = {We describe the class of convexified convolutional neural networks (CCNNs), which capture the parameter sharing of convolutional neural networks in a convex manner. By representing the nonlinear convolutional filters as vectors in a reproducing kernel Hilbert space, the CNN parameters can be represented as a low-rank matrix, which can be relaxed to obtain a convex optimization problem. For learning two-layer convolutional neural networks, we prove that the generalization error obtained by a convexified CNN converges to that of the best possible CNN. For learning deeper networks, we train CCNNs in a layer-wise manner. Empirically, CCNNs achieve competitive or better performance than CNNs trained by backpropagation, SVMs, fully-connected neural networks, stacked denoising auto-encoders, and other baseline methods.}
}
Endnote
%0 Conference Paper
%T Convexified Convolutional Neural Networks
%A Yuchen Zhang
%A Percy Liang
%A Martin J. Wainwright
%B Proceedings of the 34th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2017
%E Doina Precup
%E Yee Whye Teh
%F pmlr-v70-zhang17f
%I PMLR
%P 4044--4053
%U https://proceedings.mlr.press/v70/zhang17f.html
%V 70
%X We describe the class of convexified convolutional neural networks (CCNNs), which capture the parameter sharing of convolutional neural networks in a convex manner. By representing the nonlinear convolutional filters as vectors in a reproducing kernel Hilbert space, the CNN parameters can be represented as a low-rank matrix, which can be relaxed to obtain a convex optimization problem. For learning two-layer convolutional neural networks, we prove that the generalization error obtained by a convexified CNN converges to that of the best possible CNN. For learning deeper networks, we train CCNNs in a layer-wise manner. Empirically, CCNNs achieve competitive or better performance than CNNs trained by backpropagation, SVMs, fully-connected neural networks, stacked denoising auto-encoders, and other baseline methods.
APA
Zhang, Y., Liang, P. & Wainwright, M.J. (2017). Convexified Convolutional Neural Networks. Proceedings of the 34th International Conference on Machine Learning, in Proceedings of Machine Learning Research 70:4044-4053. Available from https://proceedings.mlr.press/v70/zhang17f.html.
