Extracting Invariant Features From Images Using An Equivariant Autoencoder

Denis Kuzminykh, Daniil Polykovskiy, Alexander Zhebrak
Proceedings of The 10th Asian Conference on Machine Learning, PMLR 95:438-453, 2018.

Abstract

Convolutional Neural Networks achieve state of the art results in many image recognition tasks. While their structure makes predictions invariant to small translations, some recognition tasks require invariance to other transformations, like rotation and reflection. We apply group convolutions to build an Equivariant Autoencoder with embeddings that change predictably under the specified set of transformations. We then introduce two approaches to extracting invariant features from these embeddings—Gram Pooling and Equivariant Attention. These two methods separate transformation-relevant information from all other image features. We use obtained embeddings in classification and clustering tasks and show an improvement of the classification quality on the learned embeddings compared to pure autoencoder and average pooling method. A visualization of the learned manifold shows that objects of the same class tend to cluster together, which was not observed for the pure autoencoder embeddings.

Cite this Paper


BibTeX
@InProceedings{pmlr-v95-kuzminykh18a, title = {Extracting Invariant Features From Images Using An Equivariant Autoencoder}, author = {Kuzminykh, Denis and Polykovskiy, Daniil and Zhebrak, Alexander}, booktitle = {Proceedings of The 10th Asian Conference on Machine Learning}, pages = {438--453}, year = {2018}, editor = {Zhu, Jun and Takeuchi, Ichiro}, volume = {95}, series = {Proceedings of Machine Learning Research}, month = {14--16 Nov}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v95/kuzminykh18a/kuzminykh18a.pdf}, url = {https://proceedings.mlr.press/v95/kuzminykh18a.html}, abstract = {Convolutional Neural Networks achieve state of the art results in many image recognition tasks. While their structure makes predictions invariant to small translations, some recognition tasks require invariance to other transformations, like rotation and reflection. We apply group convolutions to build an Equivariant Autoencoder with embeddings that change predictably under the specified set of transformations. We then introduce two approaches to extracting invariant features from these embeddings—Gram Pooling and Equivariant Attention. These two methods separate transformation-relevant information from all other image features. We use obtained embeddings in classification and clustering tasks and show an improvement of the classification quality on the learned embeddings compared to pure autoencoder and average pooling method. A visualization of the learned manifold shows that objects of the same class tend to cluster together, which was not observed for the pure autoencoder embeddings.} }
Endnote
%0 Conference Paper %T Extracting Invariant Features From Images Using An Equivariant Autoencoder %A Denis Kuzminykh %A Daniil Polykovskiy %A Alexander Zhebrak %B Proceedings of The 10th Asian Conference on Machine Learning %C Proceedings of Machine Learning Research %D 2018 %E Jun Zhu %E Ichiro Takeuchi %F pmlr-v95-kuzminykh18a %I PMLR %P 438--453 %U https://proceedings.mlr.press/v95/kuzminykh18a.html %V 95 %X Convolutional Neural Networks achieve state of the art results in many image recognition tasks. While their structure makes predictions invariant to small translations, some recognition tasks require invariance to other transformations, like rotation and reflection. We apply group convolutions to build an Equivariant Autoencoder with embeddings that change predictably under the specified set of transformations. We then introduce two approaches to extracting invariant features from these embeddings—Gram Pooling and Equivariant Attention. These two methods separate transformation-relevant information from all other image features. We use obtained embeddings in classification and clustering tasks and show an improvement of the classification quality on the learned embeddings compared to pure autoencoder and average pooling method. A visualization of the learned manifold shows that objects of the same class tend to cluster together, which was not observed for the pure autoencoder embeddings.
APA
Kuzminykh, D., Polykovskiy, D. & Zhebrak, A.. (2018). Extracting Invariant Features From Images Using An Equivariant Autoencoder. Proceedings of The 10th Asian Conference on Machine Learning, in Proceedings of Machine Learning Research 95:438-453 Available from https://proceedings.mlr.press/v95/kuzminykh18a.html.

Related Material