ControlVAE: Controllable Variational Autoencoder

Huajie Shao; Shuochao Yao; Dachun Sun; Aston Zhang; Shengzhong Liu; Dongxin Liu; Jun Wang; Tarek Abdelzaher

ControlVAE: Controllable Variational Autoencoder

Huajie Shao, Shuochao Yao, Dachun Sun, Aston Zhang, Shengzhong Liu, Dongxin Liu, Jun Wang, Tarek Abdelzaher

Proceedings of the 37th International Conference on Machine Learning, PMLR 119:8655-8664, 2020.

Abstract

Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models may suffer from KL vanishing in language modeling and low reconstruction quality for disentangling. To address these issues, we propose a novel controllable variational autoencoder framework, ControlVAE, that combines a controller, inspired by automatic control theory, with the basic VAE to improve the performance of resulting generative models. Specifically, we design a new non-linear PI controller, a variant of the proportional-integral-derivative (PID) control, to automatically tune the hyperparameter (weight) added in the VAE objective using the output KL-divergence as feedback during model training. The framework is evaluated using three applications; namely, language modeling, disentangled representation learning, and image generation. The results show that ControlVAE can achieve much better reconstruction quality than the competitive methods for the comparable disentanglement performance. For language modeling, it not only averts the KL-vanishing, but also improves the diversity of generated text. Finally, we also demonstrate that ControlVAE improves the reconstruction quality for image generation compared to the original VAE.

Cite this Paper

BibTeX


@InProceedings{pmlr-v119-shao20b,
  title = 	 {{C}ontrol{VAE}: Controllable Variational Autoencoder},
  author =       {Shao, Huajie and Yao, Shuochao and Sun, Dachun and Zhang, Aston and Liu, Shengzhong and Liu, Dongxin and Wang, Jun and Abdelzaher, Tarek},
  booktitle = 	 {Proceedings of the 37th International Conference on Machine Learning},
  pages = 	 {8655--8664},
  year = 	 {2020},
  editor = 	 {III, Hal Daumé and Singh, Aarti},
  volume = 	 {119},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--18 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v119/shao20b/shao20b.pdf},
  url = 	 {https://proceedings.mlr.press/v119/shao20b.html},
  abstract = 	 {Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models may suffer from KL vanishing in language modeling and low reconstruction quality for disentangling. To address these issues, we propose a novel controllable variational autoencoder framework, ControlVAE, that combines a controller, inspired by automatic control theory, with the basic VAE to improve the performance of resulting generative models. Specifically, we design a new non-linear PI controller, a variant of the proportional-integral-derivative (PID) control, to automatically tune the hyperparameter (weight) added in the VAE objective using the output KL-divergence as feedback during model training. The framework is evaluated using three applications; namely, language modeling, disentangled representation learning, and image generation. The results show that ControlVAE can achieve much better reconstruction quality than the competitive methods for the comparable disentanglement performance. For language modeling, it not only averts the KL-vanishing, but also improves the diversity of generated text. Finally, we also demonstrate that ControlVAE improves the reconstruction quality for image generation compared to the original VAE.}
}

Endnote

%0 Conference Paper
%T ControlVAE: Controllable Variational Autoencoder
%A Huajie Shao
%A Shuochao Yao
%A Dachun Sun
%A Aston Zhang
%A Shengzhong Liu
%A Dongxin Liu
%A Jun Wang
%A Tarek Abdelzaher
%B Proceedings of the 37th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Hal Daumé III
%E Aarti Singh	
%F pmlr-v119-shao20b
%I PMLR
%P 8655--8664
%U https://proceedings.mlr.press/v119/shao20b.html
%V 119
%X Variational Autoencoders (VAE) and their variants have been widely used in a variety of applications, such as dialog generation, image generation and disentangled representation learning. However, the existing VAE models may suffer from KL vanishing in language modeling and low reconstruction quality for disentangling. To address these issues, we propose a novel controllable variational autoencoder framework, ControlVAE, that combines a controller, inspired by automatic control theory, with the basic VAE to improve the performance of resulting generative models. Specifically, we design a new non-linear PI controller, a variant of the proportional-integral-derivative (PID) control, to automatically tune the hyperparameter (weight) added in the VAE objective using the output KL-divergence as feedback during model training. The framework is evaluated using three applications; namely, language modeling, disentangled representation learning, and image generation. The results show that ControlVAE can achieve much better reconstruction quality than the competitive methods for the comparable disentanglement performance. For language modeling, it not only averts the KL-vanishing, but also improves the diversity of generated text. Finally, we also demonstrate that ControlVAE improves the reconstruction quality for image generation compared to the original VAE.

APA


Shao, H., Yao, S., Sun, D., Zhang, A., Liu, S., Liu, D., Wang, J. & Abdelzaher, T.. (2020). ControlVAE: Controllable Variational Autoencoder. Proceedings of the 37th International Conference on Machine Learning, in Proceedings of Machine Learning Research 119:8655-8664 Available from https://proceedings.mlr.press/v119/shao20b.html.

ControlVAE: Controllable Variational Autoencoder

Abstract

Cite this Paper

Related Material