Structured Neural Topic Models for Reviews

Babak Esmaeili; Hongyi Huang; Byron Wallace; Jan-Willem van de Meent

Structured Neural Topic Models for Reviews

Babak Esmaeili, Hongyi Huang, Byron Wallace, Jan-Willem van de Meent

Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, PMLR 89:3429-3439, 2019.

Abstract

We present Variational Aspect-based Latent Topic Allocation (VALTA), a family of autoencoding topic models that learn aspect-based representations of reviews. VALTA defines a user-item encoder that maps bag-of-words vectors for combined reviews associated with each paired user and item onto structured embeddings, which in turn define per-aspect topic weights. We model individual reviews in a structured manner by inferring an aspect assignment for each sentence in a given review, where the per-aspect topic weights obtained by the user-item encoder serve to define a mixture over topics, conditioned on the aspect. The result is an autoencoding neural topic model for reviews, which can be trained in a fully unsupervised manner to learn topics that are structured into aspects. Experimental evaluation on large number of datasets demonstrates that aspects are interpretable, yield higher coherence scores than non-structured autoencoding topic model variants, and can be utilized to perform aspect-based comparison and genre discovery.

Cite this Paper

BibTeX


@InProceedings{pmlr-v89-esmaeili19b,
  title = 	 {Structured Neural Topic Models for Reviews},
  author =       {Esmaeili, Babak and Huang, Hongyi and Wallace, Byron and Meent, Jan-Willem van de},
  booktitle = 	 {Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics},
  pages = 	 {3429--3439},
  year = 	 {2019},
  editor = 	 {Chaudhuri, Kamalika and Sugiyama, Masashi},
  volume = 	 {89},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {16--18 Apr},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v89/esmaeili19b/esmaeili19b.pdf},
  url = 	 {https://proceedings.mlr.press/v89/esmaeili19b.html},
  abstract = 	 {We present Variational Aspect-based Latent Topic Allocation (VALTA), a family of autoencoding topic models that learn aspect-based representations of reviews. VALTA defines a user-item encoder that maps bag-of-words vectors for combined reviews associated with each paired user and item onto structured embeddings, which in turn define per-aspect topic weights. We model individual reviews in a structured manner by inferring an aspect assignment for each sentence in a given review, where the per-aspect topic weights obtained by the user-item encoder serve to define a mixture over topics, conditioned on the aspect. The result is an autoencoding neural topic model for reviews, which can be trained in a fully unsupervised manner to learn topics that are structured into aspects. Experimental evaluation on large number of datasets demonstrates that aspects are interpretable, yield higher coherence scores than non-structured autoencoding topic model variants, and can be utilized to perform aspect-based comparison and genre discovery.}
}

Endnote

%0 Conference Paper
%T Structured Neural Topic Models for Reviews
%A Babak Esmaeili
%A Hongyi Huang
%A Byron Wallace
%A Jan-Willem van de Meent
%B Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2019
%E Kamalika Chaudhuri
%E Masashi Sugiyama	
%F pmlr-v89-esmaeili19b
%I PMLR
%P 3429--3439
%U https://proceedings.mlr.press/v89/esmaeili19b.html
%V 89
%X We present Variational Aspect-based Latent Topic Allocation (VALTA), a family of autoencoding topic models that learn aspect-based representations of reviews. VALTA defines a user-item encoder that maps bag-of-words vectors for combined reviews associated with each paired user and item onto structured embeddings, which in turn define per-aspect topic weights. We model individual reviews in a structured manner by inferring an aspect assignment for each sentence in a given review, where the per-aspect topic weights obtained by the user-item encoder serve to define a mixture over topics, conditioned on the aspect. The result is an autoencoding neural topic model for reviews, which can be trained in a fully unsupervised manner to learn topics that are structured into aspects. Experimental evaluation on large number of datasets demonstrates that aspects are interpretable, yield higher coherence scores than non-structured autoencoding topic model variants, and can be utilized to perform aspect-based comparison and genre discovery.

APA


Esmaeili, B., Huang, H., Wallace, B. & Meent, J.v.d.. (2019). Structured Neural Topic Models for Reviews. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 89:3429-3439 Available from https://proceedings.mlr.press/v89/esmaeili19b.html.

Structured Neural Topic Models for Reviews

Abstract

Cite this Paper

Related Material