Scalable Deep Poisson Factor Analysis for Topic Modeling

Zhe Gan; Changyou Chen; Ricardo Henao; David Carlson; Lawrence Carin

Scalable Deep Poisson Factor Analysis for Topic Modeling

Zhe Gan, Changyou Chen, Ricardo Henao, David Carlson, Lawrence Carin

Proceedings of the 32nd International Conference on Machine Learning, PMLR 37:1823-1832, 2015.

Abstract

A new framework for topic modeling is developed, based on deep graphical models, where interactions between topics are inferred through deep latent binary hierarchies. The proposed multi-layer model employs a deep sigmoid belief network or restricted Boltzmann machine, the bottom binary layer of which selects topics for use in a Poisson factor analysis model. Under this setting, topics live on the bottom layer of the model, while the deep specification serves as a flexible prior for revealing topic structure. Scalable inference algorithms are derived by applying Bayesian conditional density filtering algorithm, in addition to extending recently proposed work on stochastic gradient thermostats. Experimental results on several corpora show that the proposed approach readily handles very large collections of text documents, infers structured topic representations, and obtains superior test perplexities when compared with related models.

Cite this Paper

BibTeX


@InProceedings{pmlr-v37-gan15,
  title = 	 {Scalable Deep Poisson Factor Analysis for Topic Modeling},
  author = 	 {Gan, Zhe and Chen, Changyou and Henao, Ricardo and Carlson, David and Carin, Lawrence},
  booktitle = 	 {Proceedings of the 32nd International Conference on Machine Learning},
  pages = 	 {1823--1832},
  year = 	 {2015},
  editor = 	 {Bach, Francis and Blei, David},
  volume = 	 {37},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {Lille, France},
  month = 	 {07--09 Jul},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v37/gan15.pdf},
  url = 	 {https://proceedings.mlr.press/v37/gan15.html},
  abstract = 	 {A new framework for topic modeling is developed, based on deep graphical models, where interactions between topics are inferred through deep latent binary hierarchies. The proposed multi-layer model employs a deep sigmoid belief network or restricted Boltzmann machine, the bottom binary layer of which selects topics for use in a Poisson factor analysis model. Under this setting, topics live on the bottom layer of the model, while the deep specification serves as a flexible prior for revealing topic structure. Scalable inference algorithms are derived by applying Bayesian conditional density filtering algorithm, in addition to extending recently proposed work on stochastic gradient thermostats. Experimental results on several corpora show that the proposed approach readily handles very large collections of text documents, infers structured topic representations, and obtains superior test perplexities when compared with related models.}
}

Endnote

%0 Conference Paper
%T Scalable Deep Poisson Factor Analysis for Topic Modeling
%A Zhe Gan
%A Changyou Chen
%A Ricardo Henao
%A David Carlson
%A Lawrence Carin
%B Proceedings of the 32nd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2015
%E Francis Bach
%E David Blei	
%F pmlr-v37-gan15
%I PMLR
%P 1823--1832
%U https://proceedings.mlr.press/v37/gan15.html
%V 37
%X A new framework for topic modeling is developed, based on deep graphical models, where interactions between topics are inferred through deep latent binary hierarchies. The proposed multi-layer model employs a deep sigmoid belief network or restricted Boltzmann machine, the bottom binary layer of which selects topics for use in a Poisson factor analysis model. Under this setting, topics live on the bottom layer of the model, while the deep specification serves as a flexible prior for revealing topic structure. Scalable inference algorithms are derived by applying Bayesian conditional density filtering algorithm, in addition to extending recently proposed work on stochastic gradient thermostats. Experimental results on several corpora show that the proposed approach readily handles very large collections of text documents, infers structured topic representations, and obtains superior test perplexities when compared with related models.

RIS


TY  - CPAPER
TI  - Scalable Deep Poisson Factor Analysis for Topic Modeling
AU  - Zhe Gan
AU  - Changyou Chen
AU  - Ricardo Henao
AU  - David Carlson
AU  - Lawrence Carin
BT  - Proceedings of the 32nd International Conference on Machine Learning
DA  - 2015/06/01
ED  - Francis Bach
ED  - David Blei	
ID  - pmlr-v37-gan15
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 37
SP  - 1823
EP  - 1832
L1  - http://proceedings.mlr.press/v37/gan15.pdf
UR  - https://proceedings.mlr.press/v37/gan15.html
AB  - A new framework for topic modeling is developed, based on deep graphical models, where interactions between topics are inferred through deep latent binary hierarchies. The proposed multi-layer model employs a deep sigmoid belief network or restricted Boltzmann machine, the bottom binary layer of which selects topics for use in a Poisson factor analysis model. Under this setting, topics live on the bottom layer of the model, while the deep specification serves as a flexible prior for revealing topic structure. Scalable inference algorithms are derived by applying Bayesian conditional density filtering algorithm, in addition to extending recently proposed work on stochastic gradient thermostats. Experimental results on several corpora show that the proposed approach readily handles very large collections of text documents, infers structured topic representations, and obtains superior test perplexities when compared with related models.
ER  -

APA


Gan, Z., Chen, C., Henao, R., Carlson, D. & Carin, L.. (2015). Scalable Deep Poisson Factor Analysis for Topic Modeling. Proceedings of the 32nd International Conference on Machine Learning, in Proceedings of Machine Learning Research 37:1823-1832 Available from https://proceedings.mlr.press/v37/gan15.html.

Scalable Deep Poisson Factor Analysis for Topic Modeling

Abstract

Cite this Paper

Related Material