Tensor Networks for Probabilistic Sequence Modeling

Jacob Miller; Guillaume Rabusseau; John Terilla

Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, PMLR 130:3079-3087, 2021.

Abstract

Tensor networks are a powerful modeling framework developed for computational many-body physics, which have only recently been applied within machine learning. In this work we utilize a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We first show that u-MPS enable sequence-level parallelism, with length-n sequences able to be evaluated in depth O(log n). We then introduce a novel generative algorithm giving trained u-MPS the ability to efficiently sample from a wide variety of conditional distributions, each one defined by a regular expression. Special cases of this algorithm correspond to autoregressive and fill-in-the-blank sampling, but more complex regular expressions permit the generation of richly structured data in a manner that has no direct analogue in neural generative models. Experiments on sequence modeling with synthetic and real text data show u-MPS outperforming a variety of baselines and effectively generalizing their predictions in the presence of limited data.
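The parallelism claim in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and assumes a particular parameterization: a single core tensor `core` of shape (vocab, D, D) shared across all positions, with hypothetical boundary vectors `alpha` and `omega`. Evaluating a sequence then reduces to a product of n matrices. Because matrix multiplication is associative, that product can be computed by pairwise reduction in about log2(n) rounds rather than n sequential steps. How the resulting scalar is turned into a probability is left out here.

```python
import numpy as np

def sequential_eval(core, alpha, omega, seq):
    """Contract the chain left to right: alpha . A[s1] ... A[sn] . omega."""
    state = alpha
    for s in seq:
        state = state @ core[s]
    return float(state @ omega)

def parallel_eval(core, alpha, omega, seq):
    """Same contraction via pairwise reduction. Returns (value, rounds)."""
    if len(seq) == 0:
        return float(alpha @ omega), 0
    # Gather one D x D matrix per symbol: shape (n, D, D).
    mats = core[np.asarray(seq, dtype=int)]
    rounds = 0
    while mats.shape[0] > 1:
        if mats.shape[0] % 2 == 1:
            # Pad with an identity matrix so every matrix has a partner.
            eye = np.eye(mats.shape[1])[None]
            mats = np.concatenate([mats, eye], axis=0)
        # Multiply neighbours (0,1), (2,3), ... in one batched matmul.
        # Each round halves the chain and preserves left-to-right order.
        mats = np.matmul(mats[0::2], mats[1::2])
        rounds += 1
    return float(alpha @ mats[0] @ omega), rounds

# Illustrative usage with random (untrained) parameters.
rng = np.random.default_rng(0)
vocab, D = 3, 4
core = rng.normal(size=(vocab, D, D))
alpha = rng.normal(size=D)
omega = rng.normal(size=D)
seq = [0, 2, 1, 1, 0, 2, 1, 0]
value_seq = sequential_eval(core, alpha, omega, seq)
value_par, rounds = parallel_eval(core, alpha, omega, seq)
```

Both functions compute the same scalar up to floating-point error. The second needs only ceil(log2(n)) batched multiplication rounds, which is the sense in which evaluation depth is logarithmic in sequence length.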

Cite this Paper

BibTeX


@InProceedings{pmlr-v130-miller21a,
  title = {Tensor Networks for Probabilistic Sequence Modeling},
  author = {Miller, Jacob and Rabusseau, Guillaume and Terilla, John},
  booktitle = {Proceedings of The 24th International Conference on Artificial Intelligence and Statistics},
  pages = {3079--3087},
  year = {2021},
  editor = {Banerjee, Arindam and Fukumizu, Kenji},
  volume = {130},
  series = {Proceedings of Machine Learning Research},
  month = {13--15 Apr},
  publisher = {PMLR},
  pdf = {http://proceedings.mlr.press/v130/miller21a/miller21a.pdf},
  url = {https://proceedings.mlr.press/v130/miller21a.html},
  abstract = {Tensor networks are a powerful modeling framework developed for computational many-body physics, which have only recently been applied within machine learning. In this work we utilize a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We first show that u-MPS enable sequence-level parallelism, with length-n sequences able to be evaluated in depth O(log n). We then introduce a novel generative algorithm giving trained u-MPS the ability to efficiently sample from a wide variety of conditional distributions, each one defined by a regular expression. Special cases of this algorithm correspond to autoregressive and fill-in-the-blank sampling, but more complex regular expressions permit the generation of richly structured data in a manner that has no direct analogue in neural generative models. Experiments on sequence modeling with synthetic and real text data show u-MPS outperforming a variety of baselines and effectively generalizing their predictions in the presence of limited data.}
}

Endnote

%0 Conference Paper
%T Tensor Networks for Probabilistic Sequence Modeling
%A Jacob Miller
%A Guillaume Rabusseau
%A John Terilla
%B Proceedings of The 24th International Conference on Artificial Intelligence and Statistics
%C Proceedings of Machine Learning Research
%D 2021
%E Arindam Banerjee
%E Kenji Fukumizu
%F pmlr-v130-miller21a
%I PMLR
%P 3079--3087
%U https://proceedings.mlr.press/v130/miller21a.html
%V 130
%X Tensor networks are a powerful modeling framework developed for computational many-body physics, which have only recently been applied within machine learning. In this work we utilize a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We first show that u-MPS enable sequence-level parallelism, with length-n sequences able to be evaluated in depth O(log n). We then introduce a novel generative algorithm giving trained u-MPS the ability to efficiently sample from a wide variety of conditional distributions, each one defined by a regular expression. Special cases of this algorithm correspond to autoregressive and fill-in-the-blank sampling, but more complex regular expressions permit the generation of richly structured data in a manner that has no direct analogue in neural generative models. Experiments on sequence modeling with synthetic and real text data show u-MPS outperforming a variety of baselines and effectively generalizing their predictions in the presence of limited data.

APA


Miller, J., Rabusseau, G. & Terilla, J. (2021). Tensor Networks for Probabilistic Sequence Modeling. Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 130:3079-3087. Available from https://proceedings.mlr.press/v130/miller21a.html.
