PAC learning of Probabilistic Automaton based on the Method of Moments

Hadrien Glaude; Olivier Pietquin

PAC learning of Probabilistic Automaton based on the Method of Moments

Hadrien Glaude, Olivier Pietquin

Proceedings of The 33rd International Conference on Machine Learning, PMLR 48:820-829, 2016.

Abstract

Probabilitic Finite Automata (PFA) are generative graphical models that define distributions with latent variables over finite sequences of symbols, a.k.a. stochastic languages. Traditionally, unsupervised learning of PFA is performed through algorithms that iteratively improves the likelihood like the Expectation-Maximization (EM) algorithm. Recently, learning algorithms based on the so-called Method of Moments (MoM) have been proposed as a much faster alternative that comes with PAC-style guarantees. However, these algorithms do not ensure the learnt automata to model a proper distribution, limiting their applicability and preventing them to serve as an initialization to iterative algorithms. In this paper, we propose a new MoM-based algorithm with PAC-style guarantees that learns automata defining proper distributions. We assess its performances on synthetic problems from the PAutomaC challenge and real datasets extracted from Wikipedia against previous MoM-based algorithms and EM algorithm.

Cite this Paper

BibTeX


@InProceedings{pmlr-v48-glaude16,
  title = 	 {PAC learning of Probabilistic Automaton based on the Method of Moments},
  author = 	 {Glaude, Hadrien and Pietquin, Olivier},
  booktitle = 	 {Proceedings of The 33rd International Conference on Machine Learning},
  pages = 	 {820--829},
  year = 	 {2016},
  editor = 	 {Balcan, Maria Florina and Weinberger, Kilian Q.},
  volume = 	 {48},
  series = 	 {Proceedings of Machine Learning Research},
  address = 	 {New York, New York, USA},
  month = 	 {20--22 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v48/glaude16.pdf},
  url = 	 {https://proceedings.mlr.press/v48/glaude16.html},
  abstract = 	 {Probabilitic Finite Automata (PFA) are generative graphical models that define distributions with latent variables over finite sequences of symbols, a.k.a. stochastic languages. Traditionally, unsupervised learning of PFA is performed through algorithms that iteratively improves the likelihood like the Expectation-Maximization (EM) algorithm. Recently, learning algorithms based on the so-called Method of Moments (MoM) have been proposed as a much faster alternative that comes with PAC-style guarantees. However, these algorithms do not ensure the learnt automata to model a proper distribution, limiting their applicability and preventing them to serve as an initialization to iterative algorithms. In this paper, we propose a new MoM-based algorithm with PAC-style guarantees that learns automata defining proper distributions. We assess its performances on synthetic problems from the PAutomaC challenge and real datasets extracted from Wikipedia against previous MoM-based algorithms and EM algorithm.}
}

Endnote

%0 Conference Paper
%T PAC learning of Probabilistic Automaton based on the Method of Moments
%A Hadrien Glaude
%A Olivier Pietquin
%B Proceedings of The 33rd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2016
%E Maria Florina Balcan
%E Kilian Q. Weinberger	
%F pmlr-v48-glaude16
%I PMLR
%P 820--829
%U https://proceedings.mlr.press/v48/glaude16.html
%V 48
%X Probabilitic Finite Automata (PFA) are generative graphical models that define distributions with latent variables over finite sequences of symbols, a.k.a. stochastic languages. Traditionally, unsupervised learning of PFA is performed through algorithms that iteratively improves the likelihood like the Expectation-Maximization (EM) algorithm. Recently, learning algorithms based on the so-called Method of Moments (MoM) have been proposed as a much faster alternative that comes with PAC-style guarantees. However, these algorithms do not ensure the learnt automata to model a proper distribution, limiting their applicability and preventing them to serve as an initialization to iterative algorithms. In this paper, we propose a new MoM-based algorithm with PAC-style guarantees that learns automata defining proper distributions. We assess its performances on synthetic problems from the PAutomaC challenge and real datasets extracted from Wikipedia against previous MoM-based algorithms and EM algorithm.

RIS


TY  - CPAPER
TI  - PAC learning of Probabilistic Automaton based on the Method of Moments
AU  - Hadrien Glaude
AU  - Olivier Pietquin
BT  - Proceedings of The 33rd International Conference on Machine Learning
DA  - 2016/06/11
ED  - Maria Florina Balcan
ED  - Kilian Q. Weinberger	
ID  - pmlr-v48-glaude16
PB  - PMLR
DP  - Proceedings of Machine Learning Research
VL  - 48
SP  - 820
EP  - 829
L1  - http://proceedings.mlr.press/v48/glaude16.pdf
UR  - https://proceedings.mlr.press/v48/glaude16.html
AB  - Probabilitic Finite Automata (PFA) are generative graphical models that define distributions with latent variables over finite sequences of symbols, a.k.a. stochastic languages. Traditionally, unsupervised learning of PFA is performed through algorithms that iteratively improves the likelihood like the Expectation-Maximization (EM) algorithm. Recently, learning algorithms based on the so-called Method of Moments (MoM) have been proposed as a much faster alternative that comes with PAC-style guarantees. However, these algorithms do not ensure the learnt automata to model a proper distribution, limiting their applicability and preventing them to serve as an initialization to iterative algorithms. In this paper, we propose a new MoM-based algorithm with PAC-style guarantees that learns automata defining proper distributions. We assess its performances on synthetic problems from the PAutomaC challenge and real datasets extracted from Wikipedia against previous MoM-based algorithms and EM algorithm.
ER  -

APA


Glaude, H. & Pietquin, O.. (2016). PAC learning of Probabilistic Automaton based on the Method of Moments. Proceedings of The 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research 48:820-829 Available from https://proceedings.mlr.press/v48/glaude16.html.

PAC learning of Probabilistic Automaton based on the Method of Moments

Abstract

Cite this Paper

Related Material