Weighted Finite Automata with Failure Transitions: Algorithms and Applications

Cyril Allauzen

Weighted Finite Automata with Failure Transitions: Algorithms and Applications

Cyril Allauzen

Proceedings of 16th edition of the International Conference on Grammatical Inference, PMLR 217:6-6, 2023.

Abstract

Weighted finite automata (WFA) are used in many applications including speech recognition, speech synthesis, machine translation, computational biology, image processing, and optical character recognition. Such applications often have strict time and memory requirements, so efficient representations and algorithms are paramount. We examine one useful technique, the use of failure transitions, to represent automata compactly. A failure transition is taken only when no immediate match to the input is possible at a given state. Automata with failure transitions, initially introduced for string matching problems, have found wider use including compactly representing language, pronunciation, transliteration and semantic models. In this talk, we will address the extension of several weighted finite automata algorithms to automata with failure transitions (

$\Phi$ -WFAs). Efficient algorithms to intersect two

$\Phi$ -WFAs, to remove failure transitions, to trim, and to compute the shortest distance in a

$\Phi$ -WFA will be presented. We will demonstrate the application of some of these algorithms on two language modeling tasks: the distillation of arbitrary probabilistic models as weighted finite automata with failure transitions and the federated learning of n-gram language models. We will show the relevance of these methods to the privacy-preserving training of language models for virtual keyboard applications for mobile devices. This talk covers work in collaboration with Michael Riley, Ananda Theertha Suresh, Brian Roark, Vlad Schogol, Mingqing Chen, Rajiv Mathews, Adeline Wong, and Françoise Beaufays.

Cite this Paper

BibTeX


@InProceedings{pmlr-v217-allauzen23a,
  title = 	 {Weighted Finite Automata with Failure Transitions: Algorithms and Applications},
  author =       {Allauzen, Cyril},
  booktitle = 	 {Proceedings of 16th edition of the International Conference on Grammatical Inference},
  pages = 	 {6--6},
  year = 	 {2023},
  editor = 	 {Coste, François and Ouardi, Faissal and Rabusseau, Guillaume},
  volume = 	 {217},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {10--13 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v217/allauzen23a/allauzen23a.pdf},
  url = 	 {https://proceedings.mlr.press/v217/allauzen23a.html},
  abstract = 	 {Weighted finite automata (WFA) are used in many applications including speech recognition, speech synthesis, machine translation, computational biology, image processing, and optical character recognition. Such applications often have strict time and memory requirements, so efficient representations and algorithms are paramount. We examine one useful technique, the use of failure transitions, to represent automata compactly. A failure transition is taken only when no immediate match to the input is possible at a given state. Automata with failure transitions, initially introduced for string matching problems, have found wider use including compactly representing language, pronunciation, transliteration and semantic models. In this talk, we will address the extension of several weighted finite automata algorithms to automata with failure transitions ($\Phi$-WFAs). Efficient algorithms to intersect two $\Phi$-WFAs, to remove failure transitions, to trim, and to compute the shortest distance in a $\Phi$-WFA will be presented. We will demonstrate the application of some of these algorithms on two language modeling tasks: the distillation of arbitrary probabilistic models as weighted finite automata with failure transitions and the federated learning of n-gram language models. We will show the relevance of these methods to the privacy-preserving training of language models for virtual keyboard applications for mobile devices. This talk covers work in collaboration with Michael Riley, Ananda Theertha Suresh, Brian Roark, Vlad Schogol, Mingqing Chen, Rajiv Mathews, Adeline Wong, and Françoise Beaufays.}
}

Endnote

%0 Conference Paper
%T Weighted Finite Automata with Failure Transitions: Algorithms and Applications
%A Cyril Allauzen
%B Proceedings of 16th edition of the International Conference on Grammatical Inference
%C Proceedings of Machine Learning Research
%D 2023
%E François Coste
%E Faissal Ouardi
%E Guillaume Rabusseau	
%F pmlr-v217-allauzen23a
%I PMLR
%P 6--6
%U https://proceedings.mlr.press/v217/allauzen23a.html
%V 217
%X Weighted finite automata (WFA) are used in many applications including speech recognition, speech synthesis, machine translation, computational biology, image processing, and optical character recognition. Such applications often have strict time and memory requirements, so efficient representations and algorithms are paramount. We examine one useful technique, the use of failure transitions, to represent automata compactly. A failure transition is taken only when no immediate match to the input is possible at a given state. Automata with failure transitions, initially introduced for string matching problems, have found wider use including compactly representing language, pronunciation, transliteration and semantic models. In this talk, we will address the extension of several weighted finite automata algorithms to automata with failure transitions ($\Phi$-WFAs). Efficient algorithms to intersect two $\Phi$-WFAs, to remove failure transitions, to trim, and to compute the shortest distance in a $\Phi$-WFA will be presented. We will demonstrate the application of some of these algorithms on two language modeling tasks: the distillation of arbitrary probabilistic models as weighted finite automata with failure transitions and the federated learning of n-gram language models. We will show the relevance of these methods to the privacy-preserving training of language models for virtual keyboard applications for mobile devices. This talk covers work in collaboration with Michael Riley, Ananda Theertha Suresh, Brian Roark, Vlad Schogol, Mingqing Chen, Rajiv Mathews, Adeline Wong, and Françoise Beaufays.

APA


Allauzen, C.. (2023). Weighted Finite Automata with Failure Transitions: Algorithms and Applications. Proceedings of 16th edition of the International Conference on Grammatical Inference, in Proceedings of Machine Learning Research 217:6-6 Available from https://proceedings.mlr.press/v217/allauzen23a.html.

Related Material

Download PDF