On Structure Priors for Learning Bayesian Networks

Ralf Eggeling, Jussi Viinikka, Aleksis Vuoksenmaa, Mikko Koivisto
Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, PMLR 89:1687-1695, 2019.

Abstract

To learn a Bayesian network structure from data, one popular approach is to maximize a decomposable likelihood-based score. While various scores have been proposed, they usually assume a uniform prior, or “penalty,” over the possible directed acyclic graphs (DAGs); relatively little attention has been paid to alternative priors. We investigate empirically several structure priors in combination with different scores, using benchmark data sets and data sets generated from benchmark networks. Our results suggest that, in practice, priors that strongly favor sparsity perform significantly better than the uniform prior or even the informed variant that is conditioned on the correct number of parents for each node. For an analytic comparison of different priors, we generalize a known recurrence equation for the number of DAGs to accommodate modular weightings of DAGs, a result that is also of independent interest.

Cite this Paper


BibTeX
@InProceedings{pmlr-v89-eggeling19a, title = {On Structure Priors for Learning Bayesian Networks}, author = {Eggeling, Ralf and Viinikka, Jussi and Vuoksenmaa, Aleksis and Koivisto, Mikko}, booktitle = {Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics}, pages = {1687--1695}, year = {2019}, editor = {Chaudhuri, Kamalika and Sugiyama, Masashi}, volume = {89}, series = {Proceedings of Machine Learning Research}, month = {16--18 Apr}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v89/eggeling19a/eggeling19a.pdf}, url = {https://proceedings.mlr.press/v89/eggeling19a.html}, abstract = {To learn a Bayesian network structure from data, one popular approach is to maximize a decomposable likelihood-based score. While various scores have been proposed, they usually assume a uniform prior, or “penalty,” over the possible directed acyclic graphs (DAGs); relatively little attention has been paid to alternative priors. We investigate empirically several structure priors in combination with different scores, using benchmark data sets and data sets generated from benchmark networks. Our results suggest that, in practice, priors that strongly favor sparsity perform significantly better than the uniform prior or even the informed variant that is conditioned on the correct number of parents for each node. For an analytic comparison of different priors, we generalize a known recurrence equation for the number of DAGs to accommodate modular weightings of DAGs, a result that is also of independent interest.} }
Endnote
%0 Conference Paper %T On Structure Priors for Learning Bayesian Networks %A Ralf Eggeling %A Jussi Viinikka %A Aleksis Vuoksenmaa %A Mikko Koivisto %B Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics %C Proceedings of Machine Learning Research %D 2019 %E Kamalika Chaudhuri %E Masashi Sugiyama %F pmlr-v89-eggeling19a %I PMLR %P 1687--1695 %U https://proceedings.mlr.press/v89/eggeling19a.html %V 89 %X To learn a Bayesian network structure from data, one popular approach is to maximize a decomposable likelihood-based score. While various scores have been proposed, they usually assume a uniform prior, or “penalty,” over the possible directed acyclic graphs (DAGs); relatively little attention has been paid to alternative priors. We investigate empirically several structure priors in combination with different scores, using benchmark data sets and data sets generated from benchmark networks. Our results suggest that, in practice, priors that strongly favor sparsity perform significantly better than the uniform prior or even the informed variant that is conditioned on the correct number of parents for each node. For an analytic comparison of different priors, we generalize a known recurrence equation for the number of DAGs to accommodate modular weightings of DAGs, a result that is also of independent interest.
APA
Eggeling, R., Viinikka, J., Vuoksenmaa, A. & Koivisto, M.. (2019). On Structure Priors for Learning Bayesian Networks. Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 89:1687-1695 Available from https://proceedings.mlr.press/v89/eggeling19a.html.

Related Material