Kernel-based Approach for Learning Causal Graphs from Mixed Data

Teny Handhayani; James Cussens

Kernel-based Approach for Learning Causal Graphs from Mixed Data

Teny Handhayani, James Cussens

Proceedings of the 10th International Conference on Probabilistic Graphical Models, PMLR 138:221-232, 2020.

Abstract

A causal graph can be generated from a dataset using a particular causal algorithm, for instance, the PC algorithm or Fast Causal Inference (FCI). This paper provides two contributions in learning causal graphs: an easy way to handle mixed data so that it can be used to learn causal graphs using the PC algorithm/FCI and a method to evaluate the learned graph structure when the true graph is unknown. This research proposes using kernel functions and Kernel Alignment to handle mixed data. The two main steps of this approach are computing a kernel matrix for each variable and calculating a pseudo-correlation matrix using Kernel Alignment. The Kernel Alignment matrix is used as a substitute for the correlation matrix that is the main component used in computing a partial correlation for the conditional independence test for Gaussian data in the PC Algorithm and FCI. The advantage of this idea is that is possible to handle more data types when there is a suitable kernel function to compute a kernel matrix for an observed variable. The proposed method is successfully applied to learn a causal graph from mixed data containing categorical, binary, ordinal, and continuous variables. We also introduce the Modal Value of Edges Existence (MVEE) method, a new method to evaluate the structure of learned graphs represented by Partial Ancestral Graph (PAG) when the true graph is unknown. MVEE produces an agreement graph as a proxy to the true graph to evaluate the structure of the learned graph. MVEE is successfully used to choose the best-learned graph when the true graph is unknown.

Cite this Paper

BibTeX


@InProceedings{pmlr-v138-handhayani20a,
  title = 	 {Kernel-based Approach for Learning Causal Graphs from Mixed Data},
  author =       {Handhayani, Teny and Cussens, James},
  booktitle = 	 {Proceedings of the 10th International Conference on Probabilistic Graphical Models},
  pages = 	 {221--232},
  year = 	 {2020},
  editor = 	 {Jaeger, Manfred and Nielsen, Thomas Dyhre},
  volume = 	 {138},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {23--25 Sep},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v138/handhayani20a/handhayani20a.pdf},
  url = 	 {https://proceedings.mlr.press/v138/handhayani20a.html},
  abstract = 	 {A causal graph can be generated from a dataset using a particular
 causal algorithm, for instance, the PC algorithm or Fast Causal
 Inference (FCI). This paper provides two contributions in learning
 causal graphs: an easy way to handle mixed data so that it can
 be used to learn causal graphs using the PC algorithm/FCI and a
 method to evaluate the learned graph structure when the true graph
 is unknown. This research proposes using kernel functions and Kernel
 Alignment to handle mixed data. The two main steps of this approach are
 computing a kernel matrix for each variable and calculating a
 pseudo-correlation matrix using Kernel Alignment. The Kernel
 Alignment matrix is used as a substitute for the correlation matrix
 that is the main component used in computing a partial correlation
 for the conditional independence test for Gaussian data in the PC
 Algorithm and FCI. The advantage of this idea is that is possible to
 handle more data types when there is a suitable kernel function to
 compute a kernel matrix for an observed variable. The proposed
 method is successfully applied to learn a causal graph from mixed
 data containing categorical, binary, ordinal, and continuous
 variables. We also introduce the Modal Value of Edges Existence
 (MVEE) method, a new method to evaluate the structure of learned
 graphs represented by Partial Ancestral Graph (PAG) when the true
 graph is unknown. MVEE produces an agreement graph as a proxy to the
 true graph to evaluate the structure of the learned graph. MVEE is
 successfully used to choose the best-learned graph when the true
 graph is unknown. }
}

Endnote

%0 Conference Paper
%T Kernel-based Approach for Learning Causal Graphs from Mixed Data
%A Teny Handhayani
%A James Cussens
%B Proceedings of the 10th International Conference on Probabilistic Graphical Models
%C Proceedings of Machine Learning Research
%D 2020
%E Manfred Jaeger
%E Thomas Dyhre Nielsen	
%F pmlr-v138-handhayani20a
%I PMLR
%P 221--232
%U https://proceedings.mlr.press/v138/handhayani20a.html
%V 138
%X A causal graph can be generated from a dataset using a particular
 causal algorithm, for instance, the PC algorithm or Fast Causal
 Inference (FCI). This paper provides two contributions in learning
 causal graphs: an easy way to handle mixed data so that it can
 be used to learn causal graphs using the PC algorithm/FCI and a
 method to evaluate the learned graph structure when the true graph
 is unknown. This research proposes using kernel functions and Kernel
 Alignment to handle mixed data. The two main steps of this approach are
 computing a kernel matrix for each variable and calculating a
 pseudo-correlation matrix using Kernel Alignment. The Kernel
 Alignment matrix is used as a substitute for the correlation matrix
 that is the main component used in computing a partial correlation
 for the conditional independence test for Gaussian data in the PC
 Algorithm and FCI. The advantage of this idea is that is possible to
 handle more data types when there is a suitable kernel function to
 compute a kernel matrix for an observed variable. The proposed
 method is successfully applied to learn a causal graph from mixed
 data containing categorical, binary, ordinal, and continuous
 variables. We also introduce the Modal Value of Edges Existence
 (MVEE) method, a new method to evaluate the structure of learned
 graphs represented by Partial Ancestral Graph (PAG) when the true
 graph is unknown. MVEE produces an agreement graph as a proxy to the
 true graph to evaluate the structure of the learned graph. MVEE is
 successfully used to choose the best-learned graph when the true
 graph is unknown.

APA


Handhayani, T. & Cussens, J.. (2020). Kernel-based Approach for Learning Causal Graphs from Mixed Data. Proceedings of the 10th International Conference on Probabilistic Graphical Models, in Proceedings of Machine Learning Research 138:221-232 Available from https://proceedings.mlr.press/v138/handhayani20a.html.

Related Material

Download PDF