Differentiable Mapper for Topological Optimization of Data Representation

Ziyad Oulhaj; Mathieu Carrière; Bertrand Michel

Differentiable Mapper for Topological Optimization of Data Representation

Ziyad Oulhaj, Mathieu Carrière, Bertrand Michel

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:38919-38936, 2024.

Abstract

Unsupervised data representation and visualization using tools from topology is an active and growing field of Topological Data Analysis (TDA) and data science. Its most prominent line of work is based on the so-called Mapper graph, which is a combinatorial graph whose topological structures (connected components, branches, loops) are in correspondence with those of the data itself. While highly generic and applicable, its use has been hampered so far by the manual tuning of its many parameters—among these, a crucial one is the so-called filter: it is a continuous function whose variations on the data set are the main ingredient for both building the Mapper representation and assessing the presence and sizes of its topological structures. However, while a few parameter tuning methods have already been investigated for the other Mapper parameters (i.e., resolution, gain, clustering), there is currently no method for tuning the filter itself. In this work, we build on a recently proposed optimization framework incorporating topology to provide the first filter optimization scheme for Mapper graphs. In order to achieve this, we propose a relaxed and more general version of the Mapper graph, whose convergence properties are investigated. Finally, we demonstrate the usefulness of our approach by optimizing Mapper graph representations on several datasets, and showcasing the superiority of the optimized representation over arbitrary ones.

Cite this Paper

BibTeX

@InProceedings{pmlr-v235-oulhaj24a,
  title = 	 {Differentiable Mapper for Topological Optimization of Data Representation},
  author =       {Oulhaj, Ziyad and Carri\`{e}re, Mathieu and Michel, Bertrand},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {38919--38936},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/oulhaj24a/oulhaj24a.pdf},
  url = 	 {https://proceedings.mlr.press/v235/oulhaj24a.html},
  abstract = 	 {Unsupervised data representation and visualization using tools from topology is an active and growing field of Topological Data Analysis (TDA) and data science. Its most prominent line of work is based on the so-called Mapper graph, which is a combinatorial graph whose topological structures (connected components, branches, loops) are in correspondence with those of the data itself. While highly generic and applicable, its use has been hampered so far by the manual tuning of its many parameters—among these, a crucial one is the so-called filter: it is a continuous function whose variations on the data set are the main ingredient for both building the Mapper representation and assessing the presence and sizes of its topological structures. However, while a few parameter tuning methods have already been investigated for the other Mapper parameters (i.e., resolution, gain, clustering), there is currently no method for tuning the filter itself. In this work, we build on a recently proposed optimization framework incorporating topology to provide the first filter optimization scheme for Mapper graphs. In order to achieve this, we propose a relaxed and more general version of the Mapper graph, whose convergence properties are investigated. Finally, we demonstrate the usefulness of our approach by optimizing Mapper graph representations on several datasets, and showcasing the superiority of the optimized representation over arbitrary ones.}
}

Endnote

%0 Conference Paper
%T Differentiable Mapper for Topological Optimization of Data Representation
%A Ziyad Oulhaj
%A Mathieu Carrière
%A Bertrand Michel
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-oulhaj24a
%I PMLR
%P 38919--38936
%U https://proceedings.mlr.press/v235/oulhaj24a.html
%V 235
%X Unsupervised data representation and visualization using tools from topology is an active and growing field of Topological Data Analysis (TDA) and data science. Its most prominent line of work is based on the so-called Mapper graph, which is a combinatorial graph whose topological structures (connected components, branches, loops) are in correspondence with those of the data itself. While highly generic and applicable, its use has been hampered so far by the manual tuning of its many parameters—among these, a crucial one is the so-called filter: it is a continuous function whose variations on the data set are the main ingredient for both building the Mapper representation and assessing the presence and sizes of its topological structures. However, while a few parameter tuning methods have already been investigated for the other Mapper parameters (i.e., resolution, gain, clustering), there is currently no method for tuning the filter itself. In this work, we build on a recently proposed optimization framework incorporating topology to provide the first filter optimization scheme for Mapper graphs. In order to achieve this, we propose a relaxed and more general version of the Mapper graph, whose convergence properties are investigated. Finally, we demonstrate the usefulness of our approach by optimizing Mapper graph representations on several datasets, and showcasing the superiority of the optimized representation over arbitrary ones.

APA

Oulhaj, Z., Carrière, M. & Michel, B.. (2024). Differentiable Mapper for Topological Optimization of Data Representation. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:38919-38936 Available from https://proceedings.mlr.press/v235/oulhaj24a.html.

Differentiable Mapper for Topological Optimization of Data Representation

Abstract

Cite this Paper

Related Material