Scalable Gaussian Processes with Latent Kronecker Structure

Jihao Andreas Lin; Sebastian Ament; Maximilian Balandat; David Eriksson; José Miguel Hernández-Lobato; Eytan Bakshy

Scalable Gaussian Processes with Latent Kronecker Structure

Jihao Andreas Lin, Sebastian Ament, Maximilian Balandat, David Eriksson, José Miguel Hernández-Lobato, Eytan Bakshy

Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:37730-37744, 2025.

Abstract

Applying Gaussian processes (GPs) to very large datasets remains a challenge due to limited computational scalability. Matrix structures, such as the Kronecker product, can accelerate operations significantly, but their application commonly entails approximations or unrealistic assumptions. In particular, the most common path to creating a Kronecker-structured kernel matrix is by evaluating a product kernel on gridded inputs that can be expressed as a Cartesian product. However, this structure is lost if any observation is missing, breaking the Cartesian product structure, which frequently occurs in real-world data such as time series. To address this limitation, we propose leveraging latent Kronecker structure, by expressing the kernel matrix of observed values as the projection of a latent Kronecker product. In combination with iterative linear system solvers and pathwise conditioning, our method facilitates inference of exact GPs while requiring substantially fewer computational resources than standard iterative methods. We demonstrate that our method outperforms state-of-the-art sparse and variational GPs on real-world datasets with up to five million examples, including robotics, automated machine learning, and climate applications.

Cite this Paper

BibTeX

@InProceedings{pmlr-v267-lin25a,
  title = 	 {Scalable {G}aussian Processes with Latent {K}ronecker Structure},
  author =       {Lin, Jihao Andreas and Ament, Sebastian and Balandat, Maximilian and Eriksson, David and Hern\'{a}ndez-Lobato, Jos\'{e} Miguel and Bakshy, Eytan},
  booktitle = 	 {Proceedings of the 42nd International Conference on Machine Learning},
  pages = 	 {37730--37744},
  year = 	 {2025},
  editor = 	 {Singh, Aarti and Fazel, Maryam and Hsu, Daniel and Lacoste-Julien, Simon and Berkenkamp, Felix and Maharaj, Tegan and Wagstaff, Kiri and Zhu, Jerry},
  volume = 	 {267},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {13--19 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v267/main/assets/lin25a/lin25a.pdf},
  url = 	 {https://proceedings.mlr.press/v267/lin25a.html},
  abstract = 	 {Applying Gaussian processes (GPs) to very large datasets remains a challenge due to limited computational scalability. Matrix structures, such as the Kronecker product, can accelerate operations significantly, but their application commonly entails approximations or unrealistic assumptions. In particular, the most common path to creating a Kronecker-structured kernel matrix is by evaluating a product kernel on gridded inputs that can be expressed as a Cartesian product. However, this structure is lost if any observation is missing, breaking the Cartesian product structure, which frequently occurs in real-world data such as time series. To address this limitation, we propose leveraging latent Kronecker structure, by expressing the kernel matrix of observed values as the projection of a latent Kronecker product. In combination with iterative linear system solvers and pathwise conditioning, our method facilitates inference of exact GPs while requiring substantially fewer computational resources than standard iterative methods. We demonstrate that our method outperforms state-of-the-art sparse and variational GPs on real-world datasets with up to five million examples, including robotics, automated machine learning, and climate applications.}
}

Endnote

%0 Conference Paper
%T Scalable Gaussian Processes with Latent Kronecker Structure
%A Jihao Andreas Lin
%A Sebastian Ament
%A Maximilian Balandat
%A David Eriksson
%A José Miguel Hernández-Lobato
%A Eytan Bakshy
%B Proceedings of the 42nd International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Aarti Singh
%E Maryam Fazel
%E Daniel Hsu
%E Simon Lacoste-Julien
%E Felix Berkenkamp
%E Tegan Maharaj
%E Kiri Wagstaff
%E Jerry Zhu	
%F pmlr-v267-lin25a
%I PMLR
%P 37730--37744
%U https://proceedings.mlr.press/v267/lin25a.html
%V 267
%X Applying Gaussian processes (GPs) to very large datasets remains a challenge due to limited computational scalability. Matrix structures, such as the Kronecker product, can accelerate operations significantly, but their application commonly entails approximations or unrealistic assumptions. In particular, the most common path to creating a Kronecker-structured kernel matrix is by evaluating a product kernel on gridded inputs that can be expressed as a Cartesian product. However, this structure is lost if any observation is missing, breaking the Cartesian product structure, which frequently occurs in real-world data such as time series. To address this limitation, we propose leveraging latent Kronecker structure, by expressing the kernel matrix of observed values as the projection of a latent Kronecker product. In combination with iterative linear system solvers and pathwise conditioning, our method facilitates inference of exact GPs while requiring substantially fewer computational resources than standard iterative methods. We demonstrate that our method outperforms state-of-the-art sparse and variational GPs on real-world datasets with up to five million examples, including robotics, automated machine learning, and climate applications.

APA

Lin, J.A., Ament, S., Balandat, M., Eriksson, D., Hernández-Lobato, J.M. & Bakshy, E.. (2025). Scalable Gaussian Processes with Latent Kronecker Structure. Proceedings of the 42nd International Conference on Machine Learning, in Proceedings of Machine Learning Research 267:37730-37744 Available from https://proceedings.mlr.press/v267/lin25a.html.

Scalable Gaussian Processes with Latent Kronecker Structure

Abstract

Cite this Paper

Related Material