High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization

Yihang Chen; Fanghui Liu; Taiji Suzuki; Volkan Cevher

High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization

Yihang Chen, Fanghui Liu, Taiji Suzuki, Volkan Cevher

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:7081-7102, 2024.

Abstract

This paper studies kernel ridge regression in high dimensions under covariate shifts and analyzes the role of importance re-weighting. We first derive the asymptotic expansion of high dimensional kernels under covariate shifts. By a bias-variance decomposition, we theoretically demonstrate that the re-weighting strategy allows for decreasing the variance. For bias, we analyze the regularization of the arbitrary or well-chosen scale, showing that the bias can behave very differently under different regularization scales. In our analysis, the bias and variance can be characterized by the spectral decay of a data-dependent regularized kernel: the original kernel matrix associated with an additional re-weighting matrix, and thus the re-weighting strategy can be regarded as a data-dependent regularization for better understanding. Besides, our analysis provides asymptotic expansion of kernel functions/vectors under covariate shift, which has its own interest.

Cite this Paper

BibTeX


@InProceedings{pmlr-v235-chen24aa,
  title = 	 {High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization},
  author =       {Chen, Yihang and Liu, Fanghui and Suzuki, Taiji and Cevher, Volkan},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {7081--7102},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/chen24aa/chen24aa.pdf},
  url = 	 {https://proceedings.mlr.press/v235/chen24aa.html},
  abstract = 	 {This paper studies kernel ridge regression in high dimensions under covariate shifts and analyzes the role of importance re-weighting. We first derive the asymptotic expansion of high dimensional kernels under covariate shifts. By a bias-variance decomposition, we theoretically demonstrate that the re-weighting strategy allows for decreasing the variance. For bias, we analyze the regularization of the arbitrary or well-chosen scale, showing that the bias can behave very differently under different regularization scales. In our analysis, the bias and variance can be characterized by the spectral decay of a data-dependent regularized kernel: the original kernel matrix associated with an additional re-weighting matrix, and thus the re-weighting strategy can be regarded as a data-dependent regularization for better understanding. Besides, our analysis provides asymptotic expansion of kernel functions/vectors under covariate shift, which has its own interest.}
}

Endnote

%0 Conference Paper
%T High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization
%A Yihang Chen
%A Fanghui Liu
%A Taiji Suzuki
%A Volkan Cevher
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-chen24aa
%I PMLR
%P 7081--7102
%U https://proceedings.mlr.press/v235/chen24aa.html
%V 235
%X This paper studies kernel ridge regression in high dimensions under covariate shifts and analyzes the role of importance re-weighting. We first derive the asymptotic expansion of high dimensional kernels under covariate shifts. By a bias-variance decomposition, we theoretically demonstrate that the re-weighting strategy allows for decreasing the variance. For bias, we analyze the regularization of the arbitrary or well-chosen scale, showing that the bias can behave very differently under different regularization scales. In our analysis, the bias and variance can be characterized by the spectral decay of a data-dependent regularized kernel: the original kernel matrix associated with an additional re-weighting matrix, and thus the re-weighting strategy can be regarded as a data-dependent regularization for better understanding. Besides, our analysis provides asymptotic expansion of kernel functions/vectors under covariate shift, which has its own interest.

APA


Chen, Y., Liu, F., Suzuki, T. & Cevher, V.. (2024). High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:7081-7102 Available from https://proceedings.mlr.press/v235/chen24aa.html.

High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization

Abstract

Cite this Paper

Related Material