Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels

Xuchen You; Shouvanik Chakrabarti; Boyang Chen; Xiaodi Wu

Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels

Xuchen You, Shouvanik Chakrabarti, Boyang Chen, Xiaodi Wu

Proceedings of the 40th International Conference on Machine Learning, PMLR 202:40199-40224, 2023.

Abstract

A quantum neural network (QNN) is a parameterized mapping efficiently implementable on near-term Noisy Intermediate-Scale Quantum (NISQ) computers. It can be used for supervised learning when combined with classical gradient-based optimizers. Despite the existing empirical and theoretical investigations, the convergence of QNN training is not fully understood. Inspired by the success of the neural tangent kernels (NTKs) in probing into the dynamics of classical neural networks, a recent line of works proposes to study over-parameterized QNNs by examining a quantum version of tangent kernels. In this work, we study the dynamics of QNNs and show that contrary to popular belief it is qualitatively different from that of any kernel regression: due to the unitarity of quantum operations, there is a non-negligible deviation from the tangent kernel regression derived at the random initialization. As a result of the deviation, we prove the at-most sublinear convergence for QNNs with Pauli measurements, which is beyond the explanatory power of any kernel regression dynamics. We then present the actual dynamics of QNNs in the limit of over-parameterization. The new dynamics capture the change of convergence rate during training and implies that the range of measurements is crucial to the fast QNN convergence.

Cite this Paper

BibTeX

@InProceedings{pmlr-v202-you23a,
  title = 	 {Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels},
  author =       {You, Xuchen and Chakrabarti, Shouvanik and Chen, Boyang and Wu, Xiaodi},
  booktitle = 	 {Proceedings of the 40th International Conference on Machine Learning},
  pages = 	 {40199--40224},
  year = 	 {2023},
  editor = 	 {Krause, Andreas and Brunskill, Emma and Cho, Kyunghyun and Engelhardt, Barbara and Sabato, Sivan and Scarlett, Jonathan},
  volume = 	 {202},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {23--29 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v202/you23a/you23a.pdf},
  url = 	 {https://proceedings.mlr.press/v202/you23a.html},
  abstract = 	 {A quantum neural network (QNN) is a parameterized mapping efficiently implementable on near-term Noisy Intermediate-Scale Quantum (NISQ) computers. It can be used for supervised learning when combined with classical gradient-based optimizers. Despite the existing empirical and theoretical investigations, the convergence of QNN training is not fully understood. Inspired by the success of the neural tangent kernels (NTKs) in probing into the dynamics of classical neural networks, a recent line of works proposes to study over-parameterized QNNs by examining a quantum version of tangent kernels. In this work, we study the dynamics of QNNs and show that contrary to popular belief it is qualitatively different from that of any kernel regression: due to the unitarity of quantum operations, there is a non-negligible deviation from the tangent kernel regression derived at the random initialization. As a result of the deviation, we prove the at-most sublinear convergence for QNNs with Pauli measurements, which is beyond the explanatory power of any kernel regression dynamics. We then present the actual dynamics of QNNs in the limit of over-parameterization. The new dynamics capture the change of convergence rate during training and implies that the range of measurements is crucial to the fast QNN convergence.}
}

Endnote

%0 Conference Paper
%T Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels
%A Xuchen You
%A Shouvanik Chakrabarti
%A Boyang Chen
%A Xiaodi Wu
%B Proceedings of the 40th International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2023
%E Andreas Krause
%E Emma Brunskill
%E Kyunghyun Cho
%E Barbara Engelhardt
%E Sivan Sabato
%E Jonathan Scarlett	
%F pmlr-v202-you23a
%I PMLR
%P 40199--40224
%U https://proceedings.mlr.press/v202/you23a.html
%V 202
%X A quantum neural network (QNN) is a parameterized mapping efficiently implementable on near-term Noisy Intermediate-Scale Quantum (NISQ) computers. It can be used for supervised learning when combined with classical gradient-based optimizers. Despite the existing empirical and theoretical investigations, the convergence of QNN training is not fully understood. Inspired by the success of the neural tangent kernels (NTKs) in probing into the dynamics of classical neural networks, a recent line of works proposes to study over-parameterized QNNs by examining a quantum version of tangent kernels. In this work, we study the dynamics of QNNs and show that contrary to popular belief it is qualitatively different from that of any kernel regression: due to the unitarity of quantum operations, there is a non-negligible deviation from the tangent kernel regression derived at the random initialization. As a result of the deviation, we prove the at-most sublinear convergence for QNNs with Pauli measurements, which is beyond the explanatory power of any kernel regression dynamics. We then present the actual dynamics of QNNs in the limit of over-parameterization. The new dynamics capture the change of convergence rate during training and implies that the range of measurements is crucial to the fast QNN convergence.

APA

You, X., Chakrabarti, S., Chen, B. & Wu, X.. (2023). Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels. Proceedings of the 40th International Conference on Machine Learning, in Proceedings of Machine Learning Research 202:40199-40224 Available from https://proceedings.mlr.press/v202/you23a.html.

Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels

Abstract

Cite this Paper

Related Material