Uniform Memory Retrieval with Larger Capacity for Modern Hopfield Models
Proceedings of the 41st International Conference on Machine Learning, PMLR 235:53471-53514, 2024.
Abstract
We propose a two-stage optimization formulation for the memory retrieval dynamics of modern Hopfield models, termed $\mathtt{U\text{-}Hop}$. Our key contribution is a learnable feature map $\Phi$ that transforms the Hopfield energy function into a kernel space. This transformation ensures convergence between the local minima of the energy and the fixed points of the retrieval dynamics within the kernel space. Consequently, the kernel norm induced by $\Phi$ serves as a novel similarity measure. It utilizes the stored memory patterns as learning data to enhance the memory capacity of all modern Hopfield models. Specifically, we accomplish this by constructing a separation loss $\mathcal{L}_\Phi$ that separates the local minima of the kernelized energy by separating the stored memory patterns in kernel space. Methodologically, the $\mathtt{U\text{-}Hop}$ memory retrieval process consists of (Stage I) minimizing the separation loss for a more uniform distribution of memories (local minima), followed by (Stage II) standard Hopfield energy minimization for memory retrieval. This significantly reduces the number of possible metastable states in the Hopfield energy function, thereby preventing memory confusion. Empirically, on real-world datasets, we demonstrate that $\mathtt{U\text{-}Hop}$ outperforms all existing modern Hopfield models and SOTA similarity measures by substantial margins in both associative memory retrieval and deep learning tasks. Code is available on GitHub; future updates are on arXiv.
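For intuition, below is a minimal, hypothetical PyTorch sketch of the two-stage procedure described in the abstract. The feature-map architecture, the specific quadratic form of the separation loss, and all names (`FeatureMap`, `separation_loss`, `u_hop_retrieve`) are illustrative assumptions for exposition, not the authors' released implementation (see the linked GitHub repository for that).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical sketch of U-Hop's two-stage retrieval; the feature map,
# loss form, and hyperparameters below are illustrative assumptions.

class FeatureMap(nn.Module):
    """Learnable feature map Phi lifting patterns into a kernel space."""
    def __init__(self, dim: int):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.normalize(self.linear(x), dim=-1)

def separation_loss(phi_memories: torch.Tensor) -> torch.Tensor:
    """Illustrative L_Phi: penalize pairwise similarity of the stored
    patterns in kernel space, pushing the energy's local minima apart."""
    sim = phi_memories @ phi_memories.T            # (M, M) kernel similarities
    off_diag = sim - torch.diag(torch.diag(sim))   # drop self-similarity terms
    return off_diag.pow(2).mean()

def u_hop_retrieve(query: torch.Tensor, memories: torch.Tensor,
                   beta: float = 1.0, stage1_steps: int = 100,
                   stage2_steps: int = 5) -> torch.Tensor:
    """query: (d,), memories: (M, d); returns the retrieved pattern."""
    phi = FeatureMap(memories.shape[-1])
    opt = torch.optim.Adam(phi.parameters(), lr=1e-3)

    # Stage I: minimize the separation loss so the memories
    # (energy local minima) distribute more uniformly in kernel space.
    for _ in range(stage1_steps):
        opt.zero_grad()
        loss = separation_loss(phi(memories))
        loss.backward()
        opt.step()

    # Stage II: standard modern-Hopfield energy minimization
    # (softmax retrieval dynamics), with similarities computed
    # in the learned kernel space.
    with torch.no_grad():
        xi = query
        for _ in range(stage2_steps):
            attn = F.softmax(beta * phi(memories) @ phi(xi), dim=0)
            xi = memories.T @ attn  # convex combination of stored patterns
        return xi
```

In this reading, Stage I reshapes the energy landscape so the stored patterns' basins of attraction are well separated before any retrieval happens, and Stage II then runs the familiar softmax retrieval dynamics, now scoring query-memory similarity through the learned map $\Phi$ rather than raw inner products.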