Memory Efficient Neural Processes via Constant Memory Attention Block

Leo Feng; Frederick Tung; Hossein Hajimirsadeghi; Yoshua Bengio; Mohamed Osama Ahmed

Memory Efficient Neural Processes via Constant Memory Attention Block

Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:13365-13386, 2024.

Abstract

Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we first propose an efficient update operation for Cross Attention. Leveraging the update operation, we propose Constant Memory Attention Block (CMAB), a novel attention block that (i) is permutation invariant, (ii) computes its output in constant memory, and (iii) performs constant computation updates. Finally, building on CMAB, we detail Constant Memory Attentive Neural Processes. Empirically, we show CMANPs achieve state-of-the-art results on popular NP benchmarks while being significantly more memory efficient than prior methods.

Cite this Paper

BibTeX


@InProceedings{pmlr-v235-feng24i,
  title = 	 {Memory Efficient Neural Processes via Constant Memory Attention Block},
  author =       {Feng, Leo and Tung, Frederick and Hajimirsadeghi, Hossein and Bengio, Yoshua and Ahmed, Mohamed Osama},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {13365--13386},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/feng24i/feng24i.pdf},
  url = 	 {https://proceedings.mlr.press/v235/feng24i.html},
  abstract = 	 {Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we first propose an efficient update operation for Cross Attention. Leveraging the update operation, we propose Constant Memory Attention Block (CMAB), a novel attention block that (i) is permutation invariant, (ii) computes its output in constant memory, and (iii) performs constant computation updates. Finally, building on CMAB, we detail Constant Memory Attentive Neural Processes. Empirically, we show CMANPs achieve state-of-the-art results on popular NP benchmarks while being significantly more memory efficient than prior methods.}
}

Endnote

%0 Conference Paper
%T Memory Efficient Neural Processes via Constant Memory Attention Block
%A Leo Feng
%A Frederick Tung
%A Hossein Hajimirsadeghi
%A Yoshua Bengio
%A Mohamed Osama Ahmed
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-feng24i
%I PMLR
%P 13365--13386
%U https://proceedings.mlr.press/v235/feng24i.html
%V 235
%X Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we first propose an efficient update operation for Cross Attention. Leveraging the update operation, we propose Constant Memory Attention Block (CMAB), a novel attention block that (i) is permutation invariant, (ii) computes its output in constant memory, and (iii) performs constant computation updates. Finally, building on CMAB, we detail Constant Memory Attentive Neural Processes. Empirically, we show CMANPs achieve state-of-the-art results on popular NP benchmarks while being significantly more memory efficient than prior methods.

APA


Feng, L., Tung, F., Hajimirsadeghi, H., Bengio, Y. & Ahmed, M.O.. (2024). Memory Efficient Neural Processes via Constant Memory Attention Block. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:13365-13386 Available from https://proceedings.mlr.press/v235/feng24i.html.

Memory Efficient Neural Processes via Constant Memory Attention Block

Abstract

Cite this Paper

Related Material