EEG Guided Token Selection in VQ for Visual Brain Decoding

Abhishek Rathore, PushapDeep Singh, Arnav Bhavsar
Proceedings of the 7th Northern Lights Deep Learning Conference (NLDL), PMLR 307:358-363, 2026.

Abstract

Reconstructing visual stimuli from non-invasive electroencephalography (EEG) is an interesting but challenging task in brain decoding that involves translating noisy neural signals into images via fine-grained generative control. In this work, we introduce a novel and efficient framework that guides a visual token generator by conditioning the generation process on a high-level semantic understanding of the EEG signal. Our method leverages a pre-trained LaBraM-based architecture to derive a robust class prediction from the neural data. In contrast to recent works based on diffusion models, which require high computational resources and long inference times, our approach uses a lightweight and efficient token generator built on the bidirectional, parallel decoding capabilities of MaskGIT. This choice of components avoids the high computational requirements typical of large-scale diffusion processes, making our approach not only easier to train but also more viable for brain-computer interface (BCI) applications where real-time feedback is crucial. The core of our method is a straightforward yet powerful two-stage process. First, the EEG classifier distills the complex input signal into a class label. In the second stage, this label serves as a direct condition for the pre-trained token generator. The generator, guided by this class information, then produces a sequence of discrete latent codes that are semantically consistent with the original stimulus. This neurally guided token sequence is finally rendered into a high-fidelity image by a pretrained decoder, completing an efficient pathway from brain activity to visual representation.
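The two-stage pathway described in the abstract can be outlined in a short sketch. The PyTorch snippet below is only illustrative: the module names (EEGClassifier, TokenGenerator), channel counts, codebook size, and the single-pass variant of MaskGIT-style parallel decoding are assumptions for exposition, not the authors' released implementation.

# Minimal sketch of the two-stage EEG-to-token pipeline (illustrative only).
# All names, shapes, and hyper-parameters are placeholders.

import torch
import torch.nn as nn

class EEGClassifier(nn.Module):
    """Stage 1: maps an EEG window to class logits.
    Stands in for the pre-trained LaBraM-based classifier."""
    def __init__(self, n_channels=62, n_classes=40):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv1d(n_channels, 128, kernel_size=7, padding=3),
            nn.GELU(),
            nn.AdaptiveAvgPool1d(1),
            nn.Flatten(),
        )
        self.head = nn.Linear(128, n_classes)

    def forward(self, eeg):                    # eeg: (B, n_channels, n_samples)
        return self.head(self.backbone(eeg))   # (B, n_classes)

class TokenGenerator(nn.Module):
    """Stage 2: class-conditional, MaskGIT-style bidirectional transformer
    predicting discrete VQ token indices (simplified to one refinement pass)."""
    def __init__(self, n_classes=40, vocab_size=1024, seq_len=256, dim=256):
        super().__init__()
        self.mask_id = vocab_size                       # extra id for [MASK]
        self.seq_len = seq_len
        self.tok_emb = nn.Embedding(vocab_size + 1, dim)
        self.cls_emb = nn.Embedding(n_classes, dim)
        self.pos_emb = nn.Parameter(torch.zeros(1, seq_len, dim))
        layer = nn.TransformerEncoderLayer(dim, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=4)
        self.to_logits = nn.Linear(dim, vocab_size)

    def forward(self, tokens, class_id):
        x = self.tok_emb(tokens) + self.pos_emb
        x = x + self.cls_emb(class_id).unsqueeze(1)     # broadcast class condition
        return self.to_logits(self.encoder(x))          # (B, seq_len, vocab_size)

@torch.no_grad()
def eeg_to_tokens(eeg, classifier, generator):
    """Distil EEG into a class label, then predict a token sequence
    conditioned on that label; the tokens would go to a pretrained VQ decoder."""
    class_id = classifier(eeg).argmax(dim=-1)                       # (B,)
    tokens = torch.full((eeg.size(0), generator.seq_len),
                        generator.mask_id, dtype=torch.long)        # fully masked
    logits = generator(tokens, class_id)
    return logits.argmax(dim=-1)                                    # (B, seq_len)

# Example usage with random data
eeg = torch.randn(2, 62, 512)
tokens = eeg_to_tokens(eeg, EEGClassifier(), TokenGenerator())
print(tokens.shape)  # torch.Size([2, 256])

In the full method the masked-token prediction would be iterated over several refinement steps, as in MaskGIT, rather than the single pass shown here.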

Cite this Paper


BibTeX
@InProceedings{pmlr-v307-rathore26a,
  title     = {{EEG} Guided Token Selection in {VQ} for Visual Brain Decoding},
  author    = {Rathore, Abhishek and Singh, PushapDeep and Bhavsar, Arnav},
  booktitle = {Proceedings of the 7th Northern Lights Deep Learning Conference (NLDL)},
  pages     = {358--363},
  year      = {2026},
  editor    = {Kim, Hyeongji and Ramírez Rivera, Adín and Ricaud, Benjamin},
  volume    = {307},
  series    = {Proceedings of Machine Learning Research},
  month     = {06--08 Jan},
  publisher = {PMLR},
  pdf       = {https://raw.githubusercontent.com/mlresearch/v307/main/assets/rathore26a/rathore26a.pdf},
  url       = {https://proceedings.mlr.press/v307/rathore26a.html},
  abstract  = {Reconstructing visual stimuli from non-invasive Electroencephalography (EEG) is an interesting but challenging task in brain decoding that involves translating noisy neural signals into images via fine-grained generative control. In this work, we introduce a novel and efficient framework that guides a visual token generator by conditioning the generation process on a high-level semantic understanding of the EEG signal. Our method leverages a pre-trained LaBraM-based architecture to derive a robust class prediction from the neural data. In comparison to recent works that involve diffusion models, which require high computational resources and long inference times, our approach utilizes a lightweight and efficient token generator by building upon the bidirectional, parallel decoding capabilities of MaskGIT. This choice of components avoids the high computational requirements typical of large-scale diffusion processes. This focus on efficiency makes our approach not only easier to train but also more viable for potential real-time BCI applications where real-time feedback is crucial. The core of our method is a straightforward yet powerful two-stage process. First, the EEG classifier distills the complex input signal into a class label. In the second stage, this label serves as a direct condition for the pre-trained token generator. The generator, guided by this class information, then produces a sequence of discrete latent codes that are semantically consistent with the original stimulus. This neurally-guided token sequence is finally rendered into a high-fidelity image by a pretrained decoder, completing an efficient pathway from brain activity to visual representation}
}
Endnote
%0 Conference Paper
%T EEG Guided Token Selection in VQ for Visual Brain Decoding
%A Abhishek Rathore
%A PushapDeep Singh
%A Arnav Bhavsar
%B Proceedings of the 7th Northern Lights Deep Learning Conference (NLDL)
%C Proceedings of Machine Learning Research
%D 2026
%E Hyeongji Kim
%E Adín Ramírez Rivera
%E Benjamin Ricaud
%F pmlr-v307-rathore26a
%I PMLR
%P 358--363
%U https://proceedings.mlr.press/v307/rathore26a.html
%V 307
%X Reconstructing visual stimuli from non-invasive Electroencephalography (EEG) is an interesting but challenging task in brain decoding that involves translating noisy neural signals into images via fine-grained generative control. In this work, we introduce a novel and efficient framework that guides a visual token generator by conditioning the generation process on a high-level semantic understanding of the EEG signal. Our method leverages a pre-trained LaBraM-based architecture to derive a robust class prediction from the neural data. In comparison to recent works that involve diffusion models, which require high computational resources and long inference times, our approach utilizes a lightweight and efficient token generator by building upon the bidirectional, parallel decoding capabilities of MaskGIT. This choice of components avoids the high computational requirements typical of large-scale diffusion processes. This focus on efficiency makes our approach not only easier to train but also more viable for potential real-time BCI applications where real-time feedback is crucial. The core of our method is a straightforward yet powerful two-stage process. First, the EEG classifier distills the complex input signal into a class label. In the second stage, this label serves as a direct condition for the pre-trained token generator. The generator, guided by this class information, then produces a sequence of discrete latent codes that are semantically consistent with the original stimulus. This neurally-guided token sequence is finally rendered into a high-fidelity image by a pretrained decoder, completing an efficient pathway from brain activity to visual representation
APA
Rathore, A., Singh, P. & Bhavsar, A. (2026). EEG Guided Token Selection in VQ for Visual Brain Decoding. Proceedings of the 7th Northern Lights Deep Learning Conference (NLDL), in Proceedings of Machine Learning Research 307:358-363. Available from https://proceedings.mlr.press/v307/rathore26a.html.
