[edit]
G-LaD: Graph-Language alignment for few-shot Diagnosis from fMRI
Proceedings of the First Workshop on NeuroAI Multimodal Intelligence @ AAAI 2026, PMLR 308:180-188, 2026.
Abstract
Decoding human cognitive states from neural activity is a core challenge in artificial intelligence and computational neuroscience. Functional Magnetic Resonance Imaging (fMRI) captures high-dimensional spatiotemporal patterns of brain activity, yet characterizing cognitive states based on modeling the complex, dynamic dependencies among distributed regions remains difficult. While Graph Neural Networks (GNNs) to represent the brain as a structured graph has advanced functional connectivity (FC) analysis, they suffer from limited generalization, reliance on large labeled datasets, and poor transferability across neuro-imaging tasks. We introduce Graph-Language alignment for Diagnosis (G-LaD), that integrates graph representation learning with Large Language Models (LLMs) for data-efficient brain graph classification. G-LaD first pretrains a graph encoder, built upon Graph Isomorphism Network layers using a reconstruction-driven Denoising Autoencoder, to capture structural and topological invariants. In the second stage, distribution-level alignment between graph and language representations is achieved via a Sinkhorn-divergence objective, enabling smooth and transferable cross-modal mapping. Finally, a Chain-of-Thought prompting mechanism guides the LLM to perform reasoning-driven predictions. Empirical evaluations on the ABIDE dataset demonstrate superior few-shot generalization and robust performance of G-LAD in neuro-degenerative disorder classification.