[edit]
Defending Against Text-Based Social Engineering Attacks Using Federated Adversarial Learning
Proceedings of IndabaX Nigeria 2026: Building Scalable AI That Works: From Research to Deployment in Resource-Constrained Environments, PMLR 319:26-36, 2026.
Abstract
Text-based social engineering attacks such as phishing emails and scam messages have remained quite successful because language and obfuscation patterns are constantly adapted by adversaries, and centralised detection approaches raise serious privacy concerns as well as insufficient real-world deployment. This paper proposes a privacy-preserving and robust detection system combining Joint Embedding Predictive Architecture (JEPA) representation learning with federated learning, enabling multiple clients to collaboratively train a global model without exchanging raw user data. To overcome modelling capacity challenges in heterogeneous (non-IID) client distributions, a Mixture-of-Experts (MoE) design and a Kolmogorov–Arnold Network (KAN)-based prediction head are adopted. The global model is trained via iterative local optimisation and server-side aggregation, with robustness induced by a federated adversarial learning stage. Experimental results demonstrate that the JEPA-Federated-MoE/KAN pipeline consistently exhibits excellent detection performance with privacy preservation and enhanced flexibility against adversarial changes.