[edit]
Development of an Enhanced Machine Learning Model for Deception Detection Leveraging Facial Action Units and Linguistic Features
Proceedings of IndabaX Nigeria 2026: Building Scalable AI That Works: From Research to Deployment in Resource-Constrained Environments, PMLR 319:62-73, 2026.
Abstract
This study proposes a bimodal machine learning approach for multiclass deception detection by integrating facial Action Unit (AU) features and linguistic features extracted from a real-life dataset of video interviews of suspected criminal perpetrators, persons of interest, and convicted criminals. Facial features were obtained using the Facial Action Coding System (FACS), while linguistic cues were derived using Linguistic Inquiry Word Count (LIWC) scores. A mid-level data fusion strategy combines the extracted features into a unified representation. A Random Forest classifier applied to 3,720 real-life samples with 80:20 split and 10-fold cross-validation achieved an overall classification accuracy of 88%. Results confirm that combining facial and linguistic cues from real-life datasets provides a richer representation of deceptive behaviour.