[edit]
Speech-Based Parkinson’s Disease Screening: A Deep Learning Approach Using Acoustic Biomarkers and Invertible Neural Networks
Proceedings of IndabaX Nigeria 2026: Building Scalable AI That Works: From Research to Deployment in Resource-Constrained Environments, PMLR 319:355-367, 2026.
Abstract
We propose a deep learning architecture combining deep residual encoders and squeeze-and-excitation attention mechanisms with invertible normalising flows for speech-based Parkinson’s disease (PD) screening. On the Telemonitoring dataset, the model achieves AUC 0.979, 100% specificity, 72.4% sensitivity, and 79.5% accuracy. On the MSR dataset, it achieves 71.1% accuracy, AUC 0.806, with symmetric sensitivity (71.0%) and specificity (71.2%). Invertible normalising flows enable exact density estimation and principled uncertainty quantification, supporting telemedicine applications for smartphone-based remote screening of Parkinson’s disease.