[edit]
INT8 Quantisation for Cassava Leaf Disease Classification on Raspberry Pi
Proceedings of IndabaX Nigeria 2026: Building Scalable AI That Works: From Research to Deployment in Resource-Constrained Environments, PMLR 319:167-178, 2026.
Abstract
We investigate the impact of post-training INT8 quantisation on a CNN trained for the Kaggle Cassava Leaf Disease Classification task (5 classes). Comparing FP32 baseline versus statically quantised INT8 model, the FP32 model achieved 81.5% accuracy (F1 = 0.676, 6.23 MB, 3.04 ms CPU latency), whereas the INT8 model shrank to 1.87 MB but accuracy dropped to 11.9% (58 ms on Raspberry Pi 4). These results reveal a drastic size-accuracy trade-off, highlighting pitfalls and practical lessons for deploying CNNs on edge devices in resource-constrained settings.