[edit]
STRESNET & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs
Proceedings of the The 39th Canadian Conference on Artificial Intelligence, PMLR 318:38-51, 2026.
Abstract
Recent advancements in lightweight neural networks have significantly improved the efficiency of deploying deep learning models on edge hardware. However, most existing architectures still compromise accuracy for latency, which limits their applicability on MCU/NPU-based devices. In this work, we introduce two new model families — STResNet for image classification and STYOLO for object detection — jointly optimized for accuracy, efficiency, and memory footprint on resource-constrained platforms. The proposed STResNet series (ranging from Nano to Tiny variants) achieves competitive ImageNet-1K accuracy within a 4M parameter budget. Specifically, STResNetMilli attains 70.0% Top-1 accuracy with only 3.0M parameters, outperforming MobileNetV1 and ShuffleNetV2 at comparable computational complexity. For object detection, STYOLOMicro and STYOLOMilli achieve 30.5% and 33.6% mAP, respectively, on the MS-COCO dataset, surpassing YOLOv5n and YOLOX-Nano in both accuracy and efficiency. Furthermore, when STResNetMilli is used as a backbone with the Ultralytics detection head, it approaches the performance of the YOLOv11n model under the latest Ultralytics training environment.