[edit]
HistoEffiCrossFormer: Cross-Attention CNN–Transformer Fusion with Multi-Scale Tokens for Ovarian Cancer Histopathology Classification
Proceedings of IndabaX Nigeria 2026: Building Scalable AI That Works: From Research to Deployment in Resource-Constrained Environments, PMLR 319:320-330, 2026.
Abstract
We propose HistoEffiCrossFormer, a lightweight CNN–Transformer hybrid for five-class classification of ovarian cancer histopathology images. The pipeline uses Macenko-inspired stain normalisation, EfficientNet-B0 feature extraction, squeeze-and-excitation channel attention, multi-scale tokenisation, a 2-layer transformer encoder, and cross-attention fusion. Benchmarked against SqueezeNet, ShuffleNetV2, AlexNet, and Xception, HistoEffiCrossFormer achieves 0.8267 test accuracy with AUC 0.974, outperforming lightweight CNN baselines (0.7067–0.7867 accuracy) and closely matching Xception’s AUC (0.975), motivating further validation in external cohorts and whole-slide pipelines.