[edit]
TopoCAM: ROI-Driven Topological Signatures in 3D Medical Imaging
Proceedings of the Fifth Machine Learning for Health Symposium, PMLR 297:41-54, 2026.
Abstract
Accurate classification of {3D} medical images is challenging due to the high dimensionality of volumetric data and the scarcity of well-annotated clinical datasets. We propose a hybrid framework that couples explainable deep learning with topological data analysis (TDA). First, we compute layer-weighted Grad-CAM across multiple network layers, upsample and normalize the maps to the input grid, and threshold them to produce a binary region-of-interest (ROI) mask. We then apply this mask to the input volume to obtain a segmented image that suppresses irrelevant anatomy while preserving clinically salient structures. Within these attention-derived ROIs and segmented images, we compute cubical persistent homology to derive compact topological descriptors that capture diagnostically meaningful features. Across both {3D} volumes and {2D} medical imaging benchmarks, this segmentation-guided TDA pipeline surpasses strong {3D} {CNN} and Transformer baselines, yielding higher accuracy and improved robustness in limited-data settings while providing localized, interpretable evidence for clinical decision support.