Learning Structure-Aware Foundational Representation of Rat Testicular Tubules Using Multiple Instance Learning

Vedang Kshirsagar, Saketh Juturu, Geetank Raipuria, Nitin Singhal
Proceedings of The 9th International Conference on Medical Imaging with Deep Learning, PMLR 315:3131-3151, 2026.

Abstract

Testicular toxicity is a critical factor in preclinical drug safety assessment, yet automated modelling of testicular abnormalities remains largely unexplored. Unlike liver or kidney tissue, the testis tissue is organized into tubules that vary substantially in size and structure, making fixed-resolution patch classification ineffective. We first demonstrate that resizing tubules significantly degrades performance particularly for larger sized tubules and a Multiple Instance Learning (MIL) model offers substantial improvements. Building on this, we introduce TBA-MIL, a transformer-based aggregation model with learnable positional embeddings that encodes the structure of tubules and is pre-trained using a self-supervised Masked Instance Modelling (MIM-MIL) framework, learning tubule representations from large-scale unlabeled data. Across four tubule types, TBA-MIL with MIM-MIL outperforms state-of-the-art MIL models and establishes a strong baseline for automated testicular toxicity assessment. Additionally, we evaluate the proposed framework on an independent toxicological study and show that the predicted abnormality distributions significantly differentiate control and treated animal tissues, consistent with expert pathologists’ assessment.

Cite this Paper


BibTeX
@InProceedings{pmlr-v315-kshirsagar26a, title = {Learning Structure-Aware Foundational Representation of Rat Testicular Tubules Using Multiple Instance Learning}, author = {Kshirsagar, Vedang and Juturu, Saketh and Raipuria, Geetank and Singhal, Nitin}, booktitle = {Proceedings of The 9th International Conference on Medical Imaging with Deep Learning}, pages = {3131--3151}, year = {2026}, editor = {Huo, Yuankai and Gao, Mingchen and Kuo, Chang-Fu and Jin, Yueming and Deng, Ruining}, volume = {315}, series = {Proceedings of Machine Learning Research}, month = {08--10 Jul}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v315/main/assets/kshirsagar26a/kshirsagar26a.pdf}, url = {https://proceedings.mlr.press/v315/kshirsagar26a.html}, abstract = {Testicular toxicity is a critical factor in preclinical drug safety assessment, yet automated modelling of testicular abnormalities remains largely unexplored. Unlike liver or kidney tissue, the testis tissue is organized into tubules that vary substantially in size and structure, making fixed-resolution patch classification ineffective. We first demonstrate that resizing tubules significantly degrades performance particularly for larger sized tubules and a Multiple Instance Learning (MIL) model offers substantial improvements. Building on this, we introduce TBA-MIL, a transformer-based aggregation model with learnable positional embeddings that encodes the structure of tubules and is pre-trained using a self-supervised Masked Instance Modelling (MIM-MIL) framework, learning tubule representations from large-scale unlabeled data. Across four tubule types, TBA-MIL with MIM-MIL outperforms state-of-the-art MIL models and establishes a strong baseline for automated testicular toxicity assessment. Additionally, we evaluate the proposed framework on an independent toxicological study and show that the predicted abnormality distributions significantly differentiate control and treated animal tissues, consistent with expert pathologists’ assessment.} }
Endnote
%0 Conference Paper %T Learning Structure-Aware Foundational Representation of Rat Testicular Tubules Using Multiple Instance Learning %A Vedang Kshirsagar %A Saketh Juturu %A Geetank Raipuria %A Nitin Singhal %B Proceedings of The 9th International Conference on Medical Imaging with Deep Learning %C Proceedings of Machine Learning Research %D 2026 %E Yuankai Huo %E Mingchen Gao %E Chang-Fu Kuo %E Yueming Jin %E Ruining Deng %F pmlr-v315-kshirsagar26a %I PMLR %P 3131--3151 %U https://proceedings.mlr.press/v315/kshirsagar26a.html %V 315 %X Testicular toxicity is a critical factor in preclinical drug safety assessment, yet automated modelling of testicular abnormalities remains largely unexplored. Unlike liver or kidney tissue, the testis tissue is organized into tubules that vary substantially in size and structure, making fixed-resolution patch classification ineffective. We first demonstrate that resizing tubules significantly degrades performance particularly for larger sized tubules and a Multiple Instance Learning (MIL) model offers substantial improvements. Building on this, we introduce TBA-MIL, a transformer-based aggregation model with learnable positional embeddings that encodes the structure of tubules and is pre-trained using a self-supervised Masked Instance Modelling (MIM-MIL) framework, learning tubule representations from large-scale unlabeled data. Across four tubule types, TBA-MIL with MIM-MIL outperforms state-of-the-art MIL models and establishes a strong baseline for automated testicular toxicity assessment. Additionally, we evaluate the proposed framework on an independent toxicological study and show that the predicted abnormality distributions significantly differentiate control and treated animal tissues, consistent with expert pathologists’ assessment.
APA
Kshirsagar, V., Juturu, S., Raipuria, G. & Singhal, N.. (2026). Learning Structure-Aware Foundational Representation of Rat Testicular Tubules Using Multiple Instance Learning. Proceedings of The 9th International Conference on Medical Imaging with Deep Learning, in Proceedings of Machine Learning Research 315:3131-3151 Available from https://proceedings.mlr.press/v315/kshirsagar26a.html.

Related Material