Label conditioned segmentation

Tianyu Ma; Benjamin C Lee; Mert R Sabuncu

Label conditioned segmentation

Tianyu Ma, Benjamin C Lee, Mert R Sabuncu

Proceedings of The 5th International Conference on Medical Imaging with Deep Learning, PMLR 172:847-857, 2022.

Abstract

Semantic segmentation is an important task in computer vision that is often tackled with convolutional neural networks (CNNs). A CNN learns to produce pixel-level predictions through training on pairs of images and their corresponding ground-truth segmentation labels. For segmentation tasks with multiple classes, the standard approach is to use a network that computes a multi-channel probabilistic segmentation map, with each channel representing one class. In applications where the image grid size (e.g., when it is a 3D volume) and/or the number of labels is relatively large, the standard (baseline) approach can become prohibitively expensive for our computational resources. In this paper, we propose a simple yet effective method to address this challenge. In our approach, the segmentation network produces a single-channel output, while being conditioned on a single class label, which determines the output class of the network. Our method, called label conditioned segmentation (LCS), can be used to segment images with a very large number of classes, which might be infeasible for the baseline approach. We also demonstrate in the experiments that label conditioning can improve the accuracy of a given backbone architecture. Finally, as we show in our results, an LCS model can produce previously unseen fine-grained labels during inference time, when only coarse labels were available during training. We provide our code here: https://github.com/tym002/Label-conditioned-segmentation

Cite this Paper

BibTeX


@InProceedings{pmlr-v172-ma22a,
  title = 	 {Label conditioned segmentation},
  author =       {Ma, Tianyu and Lee, Benjamin C and Sabuncu, Mert R},
  booktitle = 	 {Proceedings of The 5th International Conference on Medical Imaging with Deep Learning},
  pages = 	 {847--857},
  year = 	 {2022},
  editor = 	 {Konukoglu, Ender and Menze, Bjoern and Venkataraman, Archana and Baumgartner, Christian and Dou, Qi and Albarqouni, Shadi},
  volume = 	 {172},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {06--08 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v172/ma22a/ma22a.pdf},
  url = 	 {https://proceedings.mlr.press/v172/ma22a.html},
  abstract = 	 {Semantic segmentation is an important task in computer vision that is often tackled with convolutional neural networks (CNNs). A CNN learns to produce pixel-level predictions through training on pairs of images and their corresponding ground-truth segmentation labels. For segmentation tasks with multiple classes, the standard approach is to use a network that computes a multi-channel probabilistic segmentation map, with each channel representing one class. In applications where the image grid size (e.g., when it is a 3D volume) and/or the number of labels is relatively large, the standard (baseline) approach can become prohibitively expensive for our computational resources. In this paper, we propose a simple yet effective method to address this challenge. In our approach, the segmentation network produces a single-channel output, while being conditioned on a single class label, which determines the output class of the network. Our method, called label conditioned segmentation (LCS), can be used to segment images with a very large number of classes, which might be infeasible for the baseline approach. We also demonstrate in the experiments that label conditioning can improve the accuracy of a given backbone architecture. Finally, as we show in our results, an LCS model can produce previously unseen fine-grained labels during inference time, when only coarse labels were available during training. We provide our code here: https://github.com/tym002/Label-conditioned-segmentation}
}

Endnote

%0 Conference Paper
%T Label conditioned segmentation
%A Tianyu Ma
%A Benjamin C Lee
%A Mert R Sabuncu
%B Proceedings of The 5th International Conference on Medical Imaging with Deep Learning
%C Proceedings of Machine Learning Research
%D 2022
%E Ender Konukoglu
%E Bjoern Menze
%E Archana Venkataraman
%E Christian Baumgartner
%E Qi Dou
%E Shadi Albarqouni	
%F pmlr-v172-ma22a
%I PMLR
%P 847--857
%U https://proceedings.mlr.press/v172/ma22a.html
%V 172
%X Semantic segmentation is an important task in computer vision that is often tackled with convolutional neural networks (CNNs). A CNN learns to produce pixel-level predictions through training on pairs of images and their corresponding ground-truth segmentation labels. For segmentation tasks with multiple classes, the standard approach is to use a network that computes a multi-channel probabilistic segmentation map, with each channel representing one class. In applications where the image grid size (e.g., when it is a 3D volume) and/or the number of labels is relatively large, the standard (baseline) approach can become prohibitively expensive for our computational resources. In this paper, we propose a simple yet effective method to address this challenge. In our approach, the segmentation network produces a single-channel output, while being conditioned on a single class label, which determines the output class of the network. Our method, called label conditioned segmentation (LCS), can be used to segment images with a very large number of classes, which might be infeasible for the baseline approach. We also demonstrate in the experiments that label conditioning can improve the accuracy of a given backbone architecture. Finally, as we show in our results, an LCS model can produce previously unseen fine-grained labels during inference time, when only coarse labels were available during training. We provide our code here: https://github.com/tym002/Label-conditioned-segmentation

APA


Ma, T., Lee, B.C. & Sabuncu, M.R.. (2022). Label conditioned segmentation. Proceedings of The 5th International Conference on Medical Imaging with Deep Learning, in Proceedings of Machine Learning Research 172:847-857 Available from https://proceedings.mlr.press/v172/ma22a.html.

Related Material

Download PDF