SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects

Ning Gao, Vien Anh Ngo, Hanna Ziesche, Gerhard Neumann
Proceedings of The 7th Conference on Robot Learning, PMLR 229:1572-1595, 2023.

Abstract

To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects. Most existing approaches have difficulties to extend predictions to scenarios where novel object instances are continuously introduced, especially with heavy occlusions. In this work, we propose a few-shot pose estimation (FSPE) approach called SA6D, which uses a self-adaptive segmentation module to identify the novel target object and construct a point cloud model of the target object using only a small number of cluttered reference images. Unlike existing methods, SA6D does not require object-centric reference images or any additional object information, making it a more generalizable and scalable solution across categories. We evaluate SA6D on real-world tabletop object datasets and demonstrate that SA6D outperforms existing FSPE methods, particularly in cluttered scenes with occlusions, while requiring fewer reference images.

Cite this Paper


BibTeX
@InProceedings{pmlr-v229-gao23a, title = {SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects}, author = {Gao, Ning and Ngo, Vien Anh and Ziesche, Hanna and Neumann, Gerhard}, booktitle = {Proceedings of The 7th Conference on Robot Learning}, pages = {1572--1595}, year = {2023}, editor = {Tan, Jie and Toussaint, Marc and Darvish, Kourosh}, volume = {229}, series = {Proceedings of Machine Learning Research}, month = {06--09 Nov}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v229/gao23a/gao23a.pdf}, url = {https://proceedings.mlr.press/v229/gao23a.html}, abstract = {To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects. Most existing approaches have difficulties to extend predictions to scenarios where novel object instances are continuously introduced, especially with heavy occlusions. In this work, we propose a few-shot pose estimation (FSPE) approach called SA6D, which uses a self-adaptive segmentation module to identify the novel target object and construct a point cloud model of the target object using only a small number of cluttered reference images. Unlike existing methods, SA6D does not require object-centric reference images or any additional object information, making it a more generalizable and scalable solution across categories. We evaluate SA6D on real-world tabletop object datasets and demonstrate that SA6D outperforms existing FSPE methods, particularly in cluttered scenes with occlusions, while requiring fewer reference images.} }
Endnote
%0 Conference Paper %T SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects %A Ning Gao %A Vien Anh Ngo %A Hanna Ziesche %A Gerhard Neumann %B Proceedings of The 7th Conference on Robot Learning %C Proceedings of Machine Learning Research %D 2023 %E Jie Tan %E Marc Toussaint %E Kourosh Darvish %F pmlr-v229-gao23a %I PMLR %P 1572--1595 %U https://proceedings.mlr.press/v229/gao23a.html %V 229 %X To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects. Most existing approaches have difficulties to extend predictions to scenarios where novel object instances are continuously introduced, especially with heavy occlusions. In this work, we propose a few-shot pose estimation (FSPE) approach called SA6D, which uses a self-adaptive segmentation module to identify the novel target object and construct a point cloud model of the target object using only a small number of cluttered reference images. Unlike existing methods, SA6D does not require object-centric reference images or any additional object information, making it a more generalizable and scalable solution across categories. We evaluate SA6D on real-world tabletop object datasets and demonstrate that SA6D outperforms existing FSPE methods, particularly in cluttered scenes with occlusions, while requiring fewer reference images.
APA
Gao, N., Ngo, V.A., Ziesche, H. & Neumann, G.. (2023). SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects. Proceedings of The 7th Conference on Robot Learning, in Proceedings of Machine Learning Research 229:1572-1595 Available from https://proceedings.mlr.press/v229/gao23a.html.

Related Material