O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning

Kaichun Mo; Yuzhe Qin; Fanbo Xiang; Hao Su; Leonidas Guibas

O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning

Kaichun Mo, Yuzhe Qin, Fanbo Xiang, Hao Su, Leonidas Guibas

Proceedings of the 5th Conference on Robot Learning, PMLR 164:1666-1677, 2022.

Abstract

Contrary to the vast literature in modeling, perceiving, and understanding agent-object (e.g., human-object, hand-object, robot-object) interaction in computer vision and robotics, very few past works have studied the task of object-object interaction, which also plays an important role in robotic manipulation and planning tasks. There is a rich space of object-object interaction scenarios in our daily life, such as placing an object on a messy tabletop, fitting an object inside a drawer, pushing an object using a tool, etc. In this paper, we propose a unified affordance learning framework to learn object-object interaction for various tasks. By constructing four object-object interaction task environments using physical simulation (SAPIEN) and thousands of ShapeNet models with rich geometric diversity, we are able to conduct large-scale object-object affordance learning without the need for human annotations or demonstrations. At the core of technical contribution, we propose an object-kernel point convolution network to reason about detailed interaction between two objects. Experiments on large-scale synthetic data and real-world data prove the effectiveness of the proposed approach.

Cite this Paper

BibTeX


@InProceedings{pmlr-v164-mo22b,
  title = 	 {O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning},
  author =       {Mo, Kaichun and Qin, Yuzhe and Xiang, Fanbo and Su, Hao and Guibas, Leonidas},
  booktitle = 	 {Proceedings of the 5th Conference on Robot Learning},
  pages = 	 {1666--1677},
  year = 	 {2022},
  editor = 	 {Faust, Aleksandra and Hsu, David and Neumann, Gerhard},
  volume = 	 {164},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {08--11 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v164/mo22b/mo22b.pdf},
  url = 	 {https://proceedings.mlr.press/v164/mo22b.html},
  abstract = 	 {Contrary to the vast literature in modeling, perceiving, and understanding agent-object (e.g., human-object, hand-object, robot-object) interaction in computer vision and robotics, very few past works have studied the task of object-object interaction, which also plays an important role in robotic manipulation and planning tasks. There is a rich space of object-object interaction scenarios in our daily life, such as placing an object on a messy tabletop, fitting an object inside a drawer, pushing an object using a tool, etc. In this paper, we propose a unified affordance learning framework to learn object-object interaction for various tasks. By constructing four object-object interaction task environments using physical simulation (SAPIEN) and thousands of ShapeNet models with rich geometric diversity, we are able to conduct large-scale object-object affordance learning without the need for human annotations or demonstrations. At the core of technical contribution, we propose an object-kernel point convolution network to reason about detailed interaction between two objects. Experiments on large-scale synthetic data and real-world data prove the effectiveness of the proposed approach.}
}

Endnote

%0 Conference Paper
%T O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning
%A Kaichun Mo
%A Yuzhe Qin
%A Fanbo Xiang
%A Hao Su
%A Leonidas Guibas
%B Proceedings of the 5th Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2022
%E Aleksandra Faust
%E David Hsu
%E Gerhard Neumann	
%F pmlr-v164-mo22b
%I PMLR
%P 1666--1677
%U https://proceedings.mlr.press/v164/mo22b.html
%V 164
%X Contrary to the vast literature in modeling, perceiving, and understanding agent-object (e.g., human-object, hand-object, robot-object) interaction in computer vision and robotics, very few past works have studied the task of object-object interaction, which also plays an important role in robotic manipulation and planning tasks. There is a rich space of object-object interaction scenarios in our daily life, such as placing an object on a messy tabletop, fitting an object inside a drawer, pushing an object using a tool, etc. In this paper, we propose a unified affordance learning framework to learn object-object interaction for various tasks. By constructing four object-object interaction task environments using physical simulation (SAPIEN) and thousands of ShapeNet models with rich geometric diversity, we are able to conduct large-scale object-object affordance learning without the need for human annotations or demonstrations. At the core of technical contribution, we propose an object-kernel point convolution network to reason about detailed interaction between two objects. Experiments on large-scale synthetic data and real-world data prove the effectiveness of the proposed approach.

APA


Mo, K., Qin, Y., Xiang, F., Su, H. & Guibas, L.. (2022). O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning. Proceedings of the 5th Conference on Robot Learning, in Proceedings of Machine Learning Research 164:1666-1677 Available from https://proceedings.mlr.press/v164/mo22b.html.

O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning

Abstract

Cite this Paper

Related Material