Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions

Ricson Cheng, Arpit Agarwal, Katerina Fragkiadaki
Proceedings of The 2nd Conference on Robot Learning, PMLR 87:422-431, 2018.

Abstract

We consider artificial agents that learn to jointly control their gripper and camera in order to reinforcement learn manipulation policies in the presence of occlusions from distractor objects. Distractors often occlude the object of interest and cause it to disappear from the field of view. We propose hand/eye controllers that learn to move the camera to keep the object within the field of view and visible, in coordination to manipulating it to achieve the desired goal, e.g., pushing it to a target location. We incorporate structural biases of object-centric attention within our actor-critic architectures, which our experiments suggest to be a key for good performance. Our results further highlight the importance of curriculum with regards to environment difficulty. The resulting active vision / manipulation policies outperform static camera setups for a variety of cluttered environments.

Cite this Paper


BibTeX
@InProceedings{pmlr-v87-cheng18a, title = {Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions}, author = {Cheng, Ricson and Agarwal, Arpit and Fragkiadaki, Katerina}, booktitle = {Proceedings of The 2nd Conference on Robot Learning}, pages = {422--431}, year = {2018}, editor = {Billard, Aude and Dragan, Anca and Peters, Jan and Morimoto, Jun}, volume = {87}, series = {Proceedings of Machine Learning Research}, month = {29--31 Oct}, publisher = {PMLR}, pdf = {http://proceedings.mlr.press/v87/cheng18a/cheng18a.pdf}, url = {https://proceedings.mlr.press/v87/cheng18a.html}, abstract = {We consider artificial agents that learn to jointly control their gripper and camera in order to reinforcement learn manipulation policies in the presence of occlusions from distractor objects. Distractors often occlude the object of interest and cause it to disappear from the field of view. We propose hand/eye controllers that learn to move the camera to keep the object within the field of view and visible, in coordination to manipulating it to achieve the desired goal, e.g., pushing it to a target location. We incorporate structural biases of object-centric attention within our actor-critic architectures, which our experiments suggest to be a key for good performance. Our results further highlight the importance of curriculum with regards to environment difficulty. The resulting active vision / manipulation policies outperform static camera setups for a variety of cluttered environments. } }
Endnote
%0 Conference Paper %T Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions %A Ricson Cheng %A Arpit Agarwal %A Katerina Fragkiadaki %B Proceedings of The 2nd Conference on Robot Learning %C Proceedings of Machine Learning Research %D 2018 %E Aude Billard %E Anca Dragan %E Jan Peters %E Jun Morimoto %F pmlr-v87-cheng18a %I PMLR %P 422--431 %U https://proceedings.mlr.press/v87/cheng18a.html %V 87 %X We consider artificial agents that learn to jointly control their gripper and camera in order to reinforcement learn manipulation policies in the presence of occlusions from distractor objects. Distractors often occlude the object of interest and cause it to disappear from the field of view. We propose hand/eye controllers that learn to move the camera to keep the object within the field of view and visible, in coordination to manipulating it to achieve the desired goal, e.g., pushing it to a target location. We incorporate structural biases of object-centric attention within our actor-critic architectures, which our experiments suggest to be a key for good performance. Our results further highlight the importance of curriculum with regards to environment difficulty. The resulting active vision / manipulation policies outperform static camera setups for a variety of cluttered environments.
APA
Cheng, R., Agarwal, A. & Fragkiadaki, K.. (2018). Reinforcement Learning of Active Vision for Manipulating Objects under Occlusions. Proceedings of The 2nd Conference on Robot Learning, in Proceedings of Machine Learning Research 87:422-431 Available from https://proceedings.mlr.press/v87/cheng18a.html.

Related Material