Semi-Supervised Learning of Decision-Making Models for Human-Robot Collaboration

Vaibhav V. Unhelkar, Shen Li, Julie A. Shah
Proceedings of the Conference on Robot Learning, PMLR 100:192-203, 2020.

Abstract

We consider human-robot collaboration in sequential tasks with known task objectives. For interaction planning in this setting, the utility of models for decision-making under uncertainty has been demonstrated across domains. However, in practice, specifying the model parameters remains challenging, requiring significant effort from the robot developer. To alleviate this challenge, we present ADACORL, a framework to specify decision-making models and generate robot behavior for interaction. Central to our approach are a factored task model and a semi-supervised algorithm to learn models of human behavior. We demonstrate that our specification approach, despite significantly fewer labels, generates models (and policies) that perform equally well or better than models learned with supervised data. By leveraging pre-computed performance bounds and an online planner, ADACORL can generate robot behavior for collaborative tasks with large state spaces (> 1 million states) and short planning times (< 0.5 s).

Cite this Paper


BibTeX
@InProceedings{pmlr-v100-unhelkar20a,
  title = {Semi-Supervised Learning of Decision-Making Models for Human-Robot Collaboration},
  author = {Unhelkar, Vaibhav V. and Li, Shen and Shah, Julie A.},
  booktitle = {Proceedings of the Conference on Robot Learning},
  pages = {192--203},
  year = {2020},
  editor = {Kaelbling, Leslie Pack and Kragic, Danica and Sugiura, Komei},
  volume = {100},
  series = {Proceedings of Machine Learning Research},
  month = {30 Oct--01 Nov},
  publisher = {PMLR},
  pdf = {http://proceedings.mlr.press/v100/unhelkar20a/unhelkar20a.pdf},
  url = {https://proceedings.mlr.press/v100/unhelkar20a.html},
  abstract = {We consider human-robot collaboration in sequential tasks with known task objectives. For interaction planning in this setting, the utility of models for decision-making under uncertainty has been demonstrated across domains. However, in practice, specifying the model parameters remains challenging, requiring significant effort from the robot developer. To alleviate this challenge, we present ADACORL, a framework to specify decision-making models and generate robot behavior for interaction. Central to our approach are a factored task model and a semi-supervised algorithm to learn models of human behavior. We demonstrate that our specification approach, despite significantly fewer labels, generates models (and policies) that perform equally well or better than models learned with supervised data. By leveraging pre-computed performance bounds and an online planner, ADACORL can generate robot behavior for collaborative tasks with large state spaces (> 1 million states) and short planning times (< 0.5 s).}
}
Endnote
%0 Conference Paper
%T Semi-Supervised Learning of Decision-Making Models for Human-Robot Collaboration
%A Vaibhav V. Unhelkar
%A Shen Li
%A Julie A. Shah
%B Proceedings of the Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2020
%E Leslie Pack Kaelbling
%E Danica Kragic
%E Komei Sugiura
%F pmlr-v100-unhelkar20a
%I PMLR
%P 192--203
%U https://proceedings.mlr.press/v100/unhelkar20a.html
%V 100
%X We consider human-robot collaboration in sequential tasks with known task objectives. For interaction planning in this setting, the utility of models for decision-making under uncertainty has been demonstrated across domains. However, in practice, specifying the model parameters remains challenging, requiring significant effort from the robot developer. To alleviate this challenge, we present ADACORL, a framework to specify decision-making models and generate robot behavior for interaction. Central to our approach are a factored task model and a semi-supervised algorithm to learn models of human behavior. We demonstrate that our specification approach, despite significantly fewer labels, generates models (and policies) that perform equally well or better than models learned with supervised data. By leveraging pre-computed performance bounds and an online planner, ADACORL can generate robot behavior for collaborative tasks with large state spaces (> 1 million states) and short planning times (< 0.5 s).
APA
Unhelkar, V.V., Li, S. & Shah, J.A. (2020). Semi-Supervised Learning of Decision-Making Models for Human-Robot Collaboration. Proceedings of the Conference on Robot Learning, in Proceedings of Machine Learning Research 100:192-203. Available from https://proceedings.mlr.press/v100/unhelkar20a.html.
