Agile Catching with Whole-Body MPC and Blackbox Policy Learning

Saminda Abeyruwan; Alex Bewley; Nicholas Matthew Boffi; Krzysztof Marcin Choromanski; David B D’Ambrosio; Deepali Jain; Pannag R Sanketi; Anish Shankar; Vikas Sindhwani; Sumeet Singh; Jean-Jacques Slotine; Stephen Tu

Agile Catching with Whole-Body MPC and Blackbox Policy Learning

Saminda Abeyruwan, Alex Bewley, Nicholas Matthew Boffi, Krzysztof Marcin Choromanski, David B D’Ambrosio, Deepali Jain, Pannag R Sanketi, Anish Shankar, Vikas Sindhwani, Sumeet Singh, Jean-Jacques Slotine, Stephen Tu

Proceedings of The 5th Annual Learning for Dynamics and Control Conference, PMLR 211:851-863, 2023.

Abstract

We address a benchmark task in agile robotics: catching objects thrown at high-speed. This is a challenging task that involves tracking, intercepting, and cradling a thrown object with access only to visual observations of the object and the proprioceptive state of the robot, all within a fraction of a second. We present the relative merits of two fundamentally different solution strategies: (i) Model Predictive Control using accelerated constrained trajectory optimization, and (ii) Reinforcement Learning using zeroth-order optimization. We provide insights into various performance tradeoffs including sample efficiency, sim-to-real transfer, robustness to distribution shifts, and whole-body multimodality via extensive on-hardware experiments. We conclude with proposals on fusing "classical" and "learning-based" techniques for agile robot control.

Cite this Paper

BibTeX


@InProceedings{pmlr-v211-abeyruwan23a,
  title = 	 {Agile Catching with Whole-Body MPC and Blackbox Policy Learning},
  author =       {Abeyruwan, Saminda and Bewley, Alex and Boffi, Nicholas Matthew and Choromanski, Krzysztof Marcin and D'Ambrosio, David B and Jain, Deepali and Sanketi, Pannag R and Shankar, Anish and Sindhwani, Vikas and Singh, Sumeet and Slotine, Jean-Jacques and Tu, Stephen},
  booktitle = 	 {Proceedings of The 5th Annual Learning for Dynamics and Control Conference},
  pages = 	 {851--863},
  year = 	 {2023},
  editor = 	 {Matni, Nikolai and Morari, Manfred and Pappas, George J.},
  volume = 	 {211},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {15--16 Jun},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v211/abeyruwan23a/abeyruwan23a.pdf},
  url = 	 {https://proceedings.mlr.press/v211/abeyruwan23a.html},
  abstract = 	 {We address a benchmark task in agile robotics: catching objects thrown at high-speed. This is a challenging task that involves tracking, intercepting, and cradling a thrown object with access only to visual observations of the object and the proprioceptive state of the robot, all within a fraction of a second. We present the relative merits of two fundamentally different solution strategies: (i) Model Predictive Control using accelerated constrained trajectory optimization, and (ii) Reinforcement Learning using zeroth-order optimization. We provide insights into various performance tradeoffs including sample efficiency, sim-to-real transfer, robustness to distribution shifts, and whole-body multimodality via extensive on-hardware experiments. We conclude with proposals on fusing "classical" and "learning-based" techniques for agile robot control. }
}

Endnote

%0 Conference Paper
%T Agile Catching with Whole-Body MPC and Blackbox Policy Learning
%A Saminda Abeyruwan
%A Alex Bewley
%A Nicholas Matthew Boffi
%A Krzysztof Marcin Choromanski
%A David B D’Ambrosio
%A Deepali Jain
%A Pannag R Sanketi
%A Anish Shankar
%A Vikas Sindhwani
%A Sumeet Singh
%A Jean-Jacques Slotine
%A Stephen Tu
%B Proceedings of The 5th Annual Learning for Dynamics and Control Conference
%C Proceedings of Machine Learning Research
%D 2023
%E Nikolai Matni
%E Manfred Morari
%E George J. Pappas	
%F pmlr-v211-abeyruwan23a
%I PMLR
%P 851--863
%U https://proceedings.mlr.press/v211/abeyruwan23a.html
%V 211
%X We address a benchmark task in agile robotics: catching objects thrown at high-speed. This is a challenging task that involves tracking, intercepting, and cradling a thrown object with access only to visual observations of the object and the proprioceptive state of the robot, all within a fraction of a second. We present the relative merits of two fundamentally different solution strategies: (i) Model Predictive Control using accelerated constrained trajectory optimization, and (ii) Reinforcement Learning using zeroth-order optimization. We provide insights into various performance tradeoffs including sample efficiency, sim-to-real transfer, robustness to distribution shifts, and whole-body multimodality via extensive on-hardware experiments. We conclude with proposals on fusing "classical" and "learning-based" techniques for agile robot control.

APA


Abeyruwan, S., Bewley, A., Boffi, N.M., Choromanski, K.M., D’Ambrosio, D.B., Jain, D., Sanketi, P.R., Shankar, A., Sindhwani, V., Singh, S., Slotine, J. & Tu, S.. (2023). Agile Catching with Whole-Body MPC and Blackbox Policy Learning. Proceedings of The 5th Annual Learning for Dynamics and Control Conference, in Proceedings of Machine Learning Research 211:851-863 Available from https://proceedings.mlr.press/v211/abeyruwan23a.html.

Agile Catching with Whole-Body MPC and Blackbox Policy Learning

Abstract

Cite this Paper

Related Material