Smart Forgetting for Safe  Online Learning with Gaussian Processes

Jonas Umlauft; Thomas Beckers; Alexandre Capone; Armin Lederer; Sandra Hirche

Smart Forgetting for Safe Online Learning with Gaussian Processes

Jonas Umlauft, Thomas Beckers, Alexandre Capone, Armin Lederer, Sandra Hirche

Proceedings of the 2nd Conference on Learning for Dynamics and Control, PMLR 120:160-169, 2020.

Abstract

The identification of unknown dynamical systems using supervised learning enables model-based control of systems that cannot be modeled based on first principles. While most control literature focuses on the analysis of a static dataset, online learning control, where data points are added while the controller is running, has rarely been studied in depth. In this paper, we present a novel approach for online learning control based on Gaussian process models. To avoid computational difficulties with growing datasets, we propose a safe forgetting mechanism. Using an entropy criterion, data points are evaluated with respect to the future trajectory of the closed loop system and are “forgotten” if the stability of the system can further be guaranteed. The approach is evaluated in a simulation and in a robotic experiment to show its real-time capability.

Cite this Paper

BibTeX


@InProceedings{pmlr-v120-umlauft20a,
  title = 	 {Smart Forgetting for Safe  Online Learning with Gaussian Processes},
  author =       {Umlauft, Jonas and Beckers, Thomas and Capone, Alexandre and Lederer, Armin and Hirche, Sandra},
  booktitle = 	 {Proceedings of the 2nd Conference on Learning for Dynamics and Control},
  pages = 	 {160--169},
  year = 	 {2020},
  editor = 	 {Bayen, Alexandre M. and Jadbabaie, Ali and Pappas, George and Parrilo, Pablo A. and Recht, Benjamin and Tomlin, Claire and Zeilinger, Melanie},
  volume = 	 {120},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {10--11 Jun},
  publisher =    {PMLR},
  pdf = 	 {http://proceedings.mlr.press/v120/umlauft20a/umlauft20a.pdf},
  url = 	 {https://proceedings.mlr.press/v120/umlauft20a.html},
  abstract = 	 {The identification of unknown dynamical systems using supervised learning enables model-based control of systems that cannot be modeled based on first principles. While most control literature focuses on the analysis of a static dataset, online learning control, where data points are added while the controller is running, has rarely been studied in depth. In this paper, we present a novel approach for online learning control based on Gaussian process models. To avoid computational difficulties with growing datasets, we propose a safe forgetting mechanism. Using an entropy criterion, data points are evaluated with respect to the future trajectory of the closed loop system and are “forgotten” if the stability of the system can further be guaranteed. The approach is evaluated in a simulation and in a robotic experiment to show its real-time capability.}
}

Endnote

%0 Conference Paper
%T Smart Forgetting for Safe  Online Learning with Gaussian Processes
%A Jonas Umlauft
%A Thomas Beckers
%A Alexandre Capone
%A Armin Lederer
%A Sandra Hirche
%B Proceedings of the 2nd Conference on Learning for Dynamics and Control
%C Proceedings of Machine Learning Research
%D 2020
%E Alexandre M. Bayen
%E Ali Jadbabaie
%E George Pappas
%E Pablo A. Parrilo
%E Benjamin Recht
%E Claire Tomlin
%E Melanie Zeilinger	
%F pmlr-v120-umlauft20a
%I PMLR
%P 160--169
%U https://proceedings.mlr.press/v120/umlauft20a.html
%V 120
%X The identification of unknown dynamical systems using supervised learning enables model-based control of systems that cannot be modeled based on first principles. While most control literature focuses on the analysis of a static dataset, online learning control, where data points are added while the controller is running, has rarely been studied in depth. In this paper, we present a novel approach for online learning control based on Gaussian process models. To avoid computational difficulties with growing datasets, we propose a safe forgetting mechanism. Using an entropy criterion, data points are evaluated with respect to the future trajectory of the closed loop system and are “forgotten” if the stability of the system can further be guaranteed. The approach is evaluated in a simulation and in a robotic experiment to show its real-time capability.

APA


Umlauft, J., Beckers, T., Capone, A., Lederer, A. & Hirche, S.. (2020). Smart Forgetting for Safe  Online Learning with Gaussian Processes. Proceedings of the 2nd Conference on Learning for Dynamics and Control, in Proceedings of Machine Learning Research 120:160-169 Available from https://proceedings.mlr.press/v120/umlauft20a.html.

Related Material

Download PDF