Residual Neural Terminal Constraint for MPC-based Collision Avoidance in Dynamic Environments

Bojan Derajic; Mohamed-Khalil Bouzidi; Sebastian Bernhard; Wolfgang Hönig

Residual Neural Terminal Constraint for MPC-based Collision Avoidance in Dynamic Environments

Bojan Derajic, Mohamed-Khalil Bouzidi, Sebastian Bernhard, Wolfgang Hönig

Proceedings of The 9th Conference on Robot Learning, PMLR 305:1452-1469, 2025.

Abstract

In this paper, we propose a hybrid MPC local planner that uses a learning-based approximation of a time-varying safe set, derived from local observations and applied as the MPC terminal constraint. This set can be represented as a zero-superlevel set of the value function computed via Hamilton-Jacobi (HJ) reachability analysis, which is infeasible in real-time. We exploit the property that the HJ value function can be expressed as a difference of the corresponding signed distance function (SDF) and a non-negative residual function. The residual component is modeled as a neural network with non-negative output and subtracted from the computed SDF, resulting in a real-time value function estimate that is at least as safe as the SDF by design. Additionally, we parametrize the neural residual by a hypernetwork to improve real-time performance and generalization properties. The proposed method is compared with three state-of-the-art methods in simulations and hardware experiments, achieving up to 30% higher success rates compared to the best baseline while requiring a similar computational effort and producing high-quality (low travel-time) solutions.

Cite this Paper

BibTeX

@InProceedings{pmlr-v305-derajic25a,
  title = 	 {Residual Neural Terminal Constraint for MPC-based Collision Avoidance in Dynamic Environments},
  author =       {Derajic, Bojan and Bouzidi, Mohamed-Khalil and Bernhard, Sebastian and H\"{o}nig, Wolfgang},
  booktitle = 	 {Proceedings of The 9th Conference on Robot Learning},
  pages = 	 {1452--1469},
  year = 	 {2025},
  editor = 	 {Lim, Joseph and Song, Shuran and Park, Hae-Won},
  volume = 	 {305},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {27--30 Sep},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v305/main/assets/derajic25a/derajic25a.pdf},
  url = 	 {https://proceedings.mlr.press/v305/derajic25a.html},
  abstract = 	 {In this paper, we propose a hybrid MPC local planner that uses a learning-based approximation of a time-varying safe set, derived from local observations and applied as the MPC terminal constraint. This set can be represented as a zero-superlevel set of the value function computed via Hamilton-Jacobi (HJ) reachability analysis, which is infeasible in real-time. We exploit the property that the HJ value function can be expressed as a difference of the corresponding signed distance function (SDF) and a non-negative residual function. The residual component is modeled as a neural network with non-negative output and subtracted from the computed SDF, resulting in a real-time value function estimate that is at least as safe as the SDF by design. Additionally, we parametrize the neural residual by a hypernetwork to improve real-time performance and generalization properties. The proposed method is compared with three state-of-the-art methods in simulations and hardware experiments, achieving up to 30% higher success rates compared to the best baseline while requiring a similar computational effort and producing high-quality (low travel-time) solutions.}
}

Endnote

%0 Conference Paper
%T Residual Neural Terminal Constraint for MPC-based Collision Avoidance in Dynamic Environments
%A Bojan Derajic
%A Mohamed-Khalil Bouzidi
%A Sebastian Bernhard
%A Wolfgang Hönig
%B Proceedings of The 9th Conference on Robot Learning
%C Proceedings of Machine Learning Research
%D 2025
%E Joseph Lim
%E Shuran Song
%E Hae-Won Park	
%F pmlr-v305-derajic25a
%I PMLR
%P 1452--1469
%U https://proceedings.mlr.press/v305/derajic25a.html
%V 305
%X In this paper, we propose a hybrid MPC local planner that uses a learning-based approximation of a time-varying safe set, derived from local observations and applied as the MPC terminal constraint. This set can be represented as a zero-superlevel set of the value function computed via Hamilton-Jacobi (HJ) reachability analysis, which is infeasible in real-time. We exploit the property that the HJ value function can be expressed as a difference of the corresponding signed distance function (SDF) and a non-negative residual function. The residual component is modeled as a neural network with non-negative output and subtracted from the computed SDF, resulting in a real-time value function estimate that is at least as safe as the SDF by design. Additionally, we parametrize the neural residual by a hypernetwork to improve real-time performance and generalization properties. The proposed method is compared with three state-of-the-art methods in simulations and hardware experiments, achieving up to 30% higher success rates compared to the best baseline while requiring a similar computational effort and producing high-quality (low travel-time) solutions.

APA

Derajic, B., Bouzidi, M., Bernhard, S. & Hönig, W.. (2025). Residual Neural Terminal Constraint for MPC-based Collision Avoidance in Dynamic Environments. Proceedings of The 9th Conference on Robot Learning, in Proceedings of Machine Learning Research 305:1452-1469 Available from https://proceedings.mlr.press/v305/derajic25a.html.

Residual Neural Terminal Constraint for MPC-based Collision Avoidance in Dynamic Environments

Abstract

Cite this Paper

Related Material