DEQ-MPC : Deep Equilibrium Model Predictive Control

Swaminathan Gurumurthy, Khai Nguyen, Arun L Bishop, J Zico Kolter, Zachary Manchester
Proceedings of The 9th Conference on Robot Learning, PMLR 305:961-980, 2025.

Abstract

Incorporating task-specific priors within a policy or network architecture is crucial for enhancing safety and improving representation and generalization in robotic control problems. Differentiable Model Predictive Control (MPC) layers have proven effective for embedding these priors, such as constraints and cost functions, directly within the architecture, enabling end-to-end training. However, current methods often treat the solver and the neural network as separate, independent entities, leading to suboptimal integration. In this work, we propose a novel approach that co-develops the solver and architecture unifying the optimization solver and network inference problems. Specifically, we formulate this as a \textit{joint fixed-point problem} over the coupled network outputs and necessary conditions of the optimization problem. We solve this problem in an iterative manner where we alternate between network forward passes and optimization iterations. Through extensive ablations in various robotic control tasks, we demonstrate that our approach results in richer representations and more stable training, while naturally accommodating warm starting, a key requirement for MPC.

Cite this Paper


BibTeX
@InProceedings{pmlr-v305-gurumurthy25a, title = {DEQ-MPC : Deep Equilibrium Model Predictive Control}, author = {Gurumurthy, Swaminathan and Nguyen, Khai and Bishop, Arun L and Kolter, J Zico and Manchester, Zachary}, booktitle = {Proceedings of The 9th Conference on Robot Learning}, pages = {961--980}, year = {2025}, editor = {Lim, Joseph and Song, Shuran and Park, Hae-Won}, volume = {305}, series = {Proceedings of Machine Learning Research}, month = {27--30 Sep}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v305/main/assets/gurumurthy25a/gurumurthy25a.pdf}, url = {https://proceedings.mlr.press/v305/gurumurthy25a.html}, abstract = {Incorporating task-specific priors within a policy or network architecture is crucial for enhancing safety and improving representation and generalization in robotic control problems. Differentiable Model Predictive Control (MPC) layers have proven effective for embedding these priors, such as constraints and cost functions, directly within the architecture, enabling end-to-end training. However, current methods often treat the solver and the neural network as separate, independent entities, leading to suboptimal integration. In this work, we propose a novel approach that co-develops the solver and architecture unifying the optimization solver and network inference problems. Specifically, we formulate this as a \textit{joint fixed-point problem} over the coupled network outputs and necessary conditions of the optimization problem. We solve this problem in an iterative manner where we alternate between network forward passes and optimization iterations. Through extensive ablations in various robotic control tasks, we demonstrate that our approach results in richer representations and more stable training, while naturally accommodating warm starting, a key requirement for MPC.} }
Endnote
%0 Conference Paper %T DEQ-MPC : Deep Equilibrium Model Predictive Control %A Swaminathan Gurumurthy %A Khai Nguyen %A Arun L Bishop %A J Zico Kolter %A Zachary Manchester %B Proceedings of The 9th Conference on Robot Learning %C Proceedings of Machine Learning Research %D 2025 %E Joseph Lim %E Shuran Song %E Hae-Won Park %F pmlr-v305-gurumurthy25a %I PMLR %P 961--980 %U https://proceedings.mlr.press/v305/gurumurthy25a.html %V 305 %X Incorporating task-specific priors within a policy or network architecture is crucial for enhancing safety and improving representation and generalization in robotic control problems. Differentiable Model Predictive Control (MPC) layers have proven effective for embedding these priors, such as constraints and cost functions, directly within the architecture, enabling end-to-end training. However, current methods often treat the solver and the neural network as separate, independent entities, leading to suboptimal integration. In this work, we propose a novel approach that co-develops the solver and architecture unifying the optimization solver and network inference problems. Specifically, we formulate this as a \textit{joint fixed-point problem} over the coupled network outputs and necessary conditions of the optimization problem. We solve this problem in an iterative manner where we alternate between network forward passes and optimization iterations. Through extensive ablations in various robotic control tasks, we demonstrate that our approach results in richer representations and more stable training, while naturally accommodating warm starting, a key requirement for MPC.
APA
Gurumurthy, S., Nguyen, K., Bishop, A.L., Kolter, J.Z. & Manchester, Z.. (2025). DEQ-MPC : Deep Equilibrium Model Predictive Control. Proceedings of The 9th Conference on Robot Learning, in Proceedings of Machine Learning Research 305:961-980 Available from https://proceedings.mlr.press/v305/gurumurthy25a.html.

Related Material