Contrastive Forward Prediction Reinforcement Learning for Adaptive Fault-Tolerant Legged Robots

Yangqing Fu, Yang Zhang, Qiyue Yang, Liyun Yan, Zhanxiang Cao, Yue Gao
Proceedings of The 9th Conference on Robot Learning, PMLR 305:3285-3303, 2025.

Abstract

In complex environments, adaptive and fault-tolerant capabilities are essential for legged robot locomotion. To address this challenge, this study proposes a reinforcement learning framework that integrates contrastive learning with forward prediction to achieve fault-tolerant locomotion for legged robots. This framework constructs a forward prediction model with contrastive learning, incorporating a comparator and a forward model. The forward model predicts the robot’s subsequent state, and the comparator compares these predictions with actual states to generate critical prediction errors. These errors are systematically integrated into the controller, facilitating the continuous adjustment and refinement of control signals.Experiments on quadruped robots across different terrains and various joint damage scenarios have verified the effectiveness of our method, especially the functions of the comparator and the forward model. Furthermore, robots can adapt to locked joints without prior training, demonstrating zero-shot transfer capability. Finally, the proposed method demonstrates universal applicability to both quadruped and hexapod robots, highlighting its potential for broader applications in legged robotics.

Cite this Paper


BibTeX
@InProceedings{pmlr-v305-fu25b, title = {Contrastive Forward Prediction Reinforcement Learning for Adaptive Fault-Tolerant Legged Robots}, author = {Fu, Yangqing and Zhang, Yang and Yang, Qiyue and Yan, Liyun and Cao, Zhanxiang and Gao, Yue}, booktitle = {Proceedings of The 9th Conference on Robot Learning}, pages = {3285--3303}, year = {2025}, editor = {Lim, Joseph and Song, Shuran and Park, Hae-Won}, volume = {305}, series = {Proceedings of Machine Learning Research}, month = {27--30 Sep}, publisher = {PMLR}, pdf = {https://raw.githubusercontent.com/mlresearch/v305/main/assets/fu25b/fu25b.pdf}, url = {https://proceedings.mlr.press/v305/fu25b.html}, abstract = {In complex environments, adaptive and fault-tolerant capabilities are essential for legged robot locomotion. To address this challenge, this study proposes a reinforcement learning framework that integrates contrastive learning with forward prediction to achieve fault-tolerant locomotion for legged robots. This framework constructs a forward prediction model with contrastive learning, incorporating a comparator and a forward model. The forward model predicts the robot’s subsequent state, and the comparator compares these predictions with actual states to generate critical prediction errors. These errors are systematically integrated into the controller, facilitating the continuous adjustment and refinement of control signals.Experiments on quadruped robots across different terrains and various joint damage scenarios have verified the effectiveness of our method, especially the functions of the comparator and the forward model. Furthermore, robots can adapt to locked joints without prior training, demonstrating zero-shot transfer capability. Finally, the proposed method demonstrates universal applicability to both quadruped and hexapod robots, highlighting its potential for broader applications in legged robotics.} }
Endnote
%0 Conference Paper %T Contrastive Forward Prediction Reinforcement Learning for Adaptive Fault-Tolerant Legged Robots %A Yangqing Fu %A Yang Zhang %A Qiyue Yang %A Liyun Yan %A Zhanxiang Cao %A Yue Gao %B Proceedings of The 9th Conference on Robot Learning %C Proceedings of Machine Learning Research %D 2025 %E Joseph Lim %E Shuran Song %E Hae-Won Park %F pmlr-v305-fu25b %I PMLR %P 3285--3303 %U https://proceedings.mlr.press/v305/fu25b.html %V 305 %X In complex environments, adaptive and fault-tolerant capabilities are essential for legged robot locomotion. To address this challenge, this study proposes a reinforcement learning framework that integrates contrastive learning with forward prediction to achieve fault-tolerant locomotion for legged robots. This framework constructs a forward prediction model with contrastive learning, incorporating a comparator and a forward model. The forward model predicts the robot’s subsequent state, and the comparator compares these predictions with actual states to generate critical prediction errors. These errors are systematically integrated into the controller, facilitating the continuous adjustment and refinement of control signals.Experiments on quadruped robots across different terrains and various joint damage scenarios have verified the effectiveness of our method, especially the functions of the comparator and the forward model. Furthermore, robots can adapt to locked joints without prior training, demonstrating zero-shot transfer capability. Finally, the proposed method demonstrates universal applicability to both quadruped and hexapod robots, highlighting its potential for broader applications in legged robotics.
APA
Fu, Y., Zhang, Y., Yang, Q., Yan, L., Cao, Z. & Gao, Y.. (2025). Contrastive Forward Prediction Reinforcement Learning for Adaptive Fault-Tolerant Legged Robots. Proceedings of The 9th Conference on Robot Learning, in Proceedings of Machine Learning Research 305:3285-3303 Available from https://proceedings.mlr.press/v305/fu25b.html.

Related Material