Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Zhepeng Cen; Yihang Yao; Zuxin Liu; Ding Zhao

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Zhepeng Cen, Yihang Yao, Zuxin Liu, Ding Zhao

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:6002-6019, 2024.

Abstract

In the field of safe reinforcement learning (RL), finding a balance between satisfying safety constraints and optimizing reward performance presents a significant challenge. A key obstacle in this endeavor is the estimation of safety constraints, which is typically more difficult than estimating a reward metric due to the sparse nature of the constraint signals. To address this issue, we introduce a novel framework named Feasibility Consistent Safe Reinforcement Learning (FCSRL). This framework combines representation learning with feasibility-oriented objectives to identify and extract safety-related information from the raw state for safe RL. Leveraging self-supervised learning techniques and a more learnable safety metric, our approach enhances the policy learning and constraint estimation. Empirical evaluations across a range of vector-state and image-based tasks demonstrate that our method is capable of learning a better safety-aware embedding and achieving superior performance than previous representation learning baselines.

Cite this Paper

BibTeX


@InProceedings{pmlr-v235-cen24b,
  title = 	 {Feasibility Consistent Representation Learning for Safe Reinforcement Learning},
  author =       {Cen, Zhepeng and Yao, Yihang and Liu, Zuxin and Zhao, Ding},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {6002--6019},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/cen24b/cen24b.pdf},
  url = 	 {https://proceedings.mlr.press/v235/cen24b.html},
  abstract = 	 {In the field of safe reinforcement learning (RL), finding a balance between satisfying safety constraints and optimizing reward performance presents a significant challenge. A key obstacle in this endeavor is the estimation of safety constraints, which is typically more difficult than estimating a reward metric due to the sparse nature of the constraint signals. To address this issue, we introduce a novel framework named Feasibility Consistent Safe Reinforcement Learning (FCSRL). This framework combines representation learning with feasibility-oriented objectives to identify and extract safety-related information from the raw state for safe RL. Leveraging self-supervised learning techniques and a more learnable safety metric, our approach enhances the policy learning and constraint estimation. Empirical evaluations across a range of vector-state and image-based tasks demonstrate that our method is capable of learning a better safety-aware embedding and achieving superior performance than previous representation learning baselines.}
}

Endnote

%0 Conference Paper
%T Feasibility Consistent Representation Learning for Safe Reinforcement Learning
%A Zhepeng Cen
%A Yihang Yao
%A Zuxin Liu
%A Ding Zhao
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-cen24b
%I PMLR
%P 6002--6019
%U https://proceedings.mlr.press/v235/cen24b.html
%V 235
%X In the field of safe reinforcement learning (RL), finding a balance between satisfying safety constraints and optimizing reward performance presents a significant challenge. A key obstacle in this endeavor is the estimation of safety constraints, which is typically more difficult than estimating a reward metric due to the sparse nature of the constraint signals. To address this issue, we introduce a novel framework named Feasibility Consistent Safe Reinforcement Learning (FCSRL). This framework combines representation learning with feasibility-oriented objectives to identify and extract safety-related information from the raw state for safe RL. Leveraging self-supervised learning techniques and a more learnable safety metric, our approach enhances the policy learning and constraint estimation. Empirical evaluations across a range of vector-state and image-based tasks demonstrate that our method is capable of learning a better safety-aware embedding and achieving superior performance than previous representation learning baselines.

APA


Cen, Z., Yao, Y., Liu, Z. & Zhao, D.. (2024). Feasibility Consistent Representation Learning for Safe Reinforcement Learning. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:6002-6019 Available from https://proceedings.mlr.press/v235/cen24b.html.

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Abstract

Cite this Paper

Related Material