Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Yu Chen; Xiangcheng Zhang; Siwei Wang; Longbo Huang

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Yu Chen, Xiangcheng Zhang, Siwei Wang, Longbo Huang

Proceedings of the 41st International Conference on Machine Learning, PMLR 235:7748-7791, 2024.

Abstract

In the realm of reinforcement learning (RL), accounting for risk is crucial for making decisions under uncertainty, particularly in applications where safety and reliability are paramount. In this paper, we introduce a general framework on Risk-Sensitive Distributional Reinforcement Learning (RS-DisRL), with static Lipschitz Risk Measures (LRM) and general function approximation. Our framework covers a broad class of risk-sensitive RL, and facilitates analysis of the impact of estimation functions on the effectiveness of RSRL strategies and evaluation of their sample complexity. We design two innovative meta-algorithms: RS-DisRL-M, a model-based strategy for model-based function approximation, and RS-DisRL-V, a model-free approach for general value function approximation. With our novel estimation techniques via Least Squares Regression (LSR) and Maximum Likelihood Estimation (MLE) in distributional RL with augmented Markov Decision Process (MDP), we derive the first $\widetilde{\mathcal{O}}(\sqrt{K})$ dependency of the regret upper bound for RSRL with static LRM, marking a pioneering contribution towards statistically efficient algorithms in this domain.

Cite this Paper

BibTeX

@InProceedings{pmlr-v235-chen24bf,
  title = 	 {Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation},
  author =       {Chen, Yu and Zhang, Xiangcheng and Wang, Siwei and Huang, Longbo},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {7748--7791},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/chen24bf/chen24bf.pdf},
  url = 	 {https://proceedings.mlr.press/v235/chen24bf.html},
  abstract = 	 {In the realm of reinforcement learning (RL), accounting for risk is crucial for making decisions under uncertainty, particularly in applications where safety and reliability are paramount. In this paper, we introduce a general framework on Risk-Sensitive Distributional Reinforcement Learning (RS-DisRL), with static Lipschitz Risk Measures (LRM) and general function approximation. Our framework covers a broad class of risk-sensitive RL, and facilitates analysis of the impact of estimation functions on the effectiveness of RSRL strategies and evaluation of their sample complexity. We design two innovative meta-algorithms: RS-DisRL-M, a model-based strategy for model-based function approximation, and RS-DisRL-V, a model-free approach for general value function approximation. With our novel estimation techniques via Least Squares Regression (LSR) and Maximum Likelihood Estimation (MLE) in distributional RL with augmented Markov Decision Process (MDP), we derive the first $\widetilde{\mathcal{O}}(\sqrt{K})$ dependency of the regret upper bound for RSRL with static LRM, marking a pioneering contribution towards statistically efficient algorithms in this domain.}
}

Endnote

%0 Conference Paper
%T Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
%A Yu Chen
%A Xiangcheng Zhang
%A Siwei Wang
%A Longbo Huang
%B Proceedings of the 41st International Conference on Machine Learning
%C Proceedings of Machine Learning Research
%D 2024
%E Ruslan Salakhutdinov
%E Zico Kolter
%E Katherine Heller
%E Adrian Weller
%E Nuria Oliver
%E Jonathan Scarlett
%E Felix Berkenkamp	
%F pmlr-v235-chen24bf
%I PMLR
%P 7748--7791
%U https://proceedings.mlr.press/v235/chen24bf.html
%V 235
%X In the realm of reinforcement learning (RL), accounting for risk is crucial for making decisions under uncertainty, particularly in applications where safety and reliability are paramount. In this paper, we introduce a general framework on Risk-Sensitive Distributional Reinforcement Learning (RS-DisRL), with static Lipschitz Risk Measures (LRM) and general function approximation. Our framework covers a broad class of risk-sensitive RL, and facilitates analysis of the impact of estimation functions on the effectiveness of RSRL strategies and evaluation of their sample complexity. We design two innovative meta-algorithms: RS-DisRL-M, a model-based strategy for model-based function approximation, and RS-DisRL-V, a model-free approach for general value function approximation. With our novel estimation techniques via Least Squares Regression (LSR) and Maximum Likelihood Estimation (MLE) in distributional RL with augmented Markov Decision Process (MDP), we derive the first $\widetilde{\mathcal{O}}(\sqrt{K})$ dependency of the regret upper bound for RSRL with static LRM, marking a pioneering contribution towards statistically efficient algorithms in this domain.

APA

Chen, Y., Zhang, X., Wang, S. & Huang, L.. (2024). Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation. Proceedings of the 41st International Conference on Machine Learning, in Proceedings of Machine Learning Research 235:7748-7791 Available from https://proceedings.mlr.press/v235/chen24bf.html.

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Abstract

Cite this Paper

Related Material