| Authors | Akifumi Wachi, Xun Shen, Yanan Sui |
| Journal | IJCAI |
| Year | 2024 |
| DOI | 10.24963/ijcai.2024/913 |
What Problem It Solves
Provides a canonical overview or reference point for the relevant DoOperator research area.
Provides a canonical overview or reference point for the relevant DoOperator research area.
A survey of safe RL constraint formulations, representative algorithms, and the relationships among common constrained decision-making criteria.
Use when orienting a new paper, blog post, benchmark, or research plan in this area.
Do not cite the overview as evidence that a specific method works in a specific deployment setting without checking the underlying primary paper.
Related papers
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang, Lihong Li · 2015
PaperReinforcement Learning: An Introduction
Richard S. Sutton, Andrew G. Barto · 2018
PaperA Comprehensive Survey on Safe Reinforcement Learning
Javier Garcia, Fernando Fernandez · 2015
PaperNear-Optimal Reinforcement Learning in Dynamic Treatment Regimes
Junzhe Zhang, Elias Bareinboim · 2019