| Authors | Javier Garcia, Fernando Fernandez |
| Journal | Journal of Machine Learning Research |
| Year | 2015 |
What Problem It Solves
Provides a canonical overview or reference point for the relevant DoOperator research area.
Provides a canonical overview or reference point for the relevant DoOperator research area.
A classic survey of safe reinforcement learning, including risk-sensitive criteria, constrained exploration, safety during learning, and external guidance.
Use when orienting a new paper, blog post, benchmark, or research plan in this area.
Do not cite the overview as evidence that a specific method works in a specific deployment setting without checking the underlying primary paper.
Related papers
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang, Lihong Li · 2015
PaperReinforcement Learning: An Introduction
Richard S. Sutton, Andrew G. Barto · 2018
PaperA Survey of Constraint Formulations in Safe Reinforcement Learning
Akifumi Wachi, Xun Shen, Yanan Sui · 2024
PaperNear-Optimal Reinforcement Learning in Dynamic Treatment Regimes
Junzhe Zhang, Elias Bareinboim · 2019