← Research / Reinforcement Learning

Reinforcement Learning: An Introduction

Read full paper →
AuthorsRichard S. Sutton, Andrew G. Barto
JournalMIT Press
Year2018

What Problem It Solves

Provides a canonical overview or reference point for the relevant DoOperator research area.

What problem it solves

Provides a canonical overview or reference point for the relevant DoOperator research area.

How it works

The standard textbook introduction to reinforcement learning, covering MDPs, value functions, temporal-difference learning, policy gradients, and core algorithms.

When to use it

Use when orienting a new paper, blog post, benchmark, or research plan in this area.

Limitations and failure modes

Do not cite the overview as evidence that a specific method works in a specific deployment setting without checking the underlying primary paper.

Read full paper →More Reinforcement Learning

Related papers

Paper

Doubly Robust Off-policy Value Evaluation for Reinforcement Learning

Nan Jiang, Lihong Li · 2015

Paper

A Survey of Constraint Formulations in Safe Reinforcement Learning

Akifumi Wachi, Xun Shen, Yanan Sui · 2024

Paper

A Comprehensive Survey on Safe Reinforcement Learning

Javier Garcia, Fernando Fernandez · 2015

Paper

Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes

Junzhe Zhang, Elias Bareinboim · 2019