Reinforcement Learning: An Introduction — DoOperator Research

Authors	Richard S. Sutton, Andrew G. Barto
Journal	MIT Press
Year	2018

What Problem It Solves

Provides a canonical overview or reference point for the relevant DoOperator research area.

What problem it solves

Provides a canonical overview or reference point for the relevant DoOperator research area.

How it works

The standard textbook introduction to reinforcement learning, covering MDPs, value functions, temporal-difference learning, policy gradients, and core algorithms.

When to use it

Use when orienting a new paper, blog post, benchmark, or research plan in this area.

Limitations and failure modes

Do not cite the overview as evidence that a specific method works in a specific deployment setting without checking the underlying primary paper.

Read full paper →More Reinforcement Learning →