← Research / Sequential Decisions

Designing Optimal Dynamic Treatment Regimes: A Causal Reinforcement Learning Approach

PaperWikiSequential DecisionsModerate
Read full paper →
AuthorsJunzhe Zhang
JournalInternational Conference on Machine Learning
Year2020
Citations16

What Problem It Solves

Policy optimization in treatment settings can be invalid if it ignores how treatments were assigned and how confounders evolve.

What problem it solves

Policy optimization in treatment settings can be invalid if it ignores how treatments were assigned and how confounders evolve.

How it works

It combines causal estimands for treatment regimes with reinforcement-learning-style policy optimization.

When to use it

Use as the main citation for CRL in adaptive treatment design and health decision support.

Limitations and failure modes

The paper targets settings where the needed causal quantities are identifiable; data gaps or unmodeled hidden confounding still limit conclusions.

Read full paper →More Sequential Decisions

Related papers

Paper

Reinforcement Learning: An Introduction

Richard S. Sutton, Andrew G. Barto · 2018

Paper

A Survey of Constraint Formulations in Safe Reinforcement Learning

Akifumi Wachi, Xun Shen, Yanan Sui · 2024

Paper

Off-Policy Policy Evaluation for Sequential Decisions under Unobserved Confounding

Hongseok Namkoong, Ramtin Keramati, Steve Yadlowsky +1 more · 2020

Paper

Markov Decision Processes with Unobserved Confounders: A Causal Approach

Junzhe Zhang, Elias Bareinboim · 2016