← Research / Causal Inference

Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes

Read full paper →
AuthorsJunzhe Zhang, Elias Bareinboim
JournalNeurIPS
Year2019

What Problem It Solves

Clinical and behavioral treatment policies require sequential decisions under causal constraints, not just high-reward policies in simulator MDPs.

What problem it solves

Clinical and behavioral treatment policies require sequential decisions under causal constraints, not just high-reward policies in simulator MDPs.

How it works

The method frames DTR learning as an RL problem while preserving causal identification requirements for treatment effects.

When to use it

Use for health, medicine, and personalization settings where policies adapt to patient or user history.

Limitations and failure modes

Applicability depends on measured histories being rich enough, or on additional causal structure when confounding is hidden.

Read full paper →More Causal Inference

Related papers

Paper

Estimation and Inference of Heterogeneous Treatment Effects using Random Forests

Stefan Wager, Susan Athey · 2017

Paper

Causal inference in statistics: An overview

Judea Pearl · 2009

Paper

Towards Causal Representation Learning

Bernhard Scholkopf, Francesco Locatello, Stefan Bauer +4 more · 2021

Paper

Elements of Causal Inference: Foundations and Learning Algorithms

Jonas Peters, Dominik Janzing, Bernhard Scholkopf · 2017