Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity

This paper concerns the central issues of model robustness and sample efficiency in offline reinforcement learning (RL), which aims to learn to perform decision making from history data without active exploration.

Open

Year: 2022
ArXiv: arxiv.org/abs/2208.05767
URL: arxiv.org/abs/2208.05767v4
Hosting: External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2208.05767v4
TL;DR: Semantic Scholar

Attribution policy →