Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity
This paper concerns the central issues of model robustness and sample efficiency in offline reinforcement learning (RL), which aims to learn to perform decision making from history data without active exploration.
- Year
- 2022
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.