In this paper, a unified framework for exploration in reinforcement learning (RL) is proposed based on an option-critic model. The proposed framework learns to integrate a set of diverse exploration strategies so that the agent can adaptively select the most effective exploration strategy over time to realize a relevant exploration-exploitation trade-off for each given task. The effectiveness of the proposed exploration framework is demonstrated by various experiments in the MiniGrid and Atari environments.
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework
A unified exploration framework in RL using option-critic models integrates diverse strategies for adaptive exploration-exploitation trade-offs, validated in MiniGrid and Atari environments.
- Year
- 2023
- Venue
- arXiv 2023
- Authors
- 3
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2310.03342v2ARXIV-DEFAULT
- TL;DR
- Semantic Scholar