Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action settings that is not applicable to discrete action settings. Many important settings involve discrete actions, however, and so here we derive an alternative version of the Soft Actor-Critic algorithm that is applicable to discrete action settings. We then show that, even without any hyperparameter tuning, it is competitive with the tuned model-free state-of-the-art on a selection of games from the Atari suite.
Soft Actor-Critic for Discrete Action Settings
An alternative version of the Soft Actor-Critic algorithm for discrete action settings is competitive with tuned model-free state-of-the-art methods on Atari games without hyperparameter tuning.
- Year
- 2019
- Venue
- arXiv 2019
- Authors
- 1
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1910.07207v2ARXIV-DEFAULT
- TL;DR
- Semantic Scholar