0

Soft Actor-Critic for Discrete Action Settings

An alternative version of the Soft Actor-Critic algorithm for discrete action settings is competitive with tuned model-free state-of-the-art methods on Atari games without hyperparameter tuning.

Year
2019
Venue
arXiv 2019
Authors
1
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/1910.07207v2ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action settings that is not applicable to discrete action settings. Many important settings involve discrete actions, however, and so here we derive an alternative version of the Soft Actor-Critic algorithm that is applicable to discrete action settings. We then show that, even without any hyperparameter tuning, it is competitive with the tuned model-free state-of-the-art on a selection of games from the Atari suite.

Authors

1