Learned world models summarize an agent's experience to facilitate learning complex behaviors. While learning world models from high-dimensional sensory inputs is becoming feasible through deep learning, there are many potential ways for deriving behaviors from them. We present Dreamer, a reinforcement learning agent that solves long-horizon tasks from images purely by latent imagination. We efficiently learn behaviors by propagating analytic gradients of learned state values back through trajectories imagined in the compact state space of a learned world model. On 20 challenging visual control tasks, Dreamer exceeds existing approaches in data-efficiency, computation time, and final performance.
Dream to Control: Learning Behaviors by Latent Imagination
Dreamer, a reinforcement learning agent, uses latent imagination to solve complex tasks from images, outperforming existing methods in data efficiency, computation time, and final performance.
- Year
- 2019
- Venue
- dream-to-control-learning-behaviors-by-latent-1
- Authors
- 4
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1912.01603v3ARXIV-DEFAULT
- TL;DR
- Semantic Scholar