Bridging the Gap Between Target Networks and Functional Regularization
Bootstrapping is behind much of the successes of Deep Reinforcement Learning. However, learning the value function via bootstrapping often leads to unstable training due to fast-changing target values.
- Year
- 2022
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.