Adaptive Discounting of Training Time Attacks
Among the most insidious attacks on Reinforcement Learning (RL) solutions are training-time attacks (TTAs) that create loopholes and backdoors in the learned behaviour. Not limited to a simple disruption, constructive TTAs (C-TTAs) are now available, where the attacker forces a…
- Year
- 2024
- Hosting
- External sourcelicense unknown
Cite
Notes
Only stored in your browser.