Attribution
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges
arXiv 2026
Bowen Chen
Changze Lv
Feiran Zhang
Jiakang Yuan
Jingwen Xu
Kaitao Song
Muling Wu
Muzhao Tian
Qi Qian
Ruicheng Yin