Cite
Notes
Only stored in your browser.
Attribution
Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation
arXiv 2026
from 1 papers
Lei LI
Siqi Ouyang
Yifeng Liu