Cite
Notes
Only stored in your browser.
Attribution
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning
arXiv 2025
from 1 papers
Nathanaël Beau
Roman Plaud