Cite
Notes
Only stored in your browser.
Attribution
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards
arXiv 2026
from 1 papers
Boxi Cao
Hongyu Lin
Le Sun
Min He
Xianpei Han
Xueru Wen
Yaojie Lu
Zhengzhao Ma