Attribution
S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
arXiv 2025
from 1 paper
Bang Zhang
Cheng Liu
Jia Li
researcher
Jiaqi Chen
Nan Du
Peisong Wang
Ruotian Ma
Xin Zhou