Cite
Notes
Only stored in your browser.
Attribution
RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning
arXiv 2025
from 1 papers
Dina Katabi
Duane S. Boning
Maohao Shen
Zhang-Wei Hong
Zhengqi Gao