Cite
Notes
Only stored in your browser.
Attribution
SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs
arXiv 2025
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
arXiv 2024
from 2 papers
Xiaolin Qin
Haojia Hui
Jingyang Shan
Kaiwen Long
Lei Ren
Mo Guang
Rinyoichi Takezoe
Tianyi Wang
Yangge Qian
Yaqian Li