Attribution
Universal Reasoning Model
arXiv 2025
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
One-shot Entropy Minimization
Interpretable Contrastive Monte Carlo Tree Search Reasoning
arXiv 2024
from 4 papers
Bryan Dai
Joey Zhou
Haoming Luo
Lynx Chen
Aiwei Liu
Boye Niu
Chong Luo
Haotian Xu
He Xing
Hongzhang Liu