Attribution
One-shot Entropy Minimization
arXiv 2025
Universal Reasoning Model
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
from 3 papers
Bryan Dai
Zitian Gao
Haoming Luo
Lynx Chen
Chong Luo
He Xing
Kai Qiu
Qingnan Ren
Ran Tao
Tian Xie