Yijia Luo
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
arXiv 2026
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library
arXiv 2025
Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers