Kongcheng Zhang
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
arXiv 2025
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
arXiv 2025
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
arXiv 2025
Reasoning with Reinforced Functional Token Tuning
arXiv 2025
Odyssey: Empowering Minecraft Agents with Open-World Skills
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers