Yuchang Sun
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
arXiv 2026
R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification
arXiv 2026
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv 2025
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
arXiv 2025
Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends
arXiv 2025
Exploring Selective Layer Fine-Tuning in Federated Learning
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers