Yanxi Chen
- Papers: 8
Authored papers (8)
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
arXiv 2026
R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification
arXiv 2026
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv 2025
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
arXiv 2025
Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends
arXiv 2025
AHA: Aligning Large Audio-Language Models for Reasoning Hallucinations via Counterfactual Hard Negatives
arXiv 2025
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
arXiv 2024
AMG: Avatar Motion Guided Video Generation
arXiv 2024
Affiliations
Frequent co-authors
10 from 8 papers