Xuchen Pan
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification
arXiv 2026
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv 2025
Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends
arXiv 2025
Very Large-Scale Multi-Agent Simulation in AgentScope
arXiv 2024
EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
arXiv 2024
Data-Juicer: A One-Stop Data Processing System for Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers