Yuexiang Xie
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
arXiv 2026
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
arXiv 2025
Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models
arXiv 2025
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
arXiv 2025
Very Large-Scale Multi-Agent Simulation in AgentScope
arXiv 2024
$β$-DPO: Direct Preference Optimization with Dynamic $β$
arXiv 2024
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
arXiv 2024
Exploring Selective Layer Fine-Tuning in Federated Learning
arXiv 2024
Data-Juicer: A One-Stop Data Processing System for Large Language Models
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers