Wenxiang Chen
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Better Process Supervision with Bi-directional Rewarding Signals
arXiv 2025
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
arXiv 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
arXiv 2025
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
arXiv 2024
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
arXiv 2024
The Rise and Potential of Large Language Model Based Agents: A Survey
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers