Zhepei Wei
4 papers
Authored papers
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
arXiv 2026
G-Zero: Self-Play for Open-Ended Generation from Zero Data
arXiv 2026
The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
arXiv 2025
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers