Wee Sun Lee
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Can Vision-Language Models Solve the Shell Game?
arXiv 2026
Rethinking the Trust Region in LLM Reinforcement Learning
arXiv 2026
Understanding R1-Zero-Like Training: A Critical Perspective
arXiv 2025
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
arXiv 2025
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
arXiv 2025
SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization
arXiv 2025
Defeating the Training-Inference Mismatch via FP16
arXiv 2025
Reasoning-CV: Fine-tuning Powerful Reasoning LLMs for Knowledge-Assisted Claim Verification
arXiv 2025
GEM: A Gym for Agentic LLMs
arXiv 2025
Sample-Efficient Alignment for LLMs
arXiv 2024
Affiliations
Frequent co-authors
10from 10 papers