Wee Sun Lee

Papers: 10

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

10papers

Authored papers

Can Vision-Language Models Solve the Shell Game?

arXiv 2026

2026

Rethinking the Trust Region in LLM Reinforcement Learning

arXiv 2026

2026

Understanding R1-Zero-Like Training: A Critical Perspective

arXiv 2025

2025

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

arXiv 2025

2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

arXiv 2025

2025

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

arXiv 2025

2025

Defeating the Training-Inference Mismatch via FP16

arXiv 2025

2025

Reasoning-CV: Fine-tuning Powerful Reasoning LLMs for Knowledge-Assisted Claim Verification

arXiv 2025

2025

GEM: A Gym for Agentic LLMs

arXiv 2025

2025

Sample-Efficient Alignment for LLMs

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Min Lin

Zichen Liu

Chao Du

Penghui Qi

Tianyu Pang

Changyu Chen

Xiangxin Zhou

Bo Liu

researcher

2 shared papers

Simon Yu

researcher

2 shared papers

Weiyan Shi

2 shared papers