Simon Shaolei Du

Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

6papers

Authored papers

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

arXiv 2025

2025

ThetaEvolve: Test-time Learning on Open Problems

arXiv 2025

2025

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

arXiv 2025

2025

Spurious Rewards: Rethinking Training Signals in RLVR

arXiv 2025

2025

LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning

arXiv 2023

2023

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Yiping Wang

4 shared papers

Zhiyuan Zeng

3 shared papers

Baolin Peng

2 shared papers

Hannaneh Hajishirzi

professor

Hao Cheng

Liliang Ren

Pang Wei Koh

Runlong Zhou

Shuohang Wang

Shuyue Stella Li