Weijie Shi
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3R^3L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification
arXiv 2026
TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents
arXiv 2026
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 3 papers