Yifei Zhou
- Papers
- 9
Cite
Notes
Only stored in your browser.
9papers
Authored papers
9Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
arXiv 2025
Learning Adaptive Parallel Reasoning with Language Models
arXiv 2025
Digi-Q: Learning Q-Value Functions for Training Device-Control Agents
arXiv 2025
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
arXiv 2025
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
arXiv 2024
Autonomous Evaluation and Refinement of Digital Agents
arXiv 2024
Aligning Large Language Models with Representation Editing: A Control Perspective
arXiv 2024
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
arXiv 2023
$BT^2$: Backward-compatible Training with Basis Transformation
arXiv 2022
Affiliations
No known affiliations.
Frequent co-authors
10from 9 papers