Zhengyin Du
- Papers
- 6
Cite
Notes
Only stored in your browser.
6papers
Authored papers
6Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
arXiv 2025
MIR-Bench: Can Your LLM Recognize Complicated Patterns via Many-Shot In-Context Reasoning?
arXiv 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
arXiv 2025
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
arXiv 2025
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
arXiv 2025
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 6 papers