Qi Gu
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Self-Distilled Agentic Reinforcement Learning
arXiv 2026
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
arXiv 2026
VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions
arXiv 2026
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
arXiv 2026
AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation
arXiv 2026
LongCat-Flash-Thinking-2601 Technical Report
arXiv 2026
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
arXiv 2026
V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts
arXiv 2026
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
arXiv 2025
Affiliations
Frequent co-authors
10from 9 papers