Cheng Qian
- Papers: 24
Authored papers (24)
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
arXiv 2026
Code as Agent Harness
arXiv 2026
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data
arXiv 2026
CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing
arXiv 2026
Agentic Reasoning for Large Language Models
arXiv 2026
VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting
arXiv 2025
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
arXiv 2025
ToolRL: Reward is All Tool Learning Needs
arXiv 2025
SMART: Self-Aware Agent for Tool Overuse Mitigation
arXiv 2025
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents
arXiv 2025
RM-R1: Reward Modeling as Reasoning
arXiv 2025
Internal Activation as the Polar Star for Steering Unsafe LLM Behavior
arXiv 2025
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
arXiv 2025
From Word to World: Can Large Language Models be Implicit Text-based World Models?
arXiv 2025
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
arXiv 2025
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
arXiv 2025
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
arXiv 2025
UserBench: An Interactive Gym Environment for User-Centric Agents
arXiv 2025
Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts
arXiv 2025
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe
arXiv 2025
Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
arXiv 2024
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents
arXiv 2024
Tool Learning with Foundation Models
arXiv 2023
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
arXiv 2023
Frequent co-authors
10 from 24 papers
Heng Ji
professor
Zhiyuan Liu
professor
Hongru Wang
Xiusi Chen
Caiming Xiong
researcher
Haolin Chen
Huan Wang
Shelby Heinecke
Silvio Savarese
researcher
Weiran Yao