0

Cheng Qian

Papers
24

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
24papers

Authored papers

24

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

arXiv 2026

2026

Code as Agent Harness

arXiv 2026

2026

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

arXiv 2026

2026

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

arXiv 2026

2026

Agentic Reasoning for Large Language Models

arXiv 2026

2026

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

arXiv 2025

2025

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

arXiv 2025

2025

ToolRL: Reward is All Tool Learning Needs

arXiv 2025

2025

SMART: Self-Aware Agent for Tool Overuse Mitigation

arXiv 2025

2025

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

arXiv 2025

2025

RM-R1: Reward Modeling as Reasoning

arXiv 2025

2025

Internal Activation as the Polar Star for Steering Unsafe LLM Behavior

arXiv 2025

2025

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

arXiv 2025

2025

From Word to World: Can Large Language Models be Implicit Text-based World Models?

arXiv 2025

2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

arXiv 2025

2025

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering

arXiv 2025

2025

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

arXiv 2025

2025

UserBench: An Interactive Gym Environment for User-Centric Agents

arXiv 2025

2025

Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts

arXiv 2025

2025

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

arXiv 2025

2025

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

arXiv 2024

2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents

arXiv 2024

2024

Tool Learning with Foundation Models

arXiv 2023

2023

CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 24 papers