Weiran Yao
- Papers: 14
Authored papers
CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?
arXiv 2026
ActionStudio: A Lightweight Framework for Data and Training of Large Action Models
arXiv 2025
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
arXiv 2025
Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics
arXiv 2025
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
arXiv 2025
MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models
arXiv 2025
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
arXiv 2025
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
arXiv 2025
UserBench: An Interactive Gym Environment for User-Centric Agents
arXiv 2025
CoDA: Coding LM via Diffusion Adaptation
arXiv 2025
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
arXiv 2024
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
arXiv 2024
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
arXiv 2023
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
arXiv 2023
Affiliations
Frequent co-authors
10 from 14 papers