Tianbao Xie
PhD student at the University of Hong Kong (XLang Lab); lead author of OSWorld, the standard benchmark for computer-use / GUI agents.
- Role
- grad-student
- Currently at
- HKU XLANG Lab
- twitter.com/TianbaoX
- GitHub
- github.com/timothyxxx
- Scholar
- scholar.google.com/citations
- Papers
- 25
Cite
Notes
Only stored in your browser.
Authored papers
25CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents
arXiv 2026
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System
arXiv 2026
OSWorld-Verified: A Cleaner, More Reliable Computer-Use Benchmark
blog
Qwen2.5-VL Technical Report
arXiv 2025
Qwen3-VL Technical Report
arXiv 2025
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
arXiv 2025
xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
arXiv 2025
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
arXiv 2025
OpenCUA: Open Foundations for Computer-Use Agents
arXiv 2025
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
arXiv 2025
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
arXiv 2025
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
NeurIPS
Cradle: Empowering Foundation Agents Towards General Computer Control
arXiv 2024
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
arXiv 2024
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
arXiv 2024
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
arXiv 2024
OpenAgents: An Open Platform for Language Agents in the Wild
arXiv 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
arXiv 2023
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
arXiv 2023
Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction
arXiv 2023
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
arXiv 2022
In-Context Learning for Few-Shot Dialogue State Tracking
arXiv 2022
Binding Language Models in Symbolic Languages
arXiv 2022
A Survey on Spoken Language Understanding: Recent Advances and New Frontiers
arXiv 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
arXiv 2021
Eval contributions
2Affiliations
Frequent co-authors
10from 25 papers
Tao Yu
professor
Yiheng Xu
researcher
Caiming Xiong
researcher
Danyang Zhang
researcher
Jixuan Chen
researcher
Victor Zhong
researcher
Ruisheng Cao
researcher
Yitao Liu
researcher
Zhoujun Cheng
researcher
Dongchan Shin
researcher