
Tianbao Xie

PhD student at the University of Hong Kong (XLANG Lab) and lead author of OSWorld, a widely used benchmark for computer-use (GUI) agents.

Role
grad-student
Currently at
HKU XLANG Lab
Papers
25


25 papers · 2 eval contribs

Authored papers

25

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

arXiv 2026

2026

RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System

arXiv 2026

2026

OSWorld-Verified: A Cleaner, More Reliable Computer-Use Benchmark

blog

2025

Qwen2.5-VL Technical Report

arXiv 2025

2025

Qwen3-VL Technical Report

arXiv 2025

2025

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

arXiv 2025

2025

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

arXiv 2025

2025

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

arXiv 2025

2025

OpenCUA: Open Foundations for Computer-Use Agents

arXiv 2025

2025

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

arXiv 2025

2025

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

arXiv 2025

2025

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

NeurIPS 2024

2024

Cradle: Empowering Foundation Agents Towards General Computer Control

arXiv 2024

2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

arXiv 2024

2024

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

arXiv 2024

2024

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

arXiv 2024

2024

OpenAgents: An Open Platform for Language Agents in the Wild

arXiv 2023

2023

Lemur: Harmonizing Natural Language and Code for Language Agents

arXiv 2023

2023

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

arXiv 2023

2023

Mobile-Env: Building Qualified Evaluation Benchmarks for LLM-GUI Interaction

arXiv 2023

2023

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models

arXiv 2022

2022

In-Context Learning for Few-Shot Dialogue State Tracking

arXiv 2022

2022

Binding Language Models in Symbolic Languages

arXiv 2022

2022

A Survey on Spoken Language Understanding: Recent Advances and New Frontiers

arXiv 2021

2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

arXiv 2021

2021

Eval contributions

2

Affiliations

Currently at

HKU XLANG Lab

grad-student · university lab

Frequent co-authors

10

from 25 papers