Zhenyu Wu
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent
arXiv 2026
UniFinEval: Towards Unified Evaluation of Financial Multimodal Models across Text, Images and Videos
arXiv 2026
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
arXiv 2025
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
arXiv 2025
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
arXiv 2025
Implicit Search via Discrete Diffusion: A Study on Chess
arXiv 2025
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
arXiv 2025
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
arXiv 2024
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
arXiv 2024
Embodied Instruction Following in Unknown Environments
arXiv 2024
Instructing Large Language Models to Identify and Ignore Irrelevant Conditions
arXiv 2024
Embodied Task Planning with Large Language Models
arXiv 2023
Get an A in Math: Progressive Rectification Prompting
arXiv 2023
SOAR: Scene-debiasing Open-set Action Recognition
soar-scene-debiasing-open-set-action
E^2TAD: An Energy-Efficient Tracking-based Action Detector
arXiv 2022
Affiliations
Frequent co-authors
10from 15 papers