Kaiwen Zhou
6 papers
Authored papers
FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
ICCV 2025
HiconAgent: History Context-aware Policy Optimization for GUI Agents
arXiv 2025
VPTracker: Global Vision-Language Tracking via Visual Prompt and MLLM
arXiv 2025
GUI-G1: Understanding R1-Zero-Like Training for Visual Grounding in GUI Agents
arXiv 2025
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
arXiv 2024
Multimodal Situational Safety
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers