Kangjia Zhao
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model
arXiv 2025
ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
arXiv 2024
GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent
arXiv 2024
Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language Models
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers