Yiming Zhao
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Flow-OPD: On-Policy Distillation for Flow Matching Models
arXiv 2026
VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation
arXiv 2026
SaaSBench: Exploring the Boundaries of Coding Agents in Long-Horizon Enterprise SaaS Engineering
arXiv 2026
SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents
arXiv 2026
SCOPE: Structured Decomposition and Conditional Skill Orchestration for Complex Image Generation
arXiv 2026
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
arXiv 2025
Agentic Jigsaw Interaction Learning for Enhancing Visual Perception and Reasoning in Vision-Language Models
arXiv 2025
CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments
arXiv 2025
V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction
arXiv 2025
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering
arXiv 2024
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
arXiv 2023
Affiliations
Frequent co-authors
10from 11 papers