Shengyu Zhang
- Papers: 13
Authored papers: 13
SafePred: A Predictive Guardrail for Computer-Using Agents via World Models
arXiv 2026
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
arXiv 2025
Efficient Agents: Building Effective Agents While Reducing Cost
arXiv 2025
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
arXiv 2025
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
arXiv 2025
UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits
arXiv 2025
HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization
arXiv 2025
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
arXiv 2025
Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
arXiv 2025
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
arXiv 2025
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
arXiv 2025
Reinforcement Learning Enhanced LLMs: A Survey
arXiv 2024
Instruction Tuning for Large Language Models: A Survey
arXiv 2023
Frequent co-authors
10 from 13 papers