Shengyuan Ding
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation
arXiv 2026
Visual-ERM: Reward Modeling for Visual Equivalence
arXiv 2026
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
arXiv 2026
ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents
arXiv 2026
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
arXiv 2026
Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning
arXiv 2026
InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model
arXiv 2025
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference
arXiv 2025
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning
arXiv 2025
SPARK: Synergistic Policy And Reward Co-Evolving Framework
arXiv 2025
MM-IFEngine: Towards Multimodal Instruction Following
arXiv 2025
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM
arXiv 2025
Affiliations
Frequent co-authors
10from 12 papers