Shuang Chen
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15Flow-OPD: On-Policy Distillation for Flow Matching Models
arXiv 2026
Gen-Searcher: Reinforcing Agentic Search for Image Generation
arXiv 2026
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery
arXiv 2026
Exploring Reasoning Reward Model for Agents
arXiv 2026
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
arXiv 2026
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
arXiv 2026
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis
arXiv 2026
Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
arXiv 2026
Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?
arXiv 2026
Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents
arXiv 2026
OneThinker: All-in-one Reasoning Model for Image and Video
arXiv 2025
Interleaving Reasoning for Better Text-to-Image Generation
arXiv 2025
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
arXiv 2025
FUSU: A Multi-temporal-source Land Use Change Segmentation Dataset for Fine-grained Urban Semantic Understanding
arXiv 2024
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
arXiv 2024
Affiliations
Frequent co-authors
10from 15 papers