Kaituo Feng
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19OpenGame: Open Agentic Coding for Games
arXiv 2026
Flow-OPD: On-Policy Distillation for Flow Matching Models
arXiv 2026
Gen-Searcher: Reinforcing Agentic Search for Image Generation
arXiv 2026
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
arXiv 2026
Exploring Reasoning Reward Model for Agents
arXiv 2026
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
arXiv 2026
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis
arXiv 2026
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
arXiv 2026
Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
arXiv 2025
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
arXiv 2025
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs
arXiv 2025
OneThinker: All-in-one Reasoning Model for Image and Video
arXiv 2025
EditThinker: Unlocking Iterative Reasoning for Any Image Editor
arXiv 2025
AdaTooler-V: Adaptive Tool-Use for Images and Videos
arXiv 2025
Architecture Decoupling Is Not All You Need For Unified Multimodal Model
arXiv 2025
SpaceVista: All-Scale Visual Spatial Reasoning from mm to km
arXiv 2025
Video-R1: Reinforcing Video Reasoning in MLLMs
arXiv 2025
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
arXiv 2024
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
arXiv 2024
Affiliations
Frequent co-authors
10from 19 papers