0

Kaituo Feng

Papers
19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
19papers

Authored papers

19

OpenGame: Open Agentic Coding for Games

arXiv 2026

2026

Flow-OPD: On-Policy Distillation for Flow Matching Models

arXiv 2026

2026

Gen-Searcher: Reinforcing Agentic Search for Image Generation

arXiv 2026

2026

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

arXiv 2026

2026

Exploring Reasoning Reward Model for Agents

arXiv 2026

2026

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

arXiv 2026

2026

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

arXiv 2026

2026

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

arXiv 2026

2026

Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback

arXiv 2025

2025

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

arXiv 2025

2025

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

arXiv 2025

2025

OneThinker: All-in-one Reasoning Model for Image and Video

arXiv 2025

2025

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

arXiv 2025

2025

AdaTooler-V: Adaptive Tool-Use for Images and Videos

arXiv 2025

2025

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

arXiv 2025

2025

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

arXiv 2025

2025

Video-R1: Reinforcing Video Reasoning in MLLMs

arXiv 2025

2025

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

arXiv 2024

2024

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

10

from 19 papers