Rongyao Fang
- Papers: 13
Authored papers (13)
CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
arXiv 2025
FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark
arXiv 2025
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning
arXiv 2025
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
arXiv 2025
CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms
arXiv 2025
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
arXiv 2025
GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
arXiv 2025
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
arXiv 2025
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
arXiv 2024
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation
ICCV 2025
Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking
arXiv 2023
RBGNet: Ray-based Grouping for 3D Object Detection
CVPR 2022
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
arXiv 2021
Affiliations
Frequent co-authors
10 from 13 papers