Zhuofan Zong
- Papers: 13
Authored papers
DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving
arXiv 2026
FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation
arXiv 2026
SlidesGen-Bench: Evaluating Slides Generation via Computational and Quantitative Metrics
arXiv 2026
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
arXiv 2025
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
arXiv 2025
WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning
arXiv 2025
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
arXiv 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
arXiv 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
arXiv 2024
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
ICCV 2023
DETRs with Collaborative Hybrid Assignments Training
ICCV 2023
Large-batch Optimization for Dense Visual Predictions
arXiv 2022
Self-slimmed Vision Transformer
arXiv 2021
Frequent co-authors
10 from 13 papers