Quan Sun
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
GEBench: Benchmarking Image Generation Models as GUI Environments
arXiv 2026
STEP3-VL-10B Technical Report
arXiv 2026
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
arXiv 2025
Step1X-Edit: A Practical Framework for General Image Editing
arXiv 2025
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv 2025
Emu3: Next-Token Prediction is All You Need
arXiv 2024
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
arXiv 2024
Diffusion Feedback Helps CLIP See Better
arXiv 2024
EVA-CLIP: Improved Training Techniques for CLIP at Scale
arXiv 2023
CapsFusion: Rethinking Image-Text Data at Scale
CVPR 2024 1
Generative Multimodal Models are In-Context Learners
CVPR 2024 1
Affiliations
Frequent co-authors
10from 12 papers