Cite
Notes
Only stored in your browser.
Attribution
Image Generation with a Sphere Encoder
arXiv 2026
Zero-Shot Vision Encoder Grafting via LLM Surrogates
ICCV 2025
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
arXiv 2025
Object Recognition as Next Token Prediction
CVPR 2024 1
from 4 papers
Tom Goldstein
Furong Huang
Menglin Jia
Zikui Cai
Abhinav Bhatele
Ang Li
Bor-Chun Chen
Charles Wang
Deqing Fu
Hengduo Li