Cite
Notes
Only stored in your browser.
Attribution
FireRed-OCR Technical Report
arXiv 2026
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs
arXiv 2025
from 2 papers
Boxiang Zhou
Changhao Qiao
Chunxiao Fan
Gang Liu
Hao Wu
Jian Wu
Kai Zuo
Manjie Xu
Phellon Chen
Wenxin Yu