Cite
Notes
Only stored in your browser.
Attribution
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs
arXiv 2025
from 1 papers
Jiawei Zhou
Yanhong Li