Cite
Notes
Only stored in your browser.
Attribution
Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation
arXiv 2025
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
arXiv 2024
from 2 papers
Dezhi Luo
Hao Yan
Jiahe Ding
Joyce Chai
Liang Yin
Minghui Liao
Run Peng
Sihan Xu
Wei Chen
Xiang Bai