Gangyan Zeng
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding
arXiv 2025
VidText: Towards Comprehensive Evaluation for Video Text Understanding
arXiv 2025
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
arXiv 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers