Junjie Fei
5 papers
Authored papers
Small Vision-Language Models are Smart Compressors for Long Video Understanding
arXiv 2026
WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation
ICCV 2025
Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
arXiv 2023
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning
ICCV 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 5 papers