Junjie Fei

Cite

Notes

Only stored in your browser.

Attribution

5papers

Authored papers

Small Vision-Language Models are Smart Compressors for Long Video Understanding

arXiv 2026

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

ICCV 2025

Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents

document-haystacks-vision-language-reasoning-1

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

arXiv 2023

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

ICCV 2023 1

No known affiliations.

from 5 papers

Jun Chen

Mohamed Elhoseiny

Chun-Mei Feng

Dannong Xu

Jinrui Zhang

Teng Wang

Chenchen Zhu

Chengjie Wang

Chong Zhou

Feng Zheng