Xiaoqian Shen
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation
ICCV 2025
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens
arXiv 2024
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
arXiv 2024
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling
arXiv 2024
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
arXiv 2023
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
arXiv 2023
StoryGPT-V: Large Language Models as Consistent Story Visualizers
CVPR 2025 1
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers