Xueyan Zou
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Quantitative Video World Model Evaluation for Geometric-Consistency
arXiv 2026
M3: 3D-Spatial MultiModal Memory
arXiv 2025
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
arXiv 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
arXiv 2023
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023 1
Visual In-Context Prompting
CVPR 2024 1
Semantic-SAM: Segment and Recognize Anything at Any Granularity
arXiv 2023
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
arXiv 2023
Interfacing Foundation Models' Embeddings
arXiv 2023
Generalized Decoding for Pixel, Image, and Language
CVPR 2023 1
Progressive Temporal Feature Alignment Network for Video Inpainting
CVPR 2021 1
Affiliations
Frequent co-authors
10from 11 papers