Haibo Wang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding
arXiv 2026
Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
arXiv 2024
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering
arXiv 2024
Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers