Shenghao Fu
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8ObjEmbed: Towards Universal Multimodal Object Embeddings
arXiv 2026
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
CVPR 2025 1
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
arXiv 2025
ViSpeak: Visual Instruction Feedback in Streaming Videos
ICCV 2025
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation
arXiv 2025
HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context
arXiv 2025
LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning
arXiv 2025
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
ICCV 2023 1
Affiliations
Frequent co-authors
10from 8 papers