Lanqing Hong
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection
arXiv 2026
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
arXiv 2025
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement
arXiv 2025
InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search
arXiv 2025
Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?
arXiv 2025
Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning
arXiv 2025
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025 1
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
arXiv 2023
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
arXiv 2023
DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models
ICCV 2023 1
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation
arXiv 2023
DDP: Diffusion Model for Dense Visual Prediction
ICCV 2023 1
Generalizing Few-Shot NAS with Gradient Matching
generalizing-few-shot-nas-with-gradient
Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition
arXiv 2021
Affiliations
Frequent co-authors
10from 14 papers