Ming Lu
- Papers: 15
Authored papers (15)
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation
arXiv 2026
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
CVPR 2025
TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning
arXiv 2025
VCU-Bridge: Hierarchical Visual Connotation Understanding via Semantic Bridging
arXiv 2025
Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference
arXiv 2025
MC-LLaVA: Multi-Concept Personalized Vision-Language Model
arXiv 2024
Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
ICCV 2025
Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis
arXiv 2024
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views
arXiv 2024
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
arXiv 2023
I-MedSAM: Implicit Medical Image Segmentation with Segment Anything
arXiv 2023
Lossy Image Compression with Quantized Hierarchical VAEs
arXiv 2022
Adaptive Patch Exiting for Scalable Single Image Super-Resolution
arXiv 2022
NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results
arXiv 2021
SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution
arXiv 2021
Frequent co-authors
10 from 15 papers