Haoxuan You
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8HoliTom: Holistic Token Merging for Fast Video Large Language Models
arXiv 2025
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models
arXiv 2025
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
arXiv 2025
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
CVPR 2025 1
Ferret: Refer and Ground Anything Anywhere at Any Granularity
arXiv 2023
IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models
arXiv 2023
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
arXiv 2023
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework
arXiv 2022
Affiliations
Frequent co-authors
10from 8 papers