Renshan Zhang
- Papers
- 3
Cite
Notes
Only stored in your browser.
3papers
Authored papers
3FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
ICCV 2025
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
arXiv 2025
Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
8from 3 papers