Fanhu Zeng
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Imagination Helps Visual Reasoning, But Not Yet in Latent Space
arXiv 2026
Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning
arXiv 2026
Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality
arXiv 2025
A Comprehensive Survey on Continual Learning in Generative Models
arXiv 2025
Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation
arXiv 2025
HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model
arXiv 2025
VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?
arXiv 2025
ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMs' Capability via Chart Editing
arXiv 2025
Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection
arXiv 2024
ModalPrompt: Towards Efficient Multimodal Continual Instruction Tuning with Dual-Modality Guided Prompt
arXiv 2024
Affiliations
Frequent co-authors
10from 10 papers