Cite
Notes
Only stored in your browser.
Attribution
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation
arXiv 2026
Video2Layout: Recall and Reconstruct Metric-Grounded Cognitive Map for Spatial Reasoning
arXiv 2025
from 2 papers
Wang Xu
Chi Chen
Conghui Zhu
Da Peng
Haiyan Zhao
Helu Zhi
Jingjing Huang
Liang Wang
Maosong Sun
professor
Qi Shi