Cite
Notes
Only stored in your browser.
Attribution
GRIT: Teaching MLLMs to Think with Images
arXiv 2025
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
arXiv 2024
MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens
arXiv 2023
from 3 papers
Xin Eric Wang
Xuehai He
Yue Fan
Ching-Chen Kuo
Diji Yang
Jiachen Li
JianFeng Wang
Kevin Lin
Lijuan Wang
Linjie Li