Cite
Notes
Only stored in your browser.
Attribution
MLVU: Benchmarking Multi-task Long Video Understanding
CVPR 2025 1
Emu3: Next-Token Prediction is All You Need
arXiv 2024
Efficient Multimodal Learning from Data-centric Perspective
SVIT: Scaling up Visual Instruction Tuning
arXiv 2023
from 4 papers
Bo Zhao
Tiejun Huang
Muyang He
Xi Yang
Yueze Wang
Bo Zhang
BoWen Zhang
Fan Zhang
Guang Liu
Jianhao Yuan