Enxin Song

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark

arXiv 2025

VideoNSA: Native Sparse Attention Scales Video Understanding

arXiv 2025

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

arXiv 2024

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

CVPR 2024 1

No known affiliations.

from 4 papers

Wenhao Chai

Gaoang Wang

Jianwen Xie

Tian Ye

Ethan Armand

Feiyang Wu

Guanhong Wang

Haiyang Xu

Haoyang Zhou

Haozhe Chi