Cite
Notes
Only stored in your browser.
Attribution
Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation
arXiv 2025
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
arXiv 2024
from 2 papers
Jian Guan
Rui Yan
Wei Wu