Jianrui Zhang
- Papers: 8
Authored papers
Your Embedding Model is SMARTer Than You Think
arXiv 2026
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
arXiv 2026
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs
arXiv 2026
Reasoning-Augmented Representations for Multimodal Retrieval
arXiv 2026
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples
arXiv 2024
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
arXiv 2024
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation
arXiv 2024
Affiliations
Frequent co-authors
10 from 8 papers