Weiming Ren

Graduate student at TIGER-Lab, University of Waterloo; works on video LLMs, video generation, and multimodal benchmarks.

Role: grad-student
Currently at: TIGER-Lab
Twitter: Unknown
GitHub: github.com/WeimingRen
Scholar: scholar.google.com/scholar
Papers: 13

Attribution

Affiliations & profile: scholar.google.com/scholar


Authored papers

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

arXiv 2026

2026

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

arXiv 2026

2026

VecGlypher: Unified Vector Glyph Generation with Language Models

arXiv 2026

2026

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

ICCV 2025

2025

VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation

arXiv 2025

2025

Scaling Zero-Shot Reference-to-Video Generation

arXiv 2025

2025

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

arXiv 2025

2025

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

arXiv 2025

2025

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

NeurIPS 2024

2024

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

arXiv 2024

2024

AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks

arXiv 2024

2024

Video Diffusion Models: A Survey

arXiv 2024

2024

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

CVPR 2024

2023

Affiliations

Currently at

TIGER-Lab

grad-student · university lab

Frequent co-authors

from 13 papers

Wenhu Chen

professor

9 shared papers

Cong Wei

5 shared papers

Ge Zhang

researcher

Sen He

Tao Xiang

Zhiheng Liu

Haonan Qiu

Xiaoke Huang

Yuren Cong

Zhaochong An