Weiming Ren
University of Waterloo / TIGER-Lab researcher; works on video LLMs and multimodal benchmarks.
- Role
- grad-student
- Currently at
- TIGER-Lab
- Unknown
- GitHub
- github.com/WeimingRen
- Scholar
- scholar.google.com/scholar
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
arXiv 2026
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time
arXiv 2026
VecGlypher: Unified Vector Glyph Generation with Language Models
arXiv 2026
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation
arXiv 2025
Scaling Zero-Shot Reference-to-Video Generation
arXiv 2025
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
arXiv 2025
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
arXiv 2025
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
ICCV 2025
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
NeurIPS
AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks
arXiv 2024
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
arXiv 2024
Video Diffusion Models: A Survey
arXiv 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024 1
Affiliations
Frequent co-authors
10from 13 papers