James M. Rehg
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
arXiv 2026
Toward Cognitive Supersensing in Multimodal Large Language Model
arXiv 2026
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
CVPR 2025 1
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
arXiv 2024
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models
CVPR 2024 1
ZeroShape: Regression-based Zero-shot Shape Reconstruction
CVPR 2024 1
REBAR: Retrieval-Based Reconstruction for Time-series Contrastive Learning
arXiv 2023
LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
CVPR 2024 1
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022 1
Fine-Grained Head Pose Estimation Without Keypoints
arXiv 2017
Dockerface: an Easy to Install and Use Faster R-CNN Face Detector in a Docker Container
arXiv 2017
Affiliations
Frequent co-authors
10from 11 papers