Rui Qian
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Seed1.5-VL Technical Report
arXiv 2025
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
CVPR 2025 1
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
CVPR 2025 1
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
ICCV 2025
Reasoning to Attend: Try to Understand How <SEG> Token Works
CVPR 2025 1
VideoPrism: A Foundational Visual Encoder for Video Understanding
arXiv 2024
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition
arXiv 2024
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
ICCV 2023 1
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
arXiv 2023
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
ICCV 2023 1
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
NeurIPS 2021 12
Spatiotemporal Contrastive Video Representation Learning
CVPR 2021 1
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation
CVPR 2021 1
Affiliations
Frequent co-authors
10from 13 papers