Shuangrui Ding
- Papers: 13
Authored papers (13)
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation
arXiv 2026
Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition
arXiv 2026
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
CVPR 2025 1
SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
arXiv 2025
Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
CVPR 2025 1
SAM 3: Segment Anything with Concepts
arXiv 2025
SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction
arXiv 2025
ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing
arXiv 2025
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
ICCV 2025
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition
arXiv 2024
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation
arXiv 2023
Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation
ICCV 2023 1
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
ICCV 2023 1
Frequent co-authors
10 from 13 papers