Yujie Zhong
- Papers
- 17
Cite
Notes
Only stored in your browser.
Authored papers
17Let ViT Speak: Generative Language-Image Pre-training
arXiv 2026
ThinkGen: Generalized Thinking for Visual Generation
arXiv 2025
v-CLR: View-Consistent Learning for Open-World Instance Segmentation
CVPR 2025 1
DisTime: Distribution-based Time Representation for Video Large Language Models
ICCV 2025
Mr. DETR: Instructive Multi-Route Training for Detection Transformers
CVPR 2025 1
DriveMM: All-in-One Large Multimodal Model for Autonomous Driving
arXiv 2024
LinVT: Empower Your Image-level Large Language Model to Understand Videos
arXiv 2024
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
arXiv 2024
HyperSeg: Towards Universal Visual Segmentation with Large Language Model
arXiv 2024
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network
ICCV 2023 1
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
arXiv 2023
SoccerNet 2023 Challenges Results
arXiv 2023
TriDet: Temporal Action Detection with Relative Boundary Modeling
CVPR 2023 1
CounTR: Transformer-based Generalised Visual Counting
arXiv 2022
ReAct: Temporal Action Detection with Relational Queries
arXiv 2022
SoccerNet 2022 Challenges Results
arXiv 2022
PromptDet: Towards Open-vocabulary Detection using Uncurated Images
arXiv 2022
Affiliations
Frequent co-authors
10from 17 papers