Peize Sun
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16Perception Encoder: The best visual embeddings are not at the output of the network
arXiv 2025
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
arXiv 2025
SAM 3: Segment Anything with Concepts
arXiv 2025
PixelFlow: Pixel-Space Generative Models with Flow
arXiv 2025
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
arXiv 2024
ControlAR: Controllable Image Generation with Autoregressive Models
arXiv 2024
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
arXiv 2024
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
arXiv 2024
Going Denser with Open-Vocabulary Part Segmentation
ICCV 2023 1
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
arXiv 2023
Semantic-SAM: Segment and Recognize Anything at Any Granularity
arXiv 2023
DiffusionDet: Diffusion Model for Object Detection
ICCV 2023 1
Language as Queries for Referring Video Object Segmentation
CVPR 2022 1
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
bytetrack-multi-object-tracking-by
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
CVPR 2022 1
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
CVPR 2021 1
Affiliations
Frequent co-authors
10from 16 papers