Peize Sun

Papers: 16

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Perception Encoder: The best visual embeddings are not at the output of the network

arXiv 2025

2025

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

arXiv 2025

2025

SAM 3: Segment Anything with Concepts

arXiv 2025

2025

PixelFlow: Pixel-Space Generative Models with Flow

arXiv 2025

2025

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

arXiv 2024

2024

ControlAR: Controllable Image Generation with Autoregressive Models

arXiv 2024

2024

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

arXiv 2024

2024

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

arXiv 2024

2024

Going Denser with Open-Vocabulary Part Segmentation

ICCV 2023

2023

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

arXiv 2023

2023

Semantic-SAM: Segment and Recognize Anything at Any Granularity

arXiv 2023

2023

DiffusionDet: Diffusion Model for Object Detection

ICCV 2023

2022

Language as Queries for Referring Video Object Segmentation

CVPR 2022

2022

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

ECCV 2022

2021

DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion

CVPR 2022

2021

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

CVPR 2021

2020

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Ping Luo

Shoufa Chen

Shilong Zhang

Yi Jiang

Zehuan Yuan

Chongjian Ge

Christoph Feichtenhofer

Feng Li

Jie Wu

Nikhila Ravi