Shilong Zhang
- Papers: 10
Authored papers
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
arXiv 2025
PixelFlow: Pixel-Space Generative Models with Flow
arXiv 2025
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
arXiv 2024
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
arXiv 2024
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
arXiv 2024
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans
arXiv 2023
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
arXiv 2023
RTMDet: An Empirical Study of Designing Real-Time Object Detectors
arXiv 2022
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection
Scale-Equalizing Pyramid Convolution for Object Detection