Yi Jiang
- Papers
- 26
Cite
Notes
Only stored in your browser.
Authored papers
26Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
arXiv 2026
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation
arXiv 2026
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play
arXiv 2026
SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution
arXiv 2026
UniTok: A Unified Tokenizer for Visual Generation and Understanding
arXiv 2025
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
arXiv 2025
Waver: Wave Your Way to Lifelike Video Generation
arXiv 2025
Unified Continuous Generative Models
arXiv 2025
SA-Occ: Satellite-Assisted 3D Occupancy Prediction in Real World
arXiv 2025
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction
arXiv 2025
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
CVPR 2025 1
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
CVPR 2025 1
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
arXiv 2024
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
arXiv 2024
Liquid: Language Models are Scalable Multi-modal Generators
arXiv 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
arXiv 2024
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
arXiv 2023
General Object Foundation Model for Images and Videos at Scale
CVPR 2024 1
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces
arXiv 2023
Recognize Any Regions
arXiv 2023
EGC: Image Generation and Classification via a Diffusion Energy-Based Model
ICCV 2023 1
Language as Queries for Referring Video Object Segmentation
CVPR 2022 1
MetaFormer: A Unified Meta Framework for Fine-Grained Recognition
arXiv 2022
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
bytetrack-multi-object-tracking-by
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion
CVPR 2022 1
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
CVPR 2021 1
Affiliations
Frequent co-authors
10from 26 papers