0

Zehuan Yuan

Papers
24

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
24papers

Authored papers

24

Generative Refinement Networks for Visual Synthesis

arXiv 2026

2026

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

arXiv 2026

2026

UniTok: A Unified Tokenizer for Visual Generation and Understanding

arXiv 2025

2025

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

arXiv 2025

2025

Waver: Wave Your Way to Lifelike Video Generation

arXiv 2025

2025

DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction

arXiv 2025

2025

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

arXiv 2024

2024

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

CVPR 2025 1

2024

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

CVPR 2025 1

2024

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

arXiv 2024

2024

OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation

arXiv 2024

2024

Liquid: Language Models are Scalable Multi-modal Generators

arXiv 2024

2024

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

arXiv 2024

2024

Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling

arXiv 2023

2023

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

arXiv 2023

2023

EGC: Image Generation and Classification via a Diffusion Energy-Based Model

ICCV 2023 1

2023

General Object Foundation Model for Images and Videos at Scale

CVPR 2024 1

2023

Recognize Any Regions

arXiv 2023

2023

MetaFormer: A Unified Meta Framework for Fine-Grained Recognition

arXiv 2022

2022

Language as Queries for Referring Video Object Segmentation

CVPR 2022 1

2022

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

bytetrack-multi-object-tracking-by

2021

DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion

CVPR 2022 1

2021

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

CVPR 2021 1

2020

Slimmable Generative Adversarial Networks

arXiv 2020

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 24 papers