Zhiheng Liu
- Papers: 12
Authored papers
VecGlypher: Unified Vector Glyph Generation with Language Models
arXiv 2026
Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
arXiv 2026
WavFlow: Audio Generation in Waveform Space
arXiv 2026
Scaling Zero-Shot Reference-to-Video Generation
arXiv 2025
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
arXiv 2025
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model
arXiv 2025
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
arXiv 2025
Soundwave: Less is More for Speech-Text Alignment in LLMs
arXiv 2025
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
arXiv 2025
DanceGRPO: Unleashing GRPO on Visual Generation
arXiv 2025
MagicQuill: An Intelligent Interactive Image Editing System
CVPR 2025 1
AniDoc: Animation Creation Made Easier
CVPR 2025 1
Affiliations
Frequent co-authors
10 from 12 papers