Zhiliang Peng
7 papers
Authored papers
VibeVoice Technical Report
arXiv 2025
Multimodal Latent Language Modeling with Next-Token Diffusion
arXiv 2024
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
arXiv 2023
Kosmos-2: Grounding Multimodal Large Language Models to the World
arXiv 2023
BEiT v2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
arXiv 2022
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks
arXiv 2022
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
ICCV 2023
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers