Xiaoke Huang
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation
arXiv 2026
VecGlypher: Unified Vector Glyph Generation with Language Models
arXiv 2026
Scaling Zero-Shot Reference-to-Video Generation
arXiv 2025
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
arXiv 2025
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
arXiv 2025
m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models
arXiv 2025
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization
arXiv 2024
VoCo-LLaMA: Towards Vision Compression with Large Language Models
CVPR 2025 1
Efficient Meshy Neural Fields for Animatable Human Avatars
arXiv 2023
Segment and Caption Anything
CVPR 2024 1
Affiliations
Frequent co-authors
10from 10 papers