Jihan Yang
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
arXiv 2026
UniTok: A Unified Tokenizer for Visual Generation and Understanding
arXiv 2025
Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs
arXiv 2025
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
arXiv 2024
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
CVPR 2025 1
V-IRL: Grounding Virtual Intelligence in Real Life
arXiv 2024
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding
CVPR 2023 1
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers