Yi Ma

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

Language-Image Alignment with Fixed Text Encoders

arXiv 2025

2025

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

arXiv 2025

2025

CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images

arXiv 2025

2025

Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos

arXiv 2025

2025

Masked Completion via Structured Diffusion with White-Box Transformers

arXiv 2024

2024

CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM

arXiv 2024

2024

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

CVPR 2024 1

2024

Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation

arXiv 2024

2024

Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition

arXiv 2024

2024

Breaking the Curse of Dimensionality: Diffusion Models Efficiently Learn Low-Dimensional Distributions

arXiv 2024

2024

Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Shengbang Tong

Shenghua Gao

Chenyu Wang

Chun-Hsiao Yeh

Ta-Ying Cheng

Tianzhe Chu

Yubei Chen

Yuexiang Zhai

Ziyang Wu

Andrew Markham