Yi Ma
- Papers: 11
Authored papers (11)
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
arXiv 2025
Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos
arXiv 2025
CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
arXiv 2025
Language-Image Alignment with Fixed Text Encoders
arXiv 2025
Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
arXiv 2024
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
arXiv 2024
Masked Completion via Structured Diffusion with White-Box Transformers
arXiv 2024
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM
arXiv 2024
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
CVPR 2024
Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
arXiv 2024
Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models
arXiv 2023
Frequent co-authors
10, from 11 papers