Yan Ma
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model
arXiv 2026
Thinking with Generated Images
arXiv 2025
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
arXiv 2025
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
arXiv 2025
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
arXiv 2025
Visual Programmability: A Guide for Code-as-Thought in Chart Understanding
arXiv 2025
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
arXiv 2025
One RL to See Them All: Visual Triple Unified Reinforcement Learning
arXiv 2025
MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation
arXiv 2024
ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation
arXiv 2024
Weak-to-Strong Reasoning
arXiv 2024
Affiliations
Frequent co-authors
10from 11 papers