Yingwei Ma
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute
arXiv 2025
RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization
arXiv 2025
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
arXiv 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
arXiv 2024
UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts
arXiv 2024
Alibaba LingmaAgent: Improving Automated Issue Resolution via Comprehensive Repository Exploration
arXiv 2024
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement
arXiv 2024
At Which Training Stage Does Code Data Help LLMs Reasoning?
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers