Shirong Ma

Papers: 7

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint

2025

DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition

arXiv

2025

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

arXiv

2025

DeepSeek-V3 Technical Report

arXiv

2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

arXiv

2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

arXiv

2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

arXiv

2024

Affiliations

No known affiliations.

Frequent co-authors

from 7 papers

Zhihong Shao

Chong Ruan

Daya Guo

Dejian Yang

Junxiao Song

Liyue Zhang

Qihao Zhu

Wenjun Gao

Bingxuan Wang

Chenggang Zhao