Runxin Xu

Papers: 10

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint

2025

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

arXiv 2025

2025

DeepSeek-V3 Technical Report

arXiv 2024

2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

arXiv 2024

2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

arXiv 2024

2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

arXiv 2024

2024

Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models

arXiv 2024

2024

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

arXiv 2024

2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey

arXiv 2024

2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

arXiv 2024

2024

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Peiyi Wang

Daya Guo

Damai Dai

Dejian Yang

Deli Chen

Junxiao Song

Qihao Zhu

Xiao Bi

Zhihong Shao

Aixin Liu