Runxin Xu
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
arXiv 2024
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
arXiv 2024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
arXiv 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
arXiv 2024
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
arXiv 2024
Affiliations
Frequent co-authors
10from 10 papers