Zhibin Gou
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
arXiv 2025
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
arXiv 2024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
arXiv 2024
Rho-1: Not All Tokens Are What You Need
arXiv 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
arXiv 2024
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
arXiv 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
arXiv 2023
Long Time No See! Open-Domain Conversation with Long-Term Persona Memory
Findings (ACL) 2022 5
Affiliations
Frequent co-authors
10from 11 papers