Zhibin Gou

Papers: 11

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

11papers

Authored papers

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint

2025

DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition

arXiv 2025

2025

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

arXiv 2025

2025

DeepSeek-V3 Technical Report

arXiv 2024

2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

arXiv 2024

2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

arXiv 2024

2024

Rho-1: Not All Tokens Are What You Need

arXiv 2024

2024

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

arXiv 2024

2024

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

arXiv 2023

2023

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

arXiv 2023

2023

Long Time No See! Open-Domain Conversation with Long-Term Persona Memory

Findings (ACL) 2022 5

2022

Affiliations

No known affiliations.

Frequent co-authors

from 11 papers

Zhihong Shao

Chong Ruan

Dejian Yang

Junxiao Song

Liyue Zhang

Qihao Zhu

Shirong Ma

Wenjun Gao

Z. Z. Ren

Daya Guo