Wanjia Zhao

Papers: 8

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey

arXiv 2026

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning

arXiv 2026

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint 2025

OpenThoughts: Data Recipes for Reasoning Models

arXiv 2025

DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition

arXiv 2025

SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning

arXiv 2025

DeepSeek-V3 Technical Report

arXiv 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

arXiv 2024

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Chong Ruan

Dejian Yang

Haocheng Wang

Huajian Xin

Junxiao Song

Liyue Zhang

Qihao Zhu

Wenjun Gao

Z. F. Wu

Z. Z. Ren