Zhiwei He

Papers: 10

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

10papers

Authored papers

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

arXiv 2025

2025

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

arXiv 2025

2025

DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning

arXiv 2025

2025

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

arXiv 2025

2025

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

arXiv 2024

2024

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

arXiv 2023

2023

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

arXiv 2023

2023

Exploring Human-Like Translation Strategy with Large Language Models

arXiv 2023

2023

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

arXiv 2023

2023

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

ACL 2022 5

2022

Affiliations

No known affiliations.

Frequent co-authors

from 10 papers

Zhaopeng Tu

Rui Wang

Tian Liang

Jiahao Xu

Shuming Shi

Xing Wang

Zhuosheng Zhang

Dong Yu

Haitao Mi

Wenxiang Jiao