Zhiwei He
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
arXiv 2025
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents
arXiv 2025
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning
arXiv 2025
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
arXiv 2025
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding
arXiv 2024
Exploring Human-Like Translation Strategy with Large Language Models
arXiv 2023
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate
arXiv 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
arXiv 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
arXiv 2023
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation
ACL 2022 5
Affiliations
Frequent co-authors
10from 10 papers