Peiyi Wang
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
Chain-of-Thought Tokens are Computer Program Variables
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
arXiv 2024
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
arXiv 2024
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
arXiv 2024
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
arXiv 2024
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
arXiv 2024
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
arXiv 2024
ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization
arXiv 2024
Large Language Models are not Fair Evaluators
arXiv 2023
Rationale-Enhanced Language Models are Better Continual Relation Learners
arXiv 2023
Enhancing Continual Relation Extraction via Classifier Decomposition
arXiv 2023
Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation
arXiv 2022
Affiliations
Frequent co-authors
10from 16 papers