Peiyi Wang

Papers: 16

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

16papers

Authored papers

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint

2025

Chain-of-Thought Tokens are Computer Program Variables

arXiv 2025

2025

DeepSeek-V3 Technical Report

arXiv 2024

2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

arXiv 2024

2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

arXiv 2024

2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

arXiv 2024

2024

Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding

arXiv 2024

2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

arXiv 2024

2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey

arXiv 2024

2024

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

arXiv 2024

2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

arXiv 2024

2024

ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

arXiv 2024

2024

Large Language Models are not Fair Evaluators

arXiv 2023

2023

Rationale-Enhanced Language Models are Better Continual Relation Learners

arXiv 2023

2023

Enhancing Continual Relation Extraction via Classifier Decomposition

arXiv 2023

2023

Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Runxin Xu

Daya Guo

Junxiao Song

Qihao Zhu

Tianyu Liu

Xiao Bi

Zhifang Sui

Zhihong Shao

Bingxuan Wang

Chenggang Zhao