Zhengyan Zhang

Papers: 14

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

14papers

Authored papers

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint

2025

DeepSeek-V3 Technical Report

arXiv 2024

2024

ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models

arXiv 2024

2024

Robust and Scalable Model Editing for Large Language Models

arXiv 2024

2024

InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory

arXiv 2024

2024

Plug-and-Play Knowledge Injection for Pre-trained Language Models

arXiv 2023

2023

Plug-and-Play Document Modules for Pre-trained Models

arXiv 2023

2023

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts

Findings (ACL) 2022 5

2021

CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models

cpt-colorful-prompt-tuning-for-pre-trained-1

2021

Sub-Character Tokenization for Chinese Pretrained Language Models

arXiv 2021

2021

Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger

ACL 2021 5

2021

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

arXiv 2020

2020

CPM: A Large-scale Generative Chinese Pre-trained Language Model

arXiv 2020

2020

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

from 14 papers

Zhiyuan Liu

professor

12 shared papers

Maosong Sun

professor

Xu Han

Yankai Lin

Chaojun Xiao

Fanchao Qi

Jie zhou

Peng Li

Xiaozhi Wang

Aixin Liu