Zhengyan Zhang
- Papers: 14
Authored papers
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
DeepSeek-V3 Technical Report
arXiv 2024
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
arXiv 2024
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
arXiv 2024
Robust and Scalable Model Editing for Large Language Models
arXiv 2024
Plug-and-Play Knowledge Injection for Pre-trained Language Models
arXiv 2023
Plug-and-Play Document Modules for Pre-trained Models
arXiv 2023
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
Findings (ACL) 2022 5
Sub-Character Tokenization for Chinese Pretrained Language Models
arXiv 2021
Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger
ACL 2021 5
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
CPM: A Large-scale Generative Chinese Pre-trained Language Model
arXiv 2020
CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models
arXiv 2020
KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation
arXiv 2019
Frequent co-authors
10 from 14 papers