Damai Dai
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
arXiv 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
arXiv 2024
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
arXiv 2024
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
arXiv 2024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
arXiv 2024
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
arXiv 2023
StableMoE: Stable Routing Strategy for Mixture of Experts
ACL 2022 5
Calibrating Factual Knowledge in Pretrained Language Models
arXiv 2022
A Survey on In-context Learning
arXiv 2022
Affiliations
Frequent co-authors
10from 13 papers