Zhenda Xie
- Papers: 15
Authored papers (15)
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
arXiv 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
arXiv 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
arXiv 2024
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
arXiv 2024
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
arXiv 2024
DeepSeek-VL: Towards Real-World Vision-Language Understanding
arXiv 2024
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
arXiv 2023
SimMIM: A Simple Framework for Masked Image Modeling
CVPR 2022
Self-Supervised Learning with Swin Transformers
arXiv 2021
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning
CVPR 2021
Frequent co-authors
10 from 15 papers