Zhen Huang
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12daVinci-LLM:Towards the Science of Pretraining
arXiv 2026
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling
arXiv 2026
daVinci-Dev: Agent-native Mid-training for Software Engineering
arXiv 2026
Data Darwinism Part I: Unlocking the Value of Scientific Data for Pre-training
arXiv 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
LIMO: Less is More for Reasoning
arXiv 2025
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?
arXiv 2024
RuleRAG: Rule-guided retrieval-augmented generation with language models for question answering
arXiv 2024
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?
arXiv 2024
Affiliations
Frequent co-authors
10from 12 papers