Zhiyu Wu
7 papers
Authored papers
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling
arXiv 2025
The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization
arXiv 2025
Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
arXiv 2024
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 7 papers