Shiyu Wang
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
arXiv 2025
ActionStudio: A Lightweight Framework for Data and Training of Large Action Models
arXiv 2025
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
arXiv 2025
MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models
arXiv 2025
Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
arXiv 2025
CoDA: Coding LM via Diffusion Adaptation
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
arXiv 2024
TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting
timemixer-decomposable-multiscale-mixing-for
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
NeuroBOLT: Resting-state EEG-to-fMRI Synthesis with Multi-dimensional Feature Mapping
arXiv 2024
Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making
arXiv 2024
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
arXiv 2024
Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models
arXiv 2024
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting
arXiv 2023
Affiliations
Frequent co-authors
10from 16 papers