Wen Sun
- Papers: 15
Authored papers (15)
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
Step-DeepResearch Technical Report
arXiv 2025
Value-Guided Search for Efficient Chain-of-Thought Reasoning
arXiv 2025
Step-Audio 2 Technical Report
arXiv 2025
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
arXiv 2025
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv 2025
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model
arXiv 2025
Step-GUI Technical Report
arXiv 2025
REBEL: Reinforcement Learning via Regressing Relative Rewards
arXiv 2024
Dataset Reset Policy Optimization for RLHF
arXiv 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
arXiv 2024
On Speeding Up Language Model Evaluation
arXiv 2024
Learning to Generate Better Than Your LLM
arXiv 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
arXiv 2023
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
arXiv 2023
Affiliations
Frequent co-authors
10 from 15 papers