Shenzhi Wang
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
arXiv 2026
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models
arXiv 2026
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
arXiv 2025
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
arXiv 2025
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
arXiv 2024
Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
arXiv 2024
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints
arXiv 2024
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
train-once-get-a-family-state-adaptive
Affiliations
Frequent co-authors
10from 8 papers