Xiangxin Zhou
9 papers
Authored papers
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching
arXiv 2026
Rethinking the Trust Region in LLM Reinforcement Learning
arXiv 2026
Defeating the Training-Inference Mismatch via FP16
arXiv 2025
Reinforcing General Reasoning without Verifiers
arXiv 2025
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
arXiv 2025
Variational Reasoning for Language Models
arXiv 2025
GEM: A Gym for Agentic LLMs
arXiv 2025
GSLB: The Graph Structure Learning Benchmark
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks
Affiliations
No known affiliations.
Frequent co-authors
10 from 9 papers