Wentian Zhao
- Papers: 6
Authored papers
Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning
arXiv 2025
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training
arXiv 2025
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
arXiv 2025
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
arXiv 2025
CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs
arXiv 2025
Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers