Pengyu Cheng
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
arXiv 2026
Kimi-VL Technical Report
arXiv 2025
Search Self-play: Pushing the Frontier of Agent Capability without Supervision
arXiv 2025
Self-playing Adversarial Language Game Enhances LLM Reasoning
arXiv 2024
On Diversified Preferences of Large Language Model Alignment
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers