Yuxiao Ye
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Learning While Staying Curious: Entropy-Preserving Supervised Fine-Tuning via Adaptive Self-Distillation for Large Reasoning Models
arXiv 2026
GARDO: Reinforcing Diffusion Models without Reward Hacking
arXiv 2025
OpenCUA: Open Foundations for Computer-Use Agents
arXiv 2025
Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
arXiv 2025
PET-SQL: A Prompt-Enhanced Two-Round Refinement of Text-to-SQL with Cross-consistency
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers