Qi Yi
4 papers
Authored papers
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search
arXiv 2026
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
arXiv 2025
Online Prototype Alignment for Few-shot Policy Transfer
arXiv 2023
Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples
CVPR 2023 1
Affiliations
No known affiliations.
Frequent co-authors
10 from 4 papers