Yi-Kai Zhang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4LongCat-Flash-Thinking-2601 Technical Report
arXiv 2026
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
arXiv 2026
V_{0.5}: Generalist Value Model as a Prior for Sparse RL Rollouts
arXiv 2026
Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing
arXiv 2025
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers