Kai Yang
- Papers: 13
Authored papers
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation
arXiv 2026
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
arXiv 2026
ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation
arXiv 2026
Debiased Model-based Representations for Sample-efficient Continuous Control
arXiv 2026
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control
arXiv 2025
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL
arXiv 2025
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
arXiv 2024
MolFM: A Multimodal Molecular Foundation Model
arXiv 2023
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
CVPR 2024
Accelerated Gradient Methods for Sparse Statistical Learning with Nonconvex Penalties
arXiv 2020
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
Frequent co-authors
10 from 13 papers