Li Zhao
6 papers
Authored papers
Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning
arXiv 2025
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models
arXiv 2025
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
arXiv 2025
DPO Meets PPO: Reinforced Token Optimization for RLHF
arXiv 2024
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
arXiv 2021
Distributional Reinforcement Learning for Multi-Dimensional Reward Functions
NeurIPS 2021 12
Affiliations
No known affiliations.
Frequent co-authors
10 from 6 papers