Zhaolin Gao
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5Value-Guided Search for Efficient Chain-of-Thought Reasoning
arXiv 2025
Pre-trained Large Language Models Learn Hidden Markov Models In-context
arXiv 2025
REBEL: Reinforcement Learning via Regressing Relative Rewards
arXiv 2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
arXiv 2024
End-to-end Training for Recommendation with Language-based User Profiles
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers