Shenao Zhang

Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

6papers

Authored papers

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

arXiv 2025

2025

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

arXiv 2024

2024

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

arXiv 2024

2024

Offline Reinforcement Learning for LLM Multi-Step Reasoning

arXiv 2024

2024

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

arXiv 2023

2023

Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

NeurIPS 2023 11

2023

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Zhaoran Wang

Zhihan Liu

Boyi Liu

Han Zhong

Hao Hu

Donghan Yu

Eugene Ie

Hany Hassan

Hanze Dong

Hiteshi Sharma