Zhihan Liu

Cite

Notes

Only stored in your browser.

Attribution

4papers

Authored papers

Self-Exploring Language Models: Active Preference Elicitation for Online Alignment

arXiv 2024

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

arXiv 2024

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

arXiv 2023

Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

NeurIPS 2023 11

No known affiliations.

from 4 papers

Shenao Zhang

Zhaoran Wang

Boyi Liu

Han Zhong

Hao Hu

Donghan Yu

Hany Hassan

Hiteshi Sharma

Hongyi Guo

Liyu Chen