Ziniu Hu
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11TreeRL: LLM Reinforcement Learning with On-Policy Tree Search
arXiv 2025
QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
arXiv 2025
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
arXiv 2024
Can Large Language Model Agents Simulate Human Trust Behavior?
arXiv 2024
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
arXiv 2024
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion
arXiv 2024
Enhancing Large Vision Language Models with Self-Training on Image Comprehension
arXiv 2024
Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
arXiv 2024
AvalonBench: Evaluating LLMs Playing the Game of Avalon
arXiv 2023
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
arXiv 2023
GPT-GNN: Generative Pre-Training of Graph Neural Networks
arXiv 2020
Affiliations
Frequent co-authors
10from 11 papers