Zhiyuan Zeng
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping
arXiv 2026
Olmo 3
arXiv 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
arXiv 2025
ThetaEvolve: Test-time Learning on Open Problems
arXiv 2025
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
arXiv 2025
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
arXiv 2025
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
arXiv 2025
UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering
arXiv 2025
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?
arXiv 2025
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models
arXiv 2024
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
arXiv 2023
Evaluating Large Language Models at Evaluating Instruction Following
arXiv 2023
Plug-and-Play Knowledge Injection for Pre-trained Language Models
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers
Hannaneh Hajishirzi
professor
Pang Wei Koh
Simon Shaolei Du
Xipeng Qiu
Yiping Wang
Baolin Peng
Danqi Chen
professor
Hamish Ivison
grad-student
Hao Cheng
Liliang Ren