Zhiyuan Hu
- Papers
- 11
Cite
Notes
Only stored in your browser.
Authored papers
11Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
arXiv 2026
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
arXiv 2026
GTA1: GUI Test-time Scaling Agent
arXiv 2025
JudgeLRM: Large Reasoning Models as a Judge
arXiv 2025
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models
arXiv 2025
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
CVPR 2025 1
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
arXiv 2024
Encoding and Controlling Global Semantics for Long-form Video Question Answering
arXiv 2024
Natural Language Reinforcement Learning
arXiv 2024
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs
arXiv 2023
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration
arXiv 2023
Affiliations
Frequent co-authors
10from 11 papers