Jian Hu

Papers: 8

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

8papers

Authored papers

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

arXiv 2026

2026

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

arXiv 2025

2025

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

arXiv 2025

2025

V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning

arXiv 2025

2025

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

arXiv 2025

2025

CoS: Chain-of-Shot Prompting for Long Video Understanding

arXiv 2025

2025

Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation

arXiv 2024

2024

Aligning Language Models with Offline Learning from Human Feedback

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 8 papers

Shaogang Gong

Chenyang Si

Jan Kautz

Mingjie Liu

Shizhe Diao

Wei Li

Ximing Lu

Yi Dong

researcher

2 shared papers

Zixu Cheng

2 shared papers

Binfeng Xu

1 shared paper