Jian Hu
- Papers
- 8
Cite
Notes
Only stored in your browser.
8papers
Authored papers
8ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents
arXiv 2026
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models
arXiv 2025
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
arXiv 2025
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning
arXiv 2025
LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs
arXiv 2025
CoS: Chain-of-Shot Prompting for Long Video Understanding
arXiv 2025
Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation
arXiv 2024
Aligning Language Models with Offline Learning from Human Feedback
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 8 papers