Yi Dong
NVIDIA NeMo researcher contributing to the open Nemotron, SteerLM, and HelpSteer LLM-alignment line.
- Role
- researcher
- Currently at
- NVIDIA
- GitHub
- github.com/yidong72
- Scholar
- scholar.google.com/citations
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents
arXiv 2026
FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
arXiv 2026
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
arXiv 2025
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
arXiv 2025
Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs
arXiv 2025
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts
arXiv 2025
HelpSteer2: Open-source Dataset for Training Top-Performing Reward Models
NeurIPS
Nemotron-4 340B Technical Report
arXiv 2024
RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters
ICCV 2023 1
Tool contributions
1Affiliations
Frequent co-authors
10from 9 papers