Yi Dong

NVIDIA NeMo researcher contributing to the open Nemotron, SteerLM, and HelpSteer LLM-alignment line.

Role: researcher
Currently at: NVIDIA
GitHub: github.com/yidong72
Scholar: scholar.google.com/citations
Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

9papers·1tool contribs

Authored papers

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

arXiv 2026

2026

FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions

arXiv 2026

2026

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

arXiv 2025

2025

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

arXiv 2025

2025

Safe Pruning LoRA: Robust Distance-Guided Pruning for Safety Alignment in Adaptation of LLMs

arXiv 2025

2025

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

arXiv 2025

2025

HelpSteer2: Open-source Dataset for Training Top-Performing Reward Models

NeurIPS

2024

Nemotron-4 340B Technical Report

arXiv 2024

2024

RSFNet: A White-Box Image Retouching Approach using Region-Specific Color Filters

ICCV 2023 1

2023

Tool contributions

HelpSteer2

NVIDIA

NVIDIA's permissively-licensed human-annotated preference dataset with 5-axis Likert ratings - engineered to train high-quality reward models.

PreferenceInstruction FollowingSafetyHallucination

Affiliations

Currently at

NVIDIA

researcher · infra

Frequent co-authors

from 9 papers

Jan Kautz

Mingjie Liu

Shizhe Diao

Ximing Lu

Changhao Jiang

Daniel Egert

researcher

2 shared papers

Gerald Shen

engineer

2 shared papers

Hui Li

2 shared papers

Jian Hu

2 shared papers

Jiaqi Zeng

researcher

2 shared papers