Zhilin Wang
NVIDIA researcher known for HelpSteer reward-model datasets, the Nemotron-Reward family, and SteerLM alignment work.
- Role: researcher
- Currently at: NVIDIA
- Twitter: twitter.com/zhilin_wang
- Scholar: scholar.google.com/citations
- Papers: 10
Authored papers
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows
arXiv 2026
Llama-Nemotron: Efficient Reasoning Models
arXiv 2025
Evaluating Parameter Efficient Methods for RLVR
arXiv 2025
Will AI Tell Lies to Save Sick Children? Litmus-Testing AI Values Prioritization with AIRiskDilemmas
arXiv 2025
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation
arXiv 2025
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
arXiv 2025
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs
arXiv 2025
HelpSteer2: Open-source Dataset for Training Top-Performing Reward Models
NeurIPS
Nemotron-4 340B Technical Report
arXiv 2024
MAGE: Machine-generated Text Detection in the Wild
arXiv 2023
Frequent co-authors
10, from 10 papers
- Yafu Li
- Gerald Shen (engineer)
- Jiaqi Zeng (researcher)
- Jimmy Zhang (researcher)
- Oleksii Kuchaiev (researcher)
- Olivier Delalleau (researcher)
- Yu Cheng
- Aleksander Ficek
- Ameya Sunil Mahabaleshwarkar
- Boris Ginsburg