Attribution
RM-Distiller: Exploiting Generative LLM for Reward Model Distillation
arXiv 2026
Chenglong Wang
Hailong Cao
Hongli Zhou
Hui Huang
Lvyuan Han
Muyun Yang
Tiejun Zhao
Wei Liu
Wenhao Jiang
Xingyuan Bu