Cite
Notes
Only stored in your browser.
Attribution
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
arXiv 2025
from 1 papers
Cheng Peng
Jiangxuan Long
Wei Chu
Weidi Xu
Yuan Qi