Attribution
Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning
arXiv 2025
Cheng Peng
Shuyao Xu
Wei Chu
Weidi Xu
Yuan Qi