Attribution
Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks
arXiv 2026
Doubly Robust Proximal Causal Learning for Continuous Treatments
arXiv 2023
from 2 papers
Baohua Dong
Chao Ma
Gang Yu
Hangcheng Zhu
Jihuai Zhu
Lyumanshan Ye
PengFei Liu
Pengrui Lu
Ruohui Huang
Shouyan Wang