Cite
Notes
Only stored in your browser.
Attribution
Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks
arXiv 2026
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
arXiv 2024
from 2 papers
Ruohui Huang
Chao Ma
Fengyu Wang
Gang Yu
Hangcheng Zhu
Haoran Sun
Jihuai Zhu
Junjie Li
Lixin Liu
Lyumanshan Ye