Cite
Notes
Only stored in your browser.
Attribution
T-REG: Preference Optimization with Token-Level Reward Regularization
arXiv 2024
Controllable Text Generation with Neurally-Decomposed Oracle
arXiv 2022
from 2 papers
Kai-Wei Chang
Lingxiao Zhao
Nanyun Peng
Shujian Zhang
Sidi Lu
Wenxuan Zhou