Lizhu Zhang

3 papers

Authored papers

CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning

arXiv 2025

Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

arXiv 2025

Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment

arXiv 2025

No known affiliations.

Co-authors (from 3 papers)

Zhuokai Zhao

Jiayi Liu

Xiangjun Fan

Baosheng He

Chaoqi Wang

Chen Zhu

Chenghao Yang

Chenxiao Yang

Hanchao Yu

Hao Ma