Weiting Tan

Attribution

3 papers

Authored papers

It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

arXiv 2024

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

arXiv 2024

Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency

arXiv 2023

No known affiliations.

Co-authors from 3 papers

Lingfeng Shen

Amr Sharaf

Beidi Chen

Benjamin Van Durme

Boyuan Zheng

Daniel Khashabi

Haoran Xu

Huaxiu Yao

Kenton Murray

Taiming Lu