Shujian Zhang
- Papers
- 5
Cite
Notes
Only stored in your browser.
5papers
Authored papers
5WPO: Enhancing RLHF with Weighted Preference Optimization
arXiv 2024
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
arXiv 2024
T-REG: Preference Optimization with Token-Level Reward Regularization
arXiv 2024
POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models
arXiv 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
preference-grounded-token-level-guidance-for
Affiliations
No known affiliations.
Frequent co-authors
10from 5 papers