Chaoqi Wang
- Papers
- 4
Cite
Notes
Only stored in your browser.
4papers
Authored papers
4Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
arXiv 2025
GRAPE: Generalizing Robot Policy via Preference Alignment
arXiv 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
arXiv 2024
Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following
arXiv 2024
Affiliations
No known affiliations.
Frequent co-authors
10from 4 papers