Cite
Notes
Only stored in your browser.
Attribution
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
arXiv 2024
Mirror Descent Policy Optimization
mirror-descent-policy-optimization-1
from 2 papers
Jinwoo Shin
Jongheon Jeong
Kimin Lee
Krishnamurthy Dvijotham
KyuYoung Kim
Lior Shani
Manan Tomar
Minyong An
Yonathan Efroni