Cite
Notes
Only stored in your browser.
Attribution
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles
arXiv 2023
from 1 papers
Tsung-Hui Chang
Zhiwei Tang