UltraFeedback: Boosting Language Models with High-quality Feedback

OpenBMB 64K-prompt preference dataset where 17 LLMs respond and GPT-4 grades each response on instruction-following, truthfulness, honesty, and helpfulness.

Open

Preview
Publisher: Tsinghua OpenBMB
Year: 2023
Venue: ICML
ArXiv: arxiv.org/abs/2310.01377
Code: github.com/OpenBMB/UltraFeedback
Authors: 12
Hosting: External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2310.01377
TL;DR: semanticscholar.org/paper/976ca858496d2d10246b943a44fe75d1ac477639
Code: github.com/OpenBMB/UltraFeedback

Attribution policy →

Introduces 1 artifact - 1 tool

TL;DR

Semantic Scholar

This work validates the effectiveness of scaled AI feedback data in constructing strong open-source chat language models, serving as a solid foundation for future feedback learning research.

Artifacts

Tools

UltraFeedback

Authors

Ganqu Cui Guanming Yao Guotong Xie Lifan Yuan Maosong Sun Ning Ding Wei Zhu Yuan Ni Zhiyuan Liu Yankai Lin Bingxiang He Ruobing Xie