Enhancing Chat Language Models by Scaling High-quality Instructional Conversations

Tsinghua/OpenBMB UltraChat - a 1.5M multi-turn dialogue SFT dataset generated by two ChatGPT instances chatting with each other across diverse topics.

Year
2023
Venue
EMNLP
Authors
9
Hosting
External source, license unknown

Introduces 2 artifacts - 1 tool, 1 model

TL;DR

Semantic Scholar

This paper provides a systematically designed, diverse, informative, large-scale dataset of instructional conversations, UltraChat, and fine-tunes a LLaMA model on it to create a powerful conversational model, UltraLLaMA, which consistently outperforms other open-source models, including Vicuna.
