Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
Tsinghua/OpenBMB UltraChat - a 1.5M multi-turn dialogue SFT dataset generated by two ChatGPT instances chatting with each other across diverse topics.
- Publisher: Tsinghua OpenBMB
- Year: 2023
- Venue: EMNLP
- Authors: 9
- Hosting: External source, license unknown
Introduces 2 artifacts - 1 tool, 1 model
TL;DR
Semantic Scholar
This paper presents UltraChat, a systematically designed, diverse, informative, large-scale dataset of instructional conversations, and fine-tunes a LLaMA model on it to create UltraLLaMA, a conversational model that consistently outperforms other open-source models, including Vicuna.
Artifacts (2)
Tools
Models