Starling-7B: Improving Helpfulness and Harmlessness with RLAIF
Starling is UC Berkeley's chat model trained with RLAIF on Nectar, a GPT-4-ranked preference dataset of 3.8M pairwise comparisons. It was one of the strongest 7B chat models of late 2023.
- Publisher: University of California, Berkeley
- Year: 2024
- Venue: ICML
- Authors: 8
- Hosting: External source, license unknown
Introduces 1 artifact - 1 tool