0

Starling-7B: Improving Helpfulness and Harmlessness with RLAIF

UC Berkeley's Starling chat model trained with RLAIF on the Nectar 3.8M GPT-4-ranked preference dataset, one of the strongest 7B chat models of late 2023.

Year
2024
Venue
ICML
Authors
8
Hosting
External sourcelicense unknown

Cite

Notes

Only stored in your browser.