0

Tulu 3: Pushing Frontiers in Open Language Model Post-Training

Allen AI's fully open post-training recipe (data, code, weights) combining SFT, DPO, and a novel Reinforcement Learning with Verifiable Rewards (RLVR) stage that matches Llama 3 Instruct.

Year
2024
Venue
preprint
Authors
24
Hosting
External sourcelicense unknown

Cite

Notes

Only stored in your browser.

Introduces 4 artifacts - 2 tools, 2 models

TL;DR

Semantic Scholar

This work introduces Tulu 3, a family of fully-open state-of-the-art post-trained models, alongside its data, code, and training recipes, serving as a comprehensive guide for modern post-training techniques.

Artifacts

4

Authors

24