Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Microsoft Research paper that distills GPT-4's reasoning into a 13B-parameter LLaMA model by training on rich, step-by-step explanation traces rather than terse supervised fine-tuning (SFT) pairs.

Publisher: Microsoft Research
Year: 2023
Venue: preprint
ArXiv: arxiv.org/abs/2306.02707
Authors: 6
Hosting: External source, license unknown

Attribution

Abstract & full text: arxiv.org/abs/2306.02707
TL;DR: semanticscholar.org/paper/0244aeb7c6927e2fb0c2e668687e160a00737dbe

Introduces 2 artifacts - 1 tool, 1 model

TL;DR

Semantic Scholar

Orca is a 13-billion-parameter model that learns to imitate the reasoning process of large foundation models (LFMs). Its results indicate that learning from step-by-step explanations, whether generated by humans or by more advanced AI models, is a promising direction for improving model capabilities and skills.
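
As a rough illustration of what learning from step-by-step explanations means at the data level, the sketch below builds one terse question/answer record and one explanation-trace record for the same question. The class name, helper functions, default system message, and the `### System:` / `### User:` / `### Assistant:` markers are hypothetical choices for this example, not the paper's actual prompt template or data format.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TrainingExample:
    """One supervised record: optional system message, prompt, target response."""
    system: str
    prompt: str
    response: str


def terse_pair(question: str, answer: str) -> TrainingExample:
    # Terse record: the target is only the final answer, with no reasoning.
    return TrainingExample(system="", prompt=question, response=answer)


def explanation_trace(
    question: str,
    steps: List[str],
    answer: str,
    system: str = "Think step by step and justify your answer.",  # hypothetical
) -> TrainingExample:
    # Explanation record: the target spells out each reasoning step
    # before stating the answer, so the student model imitates the process.
    numbered = "\n".join(f"Step {i}: {s}" for i, s in enumerate(steps, start=1))
    return TrainingExample(
        system=system,
        prompt=question,
        response=f"{numbered}\nAnswer: {answer}",
    )


def to_text(example: TrainingExample) -> str:
    # Flatten a record into one training string (markers are illustrative only).
    parts = []
    if example.system:
        parts.append(f"### System:\n{example.system}")
    parts.append(f"### User:\n{example.prompt}")
    parts.append(f"### Assistant:\n{example.response}")
    return "\n\n".join(parts)


if __name__ == "__main__":
    question = "A train travels 60 km in 1.5 hours. What is its average speed?"
    print(to_text(terse_pair(question, "40 km/h")))
    print()
    print(to_text(explanation_trace(
        question,
        ["Average speed is distance divided by time.", "60 km / 1.5 h = 40 km/h."],
        "40 km/h",
    )))
```

Both records answer the same question; only the second exposes the intermediate reasoning in the training target.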

Artifacts

Tools

OpenOrca

Models

Orca 2 13B

Authors

Ahmed Awadallah, Arindam Mitra, Ganesh Jawahar, Hamid Palangi, Sahaj Agarwal, Subhabrata Mukherjee