0

Language Model Cascades

Prompted models are extended through test-time interactions or composition into probabilistic programs that handle complex data types, control flow, and dynamic structures, formalized as language model cascades.

Year
2022
Venue
arXiv 2022
Authors
12
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2207.10342v2ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Prompted models have demonstrated impressive few-shot learning abilities. Repeated interactions at test-time with a single model, or the composition of multiple models together, further expands capabilities. These compositions are probabilistic models, and may be expressed in the language of graphical models with random variables whose values are complex data types such as strings. Cases with control flow and dynamic structure require techniques from probabilistic programming, which allow implementing disparate model structures and inference strategies in a unified language. We formalize several existing techniques from this perspective, including scratchpads / chain of thought, verifiers, STaR, selection-inference, and tool use. We refer to the resulting programs as language model cascades.

Authors

12