Llama 4 Scout

Open weights

Meta's small Llama 4 variant - 17B-active / 109B-total MoE that fits on a single H100, with a 10M-token context window.

Publisher: Meta FAIR (Fundamental AI Research)
Family: Llama 4
Params: 109B total / 17B active (16 experts)
Context: 10M
Input
Output
Providers: together fireworks groq ollama self-host
License: Llama Community
Released: Apr 2025
Canonical: llama.com/models/llama-4

Cite

Notes

Only stored in your browser.

Attribution

Benchmark scores: AA dabstep-org AfterQuery

Attribution policy →

Context

10M

Input

$0.17

/1M tok

Output

$0.63

/1M tok

Blended

$0.29

#240/ 458/1M · 3:1

Speed

89

#124/ 458tok/s

Price & speed via Artificial Analysis · ranked across tracked models

Reported on 13 evals across 8 domains - best 84.4% on MATH-500 · #75 of 178

Reported eval scores

13

Humanity's Last Exam (HLE)4.3

AIME 2024: Problems from the American Invitational Mathematics Examination28.3

GPQA Diamond58.7

Terminal-Bench (Hard)1.5

LiveCodeBench29.9

AIME 2025: Problems from the American Invitational Mathematics Examination14

τ²-bench (Tau²-bench)15.5