HELM (Holistic Evaluation of Language Models)

Stanford CRFM's open-source Python framework for holistic, reproducible, transparent evaluation of foundation models across many benchmarks and metric axes.

Type: Framework
Runtime: custom
License: Apache-2.0
Size: 50+ scenarios, 100+ supported models
Published: Nov 2021
