0

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens

Active

LLM benchmark featuring an average data length surpassing 100K tokens. Comprises synthetic and realistic tasks spanning diverse domains in English and Chinese.

Domain
Reasoning
License
mit
Published
Nov 2024
Notable for
Benchmark for evaluating Reasoning.

Cite

Notes

Only stored in your browser.

FAQ

What is ∞Bench: Extending Long Context Evaluation Beyond 100K Tokens?
LLM benchmark featuring an average data length surpassing 100K tokens. Comprises synthetic and realistic tasks spanning diverse domains in English and Chinese.
What license is ∞Bench: Extending Long Context Evaluation Beyond 100K Tokens under?
∞Bench: Extending Long Context Evaluation Beyond 100K Tokens is available under mit.