∞Bench: Extending Long Context Evaluation Beyond 100K Tokens
Active
LLM benchmark featuring an average data length surpassing 100K tokens. Comprises synthetic and realistic tasks spanning diverse domains in English and Chinese.
- Publisher
- Tsinghua University
- Domain
- Reasoning
- License
- mit
- Published
- Nov 2024
- Notable for
- Benchmark for evaluating Reasoning.
Cite
Notes
Only stored in your browser.
FAQ
- What is ∞Bench: Extending Long Context Evaluation Beyond 100K Tokens?
- LLM benchmark featuring an average data length surpassing 100K tokens. Comprises synthetic and realistic tasks spanning diverse domains in English and Chinese.
- What license is ∞Bench: Extending Long Context Evaluation Beyond 100K Tokens under?
- ∞Bench: Extending Long Context Evaluation Beyond 100K Tokens is available under mit.