LongBench v2

LongBench v2 is designed to assess how well LLMs handle long-context problems that require deep understanding and reasoning across a range of real-world tasks.

Type: RL Env
Runtime: ORS
License: unknown
Size: 2012 tasks
Published: Feb 2026

Contributors: 1