OpenThoughts-TBLite

OpenThoughts-TBLite: A difficulty-calibrated benchmark of 100 tasks for building terminal agents. By OpenThoughts Agent team, Snorkel AI, Bespoke Labs.

Domain
agent-eval
Published
Feb 2026

FAQ

What is OpenThoughts-TBLite?
OpenThoughts-TBLite is a difficulty-calibrated benchmark of 100 tasks for building terminal agents, created by the OpenThoughts Agent team, Snorkel AI, and Bespoke Labs.