0

Depth QA RL Env (Community)

Fresh

Teaches LLMs when to use search tools vs answer directly. Mixed-difficulty QA with easy/medium/hard questions requiring 0/1/2+ hops.

Type
RL Env
Runtime
tool-use
License
unknown
Size
v0.1.1
Published
Feb 2026

Cite

Notes

Only stored in your browser.

Contributors

1