Thematic Generalization RL Env (Community)

This benchmark measures how well LLMs can infer a narrow "theme" (a category or rule) from a small set of examples and anti-examples, and then identify the one item that truly fits that theme among a set of misleading candidates.
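To make the task shape concrete, here is a minimal Python sketch of what a single item and its scoring could look like. It is not taken from the environment's code: the `ThemeItem` fields, the prompt wording, the sample item, and the exact-match `reward` function are all assumptions chosen for illustration.

```python
from dataclasses import dataclass
import re


@dataclass
class ThemeItem:
    """One hypothetical task instance; field names are illustrative only."""
    examples: list[str]        # items that fit the hidden theme
    anti_examples: list[str]   # near-misses that do not fit the theme
    candidates: list[str]      # exactly one candidate fits the theme
    answer_index: int          # 0-based index of the fitting candidate


def build_prompt(item: ThemeItem) -> str:
    """Render the item as a single-turn, single-choice prompt (assumed layout)."""
    lines = ["Examples that fit the theme:"]
    lines += [f"- {e}" for e in item.examples]
    lines.append("Anti-examples that do NOT fit the theme:")
    lines += [f"- {a}" for a in item.anti_examples]
    lines.append("Candidates:")
    lines += [f"{i + 1}. {c}" for i, c in enumerate(item.candidates)]
    lines.append("Reply with the number of the one candidate that fits.")
    return "\n".join(lines)


def reward(item: ThemeItem, completion: str) -> float:
    """Return 1.0 if the first integer in the completion names the right candidate."""
    match = re.search(r"\d+", completion)
    if match is None:
        return 0.0
    return 1.0 if int(match.group()) - 1 == item.answer_index else 0.0


# Made-up item: the intended theme is roughly "pre-mechanical timekeeping devices".
item = ThemeItem(
    examples=["sundial", "hourglass", "water clock"],
    anti_examples=["wristwatch", "calendar"],
    candidates=["smartphone", "candle clock", "stopwatch"],
    answer_index=1,
)
```

Calling `build_prompt(item)` yields the text a model would see, and `reward(item, completion)` scores its reply as 1.0 or 0.0. The binary reward is a simplifying assumption; the actual environment may grade responses differently.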

Type: RL Env
Tags: Reasoning, Single Choice
Runtime: single-turn
License: unknown
Version: v0.1.0
Published: Aug 2025
Canonical: app.primeintellect.ai/dashboard/environments/wondering-camel/thematic-generalization

Attribution

README: api.primeintellect.ai/api/v1/environmentshub/wondering-camel/thematic-generalization/@0.1.0/inspect

Contributors

wondering-camel