unicat is an RL env contributor.
Cite
Notes
Only stored in your browser.
Attribution
RL environment for evaluating how well an LLM teaches a concept. Multi-turn dialog (1..N turns per task) between a tutor model and a simulated stud...
AurumDesk B2B negotiation environment for Verifiers / Prime Intellect