Coconot RL Env (Community)
Fresh
Contextual noncompliance evaluation using the AllenAI CoCoNot dataset
Cite
Notes
Only stored in your browser.
Evals this tool implements
1Same problem set, this tool's harness. Run it to score a model on the test.
Contextual noncompliance evaluation using the AllenAI CoCoNot dataset
Cite
Notes
Only stored in your browser.
Same problem set, this tool's harness. Run it to score a model on the test.