IFEval: Instruction-Following Evaluation

Active

Evaluates how well language models can strictly follow detailed instructions, such as writing responses with specific word counts or including required keywords.
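
The instructions described above (word counts, required keywords) can be checked by a program rather than a human grader. The sketch below illustrates that idea in Python. It is not the benchmark's official checker: the function names `check_min_words`, `check_keywords`, and `follows_all`, and the whitespace-based word count, are assumptions made for illustration.

```python
from typing import Callable, List


def check_min_words(response: str, min_words: int) -> bool:
    """Return True if the response has at least `min_words` whitespace-separated words."""
    return len(response.split()) >= min_words


def check_keywords(response: str, keywords: List[str]) -> bool:
    """Return True if every keyword occurs in the response (case-insensitive substring match)."""
    lowered = response.lower()
    return all(keyword.lower() in lowered for keyword in keywords)


def follows_all(response: str, checks: List[Callable[[str], bool]]) -> bool:
    """Strict scoring (hypothetical helper): the response passes only if every check passes."""
    return all(check(response) for check in checks)


if __name__ == "__main__":
    response = "The quick brown fox jumps over the lazy dog"
    checks = [
        lambda r: check_min_words(r, 5),          # "write at least 5 words"
        lambda r: check_keywords(r, ["fox", "dog"]),  # "include the words fox and dog"
    ]
    print(follows_all(response, checks))
```

Each check maps to one instruction in the prompt, so a response that meets the word count but omits a keyword fails under this strict all-or-nothing rule.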

Domain: Reasoning
License: MIT
Published: Oct 2024
Notable for: Benchmark for evaluating Reasoning.

Related tools

Implementations, trainers, datasets, and scaffolds linked to this eval (10 listed).

FAQ

What is IFEval: Instruction-Following Evaluation?
Evaluates how well language models can strictly follow detailed instructions, such as writing responses with specific word counts or including required keywords.
How can a model improve its IFEval: Instruction-Following Evaluation score?
Sophon links RL environments, datasets, and scaffolds that target this eval, including Ifeval RL Env (Arcee AI), Allenai Ifeval RL Env (Dev Team), Ifeval RL Env (Community), and Allenai Ifeval RL Env (Prime Intellect).
What license is IFEval: Instruction-Following Evaluation under?
IFEval: Instruction-Following Evaluation is available under the MIT license.