0

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

Active

AGIEval is a human-centric benchmark specifically designed to evaluate the general abilities of foundation models in tasks pertinent to human cognition and problem-solving.

Domain
Knowledge
License
mit
Published
May 2026
Notable for
Benchmark for evaluating Knowledge.

Cite

Notes

Only stored in your browser.

FAQ

What is AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models?
AGIEval is a human-centric benchmark specifically designed to evaluate the general abilities of foundation models in tasks pertinent to human cognition and problem-solving.
What license is AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models under?
AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models is available under mit.