AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Active
AGIEval is a human-centric benchmark specifically designed to evaluate the general abilities of foundation models in tasks pertinent to human cognition and problem-solving.
- Publisher
- Microsoft Research
- Domain
- Knowledge
- License
- mit
- Published
- May 2026
- Notable for
- Benchmark for evaluating Knowledge.
Cite
Notes
Only stored in your browser.
FAQ
- What is AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models?
- AGIEval is a human-centric benchmark specifically designed to evaluate the general abilities of foundation models in tasks pertinent to human cognition and problem-solving.
- What license is AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models under?
- AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models is available under mit.