Measuring Massive Multitask Language Understanding

Active

Dataset Card for MMLU

License: MIT
Published: May 2026

Sample tasks (5, from the eval dataset)

abstract_algebra

Find the degree for the given field extension Q(sqrt(2), sqrt(3), sqrt(18)) over Q.

  • A. 0
  • B. 4
  • C. 2
  • D. 6
abstract_algebra

Let p = (1, 2, 5, 4)(2, 3) in S_5. Find the index of <p> in S_5.

  • A. 8
  • B. 2
  • C. 24
  • D. 120
abstract_algebra

Find all zeros in the indicated finite field of the given polynomial with coefficients in that field. x^5 + 3x^3 + x^2 + 2x in Z_5

  • A. 0
  • B. 1
  • C. 0,1
  • D. 0,4
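
The finite-field question above can be checked by brute force, since Z_5 has only five elements. The sketch below is not part of the dataset; the helper name poly_zeros_mod and the exponent-to-coefficient dictionary are illustrative assumptions.

```python
def poly_zeros_mod(coeffs, p):
    """Return every x in {0, ..., p-1} with f(x) congruent to 0 mod p.

    coeffs maps exponent -> coefficient (a hypothetical representation
    chosen for this sketch, not something the dataset defines).
    """
    return [
        x for x in range(p)
        if sum(c * pow(x, e, p) for e, c in coeffs.items()) % p == 0
    ]

# f(x) = x^5 + 3x^3 + x^2 + 2x over Z_5
f = {5: 1, 3: 3, 2: 1, 1: 2}
zeros = poly_zeros_mod(f, 5)
print(zeros)  # [0, 4]
```

Evaluating at each of the five residues gives zeros at 0 and 4, which corresponds to option D.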
abstract_algebra

Statement 1 | A factor group of a non-Abelian group is non-Abelian. Statement 2 | If K is a normal subgroup of H and H is a normal subgroup of G, then K is a normal subgroup of G.

  • A. True, True
  • B. False, False
  • C. True, False
  • D. False, True
abstract_algebra

Find the product of the given polynomials in the given polynomial ring. f(x) = 4x - 5, g(x) = 2x^2 - 4x + 2 in Z_8[x].

  • A. 2x^2 + 5
  • B. 6x^2 + 4x + 6
  • C. 0
  • D. x^2 + 1
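
The polynomial product in Z_8[x] above can be worked mechanically: multiply the coefficient lists, then reduce each coefficient mod 8. This is a minimal sketch, not part of the dataset; the helper name poly_mul_mod and the lowest-degree-first list layout are assumptions made for illustration.

```python
def poly_mul_mod(f, g, n):
    """Multiply two polynomials and reduce every coefficient mod n.

    Polynomials are coefficient lists, lowest degree first
    (a layout assumed for this sketch).
    """
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] = (out[i + j] + a * b) % n
    return out

f = [-5, 4]      # 4x - 5
g = [2, -4, 2]   # 2x^2 - 4x + 2
product = poly_mul_mod(f, g, 8)
print(product)   # [6, 4, 6, 0]
```

Over the integers the product is 8x^3 - 26x^2 + 28x - 10. Reducing mod 8 removes the cubic term and leaves 6x^2 + 4x + 6, which corresponds to option B.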

Related tools (3)

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Measuring Massive Multitask Language Understanding?
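Measuring Massive Multitask Language Understanding (MMLU) is a multiple-choice eval. Each task pairs a question from a named subject, such as abstract_algebra, with four answer options labeled A to D. This page is its dataset card.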
How can a model improve its Measuring Massive Multitask Language Understanding score?
Three tools on Sophon target this eval: Openmed Medknowledge RL Env (Community), MMLU RL Env (Prime Community), and MMLU RL Env (Community). The linked tools are RL environments, datasets, and scaffolds.
What license is Measuring Massive Multitask Language Understanding under?
Measuring Massive Multitask Language Understanding is available under the MIT license.