0

Ineqmath

This adapter brings IneqMath, the dev set of the first inequality-proof Q\&A benchmark for LLMs, into Harbor, enabling standardized evaluation of models on mathematical reasoning and proof construction.

Domain
agent-eval
Published
Nov 2025

Cite

Notes

Only stored in your browser.

FAQ

What is Ineqmath?
This adapter brings IneqMath, the dev set of the first inequality-proof Q\&A benchmark for LLMs, into Harbor, enabling standardized evaluation of models on mathematical reasoning and proof construction.