0

Finqa Reasoning

Financial Question Answering Benchmark Environment for LLM Evaluation

Domain
rl-env
License
mit
Published
Oct 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top score 52.9% by GPT-4.1 Mini - 1 model reporting (1 frontier)

Top models

1
Finqa ReasoningBar chart with 1 bar. Highest value: GPT-4.1 Mini at 52.9.
1 model

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Finqa Reasoning?
Financial Question Answering Benchmark Environment for LLM Evaluation
What is the current top score on Finqa Reasoning?
The top reported score is 52.9% by GPT-4.1 Mini, across 1 model reporting (1 from frontier labs).
How can a model improve its Finqa Reasoning score?
Tools linked to Finqa Reasoning on Sophon include Finqa Reasoning RL Env (Snorkel AI) - RL environments, datasets, and scaffolds that target this eval.
What license is Finqa Reasoning under?
Finqa Reasoning is available under mit.