BFCL

BFCL is an evaluation of LLMs' ability to call functions and tools. The dataset represents common function calling use-cases in agents and enterprise workflows.
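
To make "ability to call functions" concrete, here is a minimal sketch of how a single function-calling test case could be checked. It is not BFCL's actual scoring code: the JSON call format, the `call_matches` helper, and the `get_weather` example are all hypothetical and chosen only for illustration.

```python
import json


def call_matches(model_output: str, expected: dict) -> bool:
    """Return True if the model's JSON call has the expected name and arguments.

    Hypothetical checker: assumes the model emits a JSON object of the form
    {"name": ..., "arguments": {...}}. Real benchmarks may use other formats.
    """
    try:
        call = json.loads(model_output)
    except json.JSONDecodeError:
        # Unparseable output counts as a failed call.
        return False
    if not isinstance(call, dict):
        return False
    # Dict comparison ignores key order, so argument order does not matter.
    return (
        call.get("name") == expected["name"]
        and call.get("arguments") == expected["arguments"]
    )


# Hypothetical test case: the expected call for "weather in Paris, in Celsius".
expected = {"name": "get_weather", "arguments": {"city": "Paris", "unit": "celsius"}}

good = '{"name": "get_weather", "arguments": {"unit": "celsius", "city": "Paris"}}'
bad = '{"name": "get_weather", "arguments": {"city": "Paris"}}'

print(call_matches(good, expected))
print(call_matches(bad, expected))
```

Under this assumption, an aggregate score would be the fraction of test cases for which the check passes; how BFCL actually aggregates its score is not stated here.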

Domain
rl-env
License
unknown
Published
Jan 2026

Attribution

Leaderboard scores
OpenReward
Attribution policy →

Top score: 67.7 by Qwen3 Max Thinking (1 model reporting)

Top models

[Bar chart: BFCL scores by model. 1 model shown: Qwen3 Max Thinking at 67.7.]

FAQ

What is BFCL?
BFCL is an evaluation of LLMs' ability to call functions and tools. The dataset represents common function calling use-cases in agents and enterprise workflows.
What is the current top score on BFCL?
The top reported score is 67.7, achieved by Qwen3 Max Thinking. It is currently the only model with a reported score.
What license is BFCL under?
The license for BFCL is not specified; it is listed as unknown.