0

Agent Prompt Eval Xlam 60k V3

Advanced agent prompt evaluation with DYNAMIC TOOL REGISTRATION - Creates mock functions on-demand from Salesforce/xlam-function-calling-60k dataset

Domain
rl-env
License
unknown
Published
Oct 2025

Cite

Notes

Only stored in your browser.

Attribution

Leaderboard scores
prime-hub
Attribution policy →

Top models

1
Agent Prompt Eval Xlam 60k V3Bar chart with 1 bar. Highest value: Llama 3.1 8B Instant at 20.
1 model

Related tools

1
View all

Implementations, trainers, datasets and scaffolds linked to this eval.

FAQ

What is Agent Prompt Eval Xlam 60k V3?
Advanced agent prompt evaluation with DYNAMIC TOOL REGISTRATION - Creates mock functions on-demand from Salesforce/xlam-function-calling-60k dataset
How can a model improve its Agent Prompt Eval Xlam 60k V3 score?
Tools linked to Agent Prompt Eval Xlam 60k V3 on Sophon include 60k V 3 RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.
What license is Agent Prompt Eval Xlam 60k V3 under?
Agent Prompt Eval Xlam 60k V3 is available under unknown.