Cite
Notes
Only stored in your browser.
Attribution
Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild
arXiv 2026
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards
arXiv 2024
from 2 papers
Abdul Ali Bangash
Ahmed E. Hassan
Bram Adams
Filipe Roseiro Côgo
Zehao Wang