Cite
Notes
Only stored in your browser.
Attribution
TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents
arXiv 2026
OntoTune: Ontology-Driven Self-training for Aligning Large Language Models
arXiv 2025
FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models
arXiv 2023
from 3 papers
Bing Zhu
Chengtao Gan
Chenyang Si
Chuqi Wang
Hanyang Cao
Haochen Yin
Haotian Xia
Huajun Chen
Jinyi Niu
Junjie Wang