Attribution
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
arXiv 2024
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
from 2 papers
Cheng Yang
Chufan Shi
Junjie Wang
Xinyu Zhu
Yujiu Yang
Yuxiang Zhang
Bo Shui
Deng Cai
Gongye Liu
Hanwen Wan