Cite
Notes
Only stored in your browser.
Attribution
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?
arXiv 2025
ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models
arXiv 2023
from 2 papers
Christian Greisinger
Christoph Leiter
Daniil Larionov
Hongxing Fan
Jing Shao
Lu Sheng
Ran Zhang
Sotaro Takeshita
Steffen Eger
Yanran Chen