Zhipin Wang

Cite

Notes

Only stored in your browser.

Attribution

2papers

Authored papers

DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

arXiv 2025

ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models

arXiv 2023

No known affiliations.

from 2 papers

Christian Greisinger

Christoph Leiter

Daniil Larionov

Hongxing Fan

Jing Shao

Lu Sheng

Ran Zhang

Sotaro Takeshita

Steffen Eger

Yanran Chen