Eunsu Kim
- Papers: 7
Authored papers (7)
"I didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration
arXiv 2026
MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language
arXiv 2025
Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues
arXiv 2025
BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation
arXiv 2025
Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation
arXiv 2025
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
arXiv 2024
BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
arXiv 2024
Frequent co-authors
10 from 7 papers