Guijin Son
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models
arXiv 2026
BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation
arXiv 2025
Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning
arXiv 2025
Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces
arXiv 2025
MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
arXiv 2024
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
arXiv 2024
HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models
arXiv 2023
Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance
arXiv 2023
Affiliations
Frequent co-authors
10from 8 papers