0

Alice Oh

Papers
19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
19papers

Authored papers

19

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

arXiv 2026

2026

MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models

arXiv 2026

2026

BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation

arXiv 2025

2025

HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja

arXiv 2025

2025

MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language

arXiv 2025

2025

Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation

arXiv 2025

2025

Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues

arXiv 2025

2025

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

arXiv 2024

2024

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

arXiv 2024

2024

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

arXiv 2024

2024

Survey of Cultural Awareness in Language Models: Text and Beyond

arXiv 2024

2024

Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese

arXiv 2024

2024

When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun

arXiv 2024

2024

Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models

arXiv 2024

2024

A 2-step Framework for Automated Literary Translation Evaluation: Its Promises and Pitfalls

arXiv 2024

2024

Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis

arXiv 2023

2023

IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension

arXiv 2022

2022

How to Find Your Friendly Neighborhood: Graph Attention Design with Self-Supervision

how-to-find-your-friendly-neighborhood-graph

2022

KLUE: Korean Language Understanding Evaluation

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 19 papers