Jaewoo Ahn
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games
arXiv 2025
Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates
arXiv 2025
FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games
arXiv 2025
ChartCap: Mitigating Hallucination of Dense Chart Captioning
ICCV 2025
Is a Peeled Apple Still Red? Evaluating LLMs' Ability for Conceptual Combination with Property Type
arXiv 2025
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
arXiv 2024
Who Wrote this Code? Watermarking for Code Generation
arXiv 2023
MPCHAT: Towards Multimodal Persona-Grounded Conversation
arXiv 2023
Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue
ICLR 2020 1
Affiliations
Frequent co-authors
10from 9 papers