Can LLMs Design Good Questions Based on Context?

An automated evaluation method using LLMs assesses generated questions from context across multiple dimensions, revealing unique characteristics compared to human-generated questions.

Open

Preview
Year: 2025
Venue: arXiv 2025
ArXiv: arxiv.org/abs/2501.03491
Authors: 7
Hosting: Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text: arxiv.org/abs/2501.03491ARXIV-DEFAULT
TL;DR: Semantic Scholar

Attribution policy →

Abstract

This paper evaluates questions generated by LLMs from context, comparing them to human-generated questions across six dimensions. We introduce an automated LLM-based evaluation method, focusing on aspects like question length, type, context coverage, and answerability. Our findings highlight unique characteristics of LLM-generated questions, contributing insights that can support further research in question quality and downstream applications.

Authors

Dawn Song Yiyou Sun Basel Alomair Xiaoyuan Liu Yueheng Zhang Atheer Alharbi Hend Alzahrani