A Study on Question-Answer Dataset for LLM Safety Evaluation with a Focus on Illegal Activities

Year: 2026
ArXiv: arxiv.org/abs/2605.29340
URL: arxiv.org/abs/2605.29340
Hosting: Full text hosted (CC-BY-4.0)

Attribution

Abstract & full text: arxiv.org/abs/2605.29340 (CC-BY-4.0)

Abstract

In this paper, we discuss a question-answer dataset for LLM safety evaluation, with a focus on illegal activities. Specifically, based on a manual analysis of AnswerCarefully, we introduce several pieces of additional information, methods for creating question-answer examples, and a rubric for evaluating LLM-generated responses. The outcomes of this study are intended to be shared with the "JAI-Trust" project.