Question 1

What is XSTest: A benchmark for identifying exaggerated safety behaviours in LLM's?

Accepted Answer

XSTest is a dataset of 250 safe prompts across ten prompt types that well-calibrated models should not refuse, plus 200 unsafe prompts that serve as contrasts and that models should refuse in most applications.

Question 2

What license is XSTest: A benchmark for identifying exaggerated safety behaviours in LLM's under?

Accepted Answer

XSTest: A benchmark for identifying exaggerated safety behaviours in LLM's is available under the MIT license.
