Question 1

What is Toxicity Explanation?

Accepted Answer

Explain why a given text is toxic

Question 2

What is the current top score on Toxicity Explanation?

Accepted Answer

The top reported score is 100.0% by GPT-4.1 Mini, across 1 model reporting (1 from frontier labs).

Question 3

How can a model improve its Toxicity Explanation score?

Accepted Answer

Tools linked to Toxicity Explanation on Sophon include Toxicity Explanation RL Env (Community) - RL environments, datasets, and scaffolds that target this eval.

Question 4

What license is Toxicity Explanation under?

Accepted Answer

Toxicity Explanation is available under unknown.

Toxicity Explanation

Top models

Related tools

Toxicity Explanation RL Env (Community)

FAQ