SEACrowd/toxicity_200
Language DetectionToxicity Identification
Toxicity-200 is a vocabulary list for detecting toxic content in 200 languages. It includes common profanity, insulting terms, hate speech, pornographic terms, and body‑part terms related to sexual activity. Supported languages include ind, ace, bjn, bug, jav.
Source hugging_faceUpdated Jun 24, 2024176 viewsLinked
Inspect dataset