Dataset asset, Open Source Community, Machine Learning, Hate Speech
Rhma/CONAN
The dataset has nine feature fields: cn_id, hateSpeech, counterSpeech, hsType, hsSubType, cnType, age, gender, and educationLevel. All fields are strings except age, which is a floating‑point number. The dataset has a single split, **train**, with 14,988 examples totaling 4,432,994 bytes; the download size is 696,348 bytes. The configuration name is **default**, and the data files are located at `data/train-*`.
Source
hugging_face
Created
Nov 28, 2025
Updated
May 15, 2024
Signals
286 views
Availability
Linked source ready
Overview
Dataset description and usage context
Dataset Overview
Dataset Features
- cn_id: String
- hateSpeech: String
- counterSpeech: String
- hsType: String
- hsSubType: String
- cnType: String
- age: Float
- gender: String
- educationLevel: String
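The feature list above can be turned into a simple record check. The following is a minimal sketch, not part of the dataset's own tooling: the names `EXPECTED_FIELDS`, `check_record`, and `sample` are hypothetical, the sample values are placeholders rather than real rows, and treating `None` as an acceptable missing value is an assumption.

```python
from typing import Any

# Field names and types as listed on the dataset card.
EXPECTED_FIELDS: dict[str, type] = {
    "cn_id": str,
    "hateSpeech": str,
    "counterSpeech": str,
    "hsType": str,
    "hsSubType": str,
    "cnType": str,
    "age": float,
    "gender": str,
    "educationLevel": str,
}


def check_record(record: dict[str, Any]) -> list[str]:
    """Return a list of problems; the list is empty if the record matches."""
    problems: list[str] = []
    for name, expected in EXPECTED_FIELDS.items():
        if name not in record:
            problems.append(f"missing field: {name}")
            continue
        value = record[name]
        # Assumption: missing values may be stored as None, so None is tolerated.
        if value is not None and not isinstance(value, expected):
            problems.append(
                f"{name}: expected {expected.__name__}, got {type(value).__name__}"
            )
    # Report any fields that the card does not list.
    for name in sorted(set(record) - set(EXPECTED_FIELDS)):
        problems.append(f"unexpected field: {name}")
    return problems


# Hypothetical placeholder record (not taken from the dataset).
sample = {
    "cn_id": "EXAMPLE-0001",
    "hateSpeech": "<hate speech text>",
    "counterSpeech": "<counter-narrative text>",
    "hsType": "<type>",
    "hsSubType": "<subtype>",
    "cnType": "<counter-narrative type>",
    "age": 30.0,
    "gender": "<gender>",
    "educationLevel": "<education level>",
}

print(check_record(sample))
```

A check like this is mainly useful after a transformation step, to confirm that no column was dropped or silently converted (for example, age becoming a string).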
Dataset Splits
- Training Set:
  - Data size: 4,432,994 bytes
  - Number of examples: 14,988
Dataset Size
- Download size: 696,348 bytes
- Total dataset size: 4,432,994 bytes
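The size figures above support two quick sanity checks: the average size of one example, and how much larger the prepared dataset is than the download. The sketch below only does arithmetic on the numbers from the card; the variable names are illustrative, and reading the download size as the size of the compressed data files is an assumption.

```python
# Figures copied from the dataset card.
TRAIN_EXAMPLES = 14_988
DATASET_BYTES = 4_432_994
DOWNLOAD_BYTES = 696_348

# Average size of one example in the prepared dataset.
avg_bytes_per_example = DATASET_BYTES / TRAIN_EXAMPLES

# How many times larger the prepared dataset is than the download.
# Assumption: the download consists of compressed data files.
size_ratio = DATASET_BYTES / DOWNLOAD_BYTES

print(f"average bytes per example: {avg_bytes_per_example:.1f}")
print(f"dataset size / download size: {size_ratio:.2f}")
```

This works out to roughly 296 bytes per example and a ratio of about 6.4, which suggests short text pairs with fairly repetitive metadata columns.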
Configuration
- Default configuration:
  - Data file pattern: `data/train-*`
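The configuration above can be used to load the train split. This is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable under the name shown on the card. The helper names `load_conan_train` and `matches_train_pattern` are hypothetical, and the shard filenames in the usage lines are made-up examples of a common naming style, not confirmed file names.

```python
from fnmatch import fnmatch

# Values taken from the dataset card.
REPO_ID = "Rhma/CONAN"
SPLIT = "train"
DATA_FILE_PATTERN = "data/train-*"
EXPECTED_EXAMPLES = 14_988


def matches_train_pattern(path: str) -> bool:
    """Check whether a repository file path falls under the train pattern."""
    return fnmatch(path, DATA_FILE_PATTERN)


def load_conan_train():
    """Load the train split; requires network access and the datasets package."""
    # Imported lazily so this module can be imported without the dependency.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=SPLIT)


# Hypothetical shard names, used only to illustrate the glob pattern.
print(matches_train_pattern("data/train-00000-of-00001.parquet"))
print(matches_train_pattern("data/test-00000-of-00001.parquet"))
```

After calling `load_conan_train()`, comparing `len(ds)` against `EXPECTED_EXAMPLES` is a cheap way to confirm that the split matches the card.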