Tags: Dataset asset, Open Source Community, Machine Learning, Hate Speech

Rhma/CONAN

The dataset has nine fields: cn_id, hateSpeech, counterSpeech, hsType, hsSubType, cnType, age, gender, and educationLevel. All are strings except age, which is a floating‑point number. There is a single split, **train**, with 14,988 examples totaling 4,432,994 bytes; the download size is 696,348 bytes. The configuration name is **default**, and the data files match the pattern `data/train-*`.

Source: hugging_face
Created: Nov 28, 2025
Updated: May 15, 2024
Signals: 286 views
Availability: Linked source ready

Dataset Overview

Dataset Features

  • cn_id: String
  • hateSpeech: String
  • counterSpeech: String
  • hsType: String
  • hsSubType: String
  • cnType: String
  • age: Float
  • gender: String
  • educationLevel: String
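
The feature list above can be turned into a small schema check. The following is a minimal sketch, not part of the dataset's tooling: `EXPECTED_TYPES` and `validate_record` are hypothetical names, and tolerating null values is an assumption, since the listing does not say whether any field is nullable.

```python
# Field names and types copied from the feature list above.
EXPECTED_TYPES = {
    "cn_id": str,
    "hateSpeech": str,
    "counterSpeech": str,
    "hsType": str,
    "hsSubType": str,
    "cnType": str,
    "age": float,
    "gender": str,
    "educationLevel": str,
}


def validate_record(row: dict) -> list:
    """Return a list of schema problems; an empty list means the row conforms."""
    problems = []
    for name, expected in EXPECTED_TYPES.items():
        if name not in row:
            problems.append(f"missing field: {name}")
            continue
        value = row[name]
        # Assumption: None is tolerated because nullability is not documented.
        if value is not None and not isinstance(value, expected):
            problems.append(
                f"{name}: expected {expected.__name__}, got {type(value).__name__}"
            )
    # Report any columns that the feature list does not mention.
    extra = sorted(set(row) - set(EXPECTED_TYPES))
    problems.extend(f"unexpected field: {name}" for name in extra)
    return problems
```

Calling `validate_record(row)` on a row dictionary returns an empty list when every listed field is present with the expected type. Note that the check is strict about `age`: an integer value would be reported, because the listing declares the field as a float.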

Dataset Splits

  • Training Set:
    • Data size: 4,432,994 bytes
    • Number of examples: 14,988

Dataset Size

  • Download size: 696,348 bytes
  • Total dataset size: 4,432,994 bytes
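
The two size figures differ by a large factor, presumably because the hosted data files are stored compressed; the listing does not state the file format, so that reading is an assumption. A quick calculation from the numbers above gives the ratio and the average size of one example:

```python
# Figures copied from the listing above.
DOWNLOAD_BYTES = 696_348
DATASET_BYTES = 4_432_994
NUM_EXAMPLES = 14_988

# How much larger the loaded dataset is than the download.
compression_ratio = DATASET_BYTES / DOWNLOAD_BYTES

# Average size of a single example in the train split.
bytes_per_example = DATASET_BYTES / NUM_EXAMPLES

print(f"{compression_ratio:.2f}x")                    # prints "6.37x"
print(f"{bytes_per_example:.0f} bytes per example")   # prints "296 bytes per example"
```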

Configuration

  • Default configuration:
    • Data file pattern: data/train-*
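
The configuration and split named above map onto the arguments of `load_dataset` in the Hugging Face `datasets` library. This is a minimal sketch that assumes the Hub repository id matches the listing title (`Rhma/CONAN`) and that the `datasets` package is installed; `load_train_split` is a hypothetical helper name.

```python
REPO_ID = "Rhma/CONAN"    # assumption: the Hub id matches the listing title
CONFIG_NAME = "default"   # configuration name from the listing
SPLIT = "train"           # the only split the listing reports


def load_train_split():
    """Download and return the train split (requires network access)."""
    # Imported lazily so this module can be read without the dependency.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name=CONFIG_NAME, split=SPLIT)
```

Calling `load_train_split()` should return a `Dataset` object; comparing its `num_rows` against the 14,988 examples reported in the listing, and its `column_names` against the feature list, is a reasonable first sanity check.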