Back to datasets
Dataset assetOpen Source CommunityUser InteractionOnline Communities

stackexchange_stats

The dataset comprises three main features: 'instruction' (string), 'completion' (string), and 'conversations' (list of dictionaries each containing 'from' and 'value' strings). The dataset is split into a training set with 479 samples. Download size is 1,480,576 bytes; total dataset size is 4,176,676 bytes.

Source
huggingface
Created
Dec 14, 2024
Updated
Dec 23, 2024
Signals
210 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Information

  • Features:
    • instruction: type string.
    • completion: type string.
    • conversations: contains the following sub‑features:
      • from: type string.
      • value: type string.

Dataset Split

  • train:
    • num_bytes: 386,997,140 bytes
    • num_examples: 50,000 samples

Dataset Size

  • download_size: 202,954,190 bytes
  • dataset_size: 386,997,140 bytes

Configuration

  • config_name: default
    • data_files:
      • split: train
      • path: data/train-*
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio