Dataset assetOpen Source CommunityUser InteractionOnline Communities

stackexchange_stats

The dataset comprises three main features: 'instruction' (string), 'completion' (string), and 'conversations' (list of dictionaries each containing 'from' and 'value' strings). The dataset is split into a training set with 479 samples. Download size is 1,480,576 bytes; total dataset size is 4,176,676 bytes.

Source

huggingface

Created

Dec 14, 2024

Updated

Dec 23, 2024

Signals

210 views

Availability

Linked source ready

Overview

Dataset description and usage context

Dataset Overview

Dataset Information

Features:
- instruction: type string.
- completion: type string.
- conversations: contains the following sub‑features:
  - from: type string.
  - value: type string.

Dataset Split

train:
- num_bytes: 386,997,140 bytes
- num_examples: 50,000 samples

Dataset Size

download_size: 202,954,190 bytes
dataset_size: 386,997,140 bytes

Configuration

config_name: default
- data_files:
  - split: train
  - path: data/train-*

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio