Back to datasets
Dataset assetOpen Source CommunityUser InteractionOnline Communities
stackexchange_stats
The dataset comprises three main features: 'instruction' (string), 'completion' (string), and 'conversations' (list of dictionaries each containing 'from' and 'value' strings). The dataset is split into a training set with 479 samples. Download size is 1,480,576 bytes; total dataset size is 4,176,676 bytes.
Source
huggingface
Created
Dec 14, 2024
Updated
Dec 23, 2024
Signals
210 views
Availability
Linked source ready
Overview
Dataset description and usage context
Dataset Overview
Dataset Information
- Features:
- instruction: type string.
- completion: type string.
- conversations: contains the following sub‑features:
- from: type string.
- value: type string.
Dataset Split
- train:
- num_bytes: 386,997,140 bytes
- num_examples: 50,000 samples
Dataset Size
- download_size: 202,954,190 bytes
- dataset_size: 386,997,140 bytes
Configuration
- config_name: default
- data_files:
- split: train
- path: data/train-*
- data_files:
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.