sst2_combined
The dataset includes three primary features: 'sentence' (string), 'label' (categorical with two classes: 0 for negative sentiment, 1 for positive sentiment), and 'idx' (integer index). The training set has 68,221 samples, the validation set 872 samples, and the test set 1,821 samples. Total download size is 3,403,184 bytes; total dataset size is 5,110,747 bytes.
Updated 12/14/2024
huggingface
Description
Dataset Overview
Dataset Information
- Features:
  - sentence: type string, representing the sentence.
  - label: type categorical, containing two classes:
    - 0: negative sentiment.
    - 1: positive sentiment.
  - idx: type integer, representing the index.
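The three features above can be sketched as a small record type. This is only an illustration of the schema, not code shipped with the dataset: the `Example` class, the `LABEL_NAMES` mapping, and the sample sentence are all made up here, and the check assumes every record uses only the two label ids listed.

```python
from dataclasses import dataclass

# Label ids as described in the feature list; the short names are an assumption.
LABEL_NAMES = {0: "negative", 1: "positive"}


@dataclass(frozen=True)
class Example:
    """Hypothetical container mirroring the three documented features."""

    sentence: str
    label: int
    idx: int

    def __post_init__(self):
        # Reject records that do not match the documented types and classes.
        if not isinstance(self.sentence, str):
            raise TypeError("sentence must be a string")
        if self.label not in LABEL_NAMES:
            raise ValueError(f"label must be 0 or 1, got {self.label!r}")
        if not isinstance(self.idx, int):
            raise TypeError("idx must be an integer")

    @property
    def label_name(self):
        return LABEL_NAMES[self.label]


# Invented record for illustration; it is not taken from the dataset.
example = Example(sentence="A warm and funny film.", label=1, idx=0)
print(example.label_name)
```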
- Dataset Split:
  - train: training set, 68,221 samples, occupying 4,787,855 bytes.
  - validation: validation set, 872 samples, occupying 106,252 bytes.
  - test: test set, 1,821 samples, occupying 216,640 bytes.
- Dataset Size:
  - Download size: 3,403,184 bytes.
  - Total dataset size: 5,110,747 bytes.
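As a quick consistency check on the figures above, the per-split byte counts can be summed and compared with the reported total. The sketch below uses only numbers from this card; the variable names are arbitrary. The download being smaller than the total is presumably due to compression of the stored files, which is an assumption and not something the card states.

```python
# Figures copied from the split and size listings on this card.
split_bytes = {"train": 4_787_855, "validation": 106_252, "test": 216_640}
split_rows = {"train": 68_221, "validation": 872, "test": 1_821}
total_bytes = 5_110_747
download_bytes = 3_403_184

# The three splits should account for the whole reported dataset size.
bytes_match = sum(split_bytes.values()) == total_bytes

# Share of samples in each split.
total_rows = sum(split_rows.values())
row_share = {name: rows / total_rows for name, rows in split_rows.items()}

# Ratio of download size to total dataset size.
download_ratio = download_bytes / total_bytes

print("split bytes add up:", bytes_match)
print("total samples:", total_rows)
for name, share in row_share.items():
    print(f"{name}: {share:.2%} of samples")
print(f"download / total size: {download_ratio:.3f}")
```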
Configuration
- Configuration name: default
- Data files:
  - train: path data/train-*
  - validation: path data/validation-*
  - test: path data/test-*
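The data file paths above are glob patterns, so each split may cover one or more files sharing a prefix. The sketch below shows how such patterns could assign files to splits, using Python's standard `fnmatch` module. The shard file names are hypothetical (the card does not list actual file names), and the hosting platform's own pattern resolution may differ in detail.

```python
from fnmatch import fnmatchcase

# Split patterns as listed in the default configuration.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for(path):
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return split
    return None


# Hypothetical file names, used only to exercise the patterns.
files = [
    "data/train-00000-of-00001.parquet",
    "data/validation-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

assignment = {path: split_for(path) for path in files}
for path, split in assignment.items():
    print(path, "->", split)
```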
Topics
Sentiment Analysis
Text Classification
Source
Organization: huggingface
Created: 12/14/2024