sst2_combined

The dataset includes three primary features: 'sentence' (string), 'label' (categorical with two classes: 0 for negative sentiment, 1 for positive sentiment), and 'idx' (integer index). The training set has 68,221 samples, the validation set 872 samples, and the test set 1,821 samples. Total download size is 3,403,184 bytes; total dataset size is 5,110,747 bytes.

Updated 12/14/2024
huggingface

Description

Dataset Overview

Dataset Information

  • Features:

    • sentence: string; the text of the sentence.
    • label: categorical, with two classes:
      • 0: negative sentiment.
      • 1: positive sentiment.
    • idx: integer; the index of the sample.
  • Dataset Split:

    • train: training set, 68,221 samples, occupying 4,787,855 bytes.
    • validation: validation set, 872 samples, occupying 106,252 bytes.
    • test: test set, 1,821 samples, occupying 216,640 bytes.
  • Dataset Size:

    • Download size: 3,403,184 bytes.
    • Total dataset size: 5,110,747 bytes.
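
As a quick consistency check on the figures above, the following minimal Python sketch records the documented features and split statistics and confirms that the per-split byte counts add up to the stated total dataset size. The names `LABEL_NAMES`, `SPLITS`, and `is_valid_record` are illustrative helpers and are not part of the dataset itself; the numbers are copied from this card.

```python
# Illustrative sketch: the names below are hypothetical helpers, not part of the dataset.

# Label ids as described on the card.
LABEL_NAMES = {0: "negative", 1: "positive"}

# Per-split sample counts and sizes in bytes, copied from the card.
SPLITS = {
    "train": {"num_examples": 68_221, "num_bytes": 4_787_855},
    "validation": {"num_examples": 872, "num_bytes": 106_252},
    "test": {"num_examples": 1_821, "num_bytes": 216_640},
}
TOTAL_DATASET_BYTES = 5_110_747


def is_valid_record(record: dict) -> bool:
    """Check that a record has the three documented features with the documented types."""
    return (
        isinstance(record.get("sentence"), str)
        and record.get("label") in LABEL_NAMES
        and isinstance(record.get("idx"), int)
    )


total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The split sizes should sum to the total dataset size stated on the card.
print(total_bytes == TOTAL_DATASET_BYTES)

# Share of samples held by each split.
for name, s in SPLITS.items():
    print(f"{name}: {s['num_examples'] / total_examples:.1%} of samples")
```

The split sizes do sum to the listed total of 5,110,747 bytes. The smaller download size of 3,403,184 bytes presumably reflects compressed storage, though the card does not say so explicitly.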

Configuration

  • Configuration name: default
  • Data files:
    • train: path data/train-*
    • validation: path data/validation-*
    • test: path data/test-*
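
The configuration above maps each split name to a glob pattern. A minimal sketch of how such patterns select files, using Python's standard `fnmatch` module, might look like the following. The shard file names are hypothetical (the card does not list the actual files), and `resolve_splits` is an illustrative helper rather than part of any library.

```python
from fnmatch import fnmatch

# Split-to-pattern mapping from the "default" configuration on the card.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def resolve_splits(paths, data_files=DATA_FILES):
    """Group file paths by the split whose glob pattern they match."""
    return {
        split: sorted(p for p in paths if fnmatch(p, pattern))
        for split, pattern in data_files.items()
    }


# Hypothetical shard names, for illustration only; the real file names
# are not given on the card.
example_paths = [
    "data/train-00000-of-00001.parquet",
    "data/validation-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

for split, files in resolve_splits(example_paths).items():
    print(split, files)
```

A mapping of this shape (split name to path pattern) is also what the Hugging Face `datasets` library accepts through the `data_files` argument of `load_dataset`, so the same dictionary could in principle be reused there once the repository identifier and file format are known.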

Topics

Sentiment Analysis
Text Classification

Source

Organization: huggingface

Created: 12/14/2024