Back to datasets
Dataset assetOpen Source CommunitySentiment AnalysisText Classification

mr

This dataset is intended for text‑classification tasks and contains two features: the text content and a label. Labels are binary, with 'neg' (negative) and 'pos' (positive). The data are split into training, validation, and test sets for model training, validation, and testing, respectively.

Source
huggingface
Created
Nov 28, 2024
Updated
Nov 28, 2024
Signals
384 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Information

  • Features:
    • text: String type.
    • label: Categorical label with two classes:
      • 0: negative (neg)
      • 1: positive (pos)

Data Splits

  • train:
    • Sample count: 8,530
    • Size: 1,074,806 bytes
  • validation:
    • Sample count: 1,066
    • Size: 134,675 bytes
  • test:
    • Sample count: 1,066
    • Size: 135,968 bytes

Dataset Size

  • Download Size: 886,815 bytes
  • Total Size: 1,345,449 bytes

Configuration

  • Config Name: default
  • Data File Paths:
    • train: data/train-*
    • validation: data/validation-*
    • test: data/test-*
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio