mr

This dataset is intended for text-classification tasks and contains two features: the text content and a label. Labels are binary: 'neg' (negative) and 'pos' (positive). The data are split into training, validation, and test sets.

Updated 11/28/2024

huggingface

Description

Dataset Overview

Dataset Information

Features:
- text: String type.
- label: Categorical label with two classes:
  - 0: negative (neg)
  - 1: positive (pos)
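
The integer-to-class mapping above can be captured in a small lookup table. The following is a minimal sketch; the names `ID2LABEL`, `LABEL2ID`, `decode_label`, and `encode_label` are illustrative and not part of the dataset itself.

```python
# Label ids and class names as listed in the feature description.
ID2LABEL = {0: "neg", 1: "pos"}
LABEL2ID = {name: idx for idx, name in ID2LABEL.items()}


def decode_label(label_id: int) -> str:
    """Return the class name for an integer label; raise on unknown ids."""
    if label_id not in ID2LABEL:
        raise ValueError(f"unknown label id: {label_id}")
    return ID2LABEL[label_id]


def encode_label(name: str) -> int:
    """Return the integer label for a class name; raise on unknown names."""
    if name not in LABEL2ID:
        raise ValueError(f"unknown label name: {name}")
    return LABEL2ID[name]
```

Raising on unknown values, rather than returning a default, makes a mislabeled row fail early instead of being counted as one of the two classes.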

Data Splits

train:
- Sample count: 8,530
- Size: 1,074,806 bytes
validation:
- Sample count: 1,066
- Size: 134,675 bytes
test:
- Sample count: 1,066
- Size: 135,968 bytes
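
As a quick sanity check on the sample counts above, the share of each split can be computed directly. This is a small sketch; the variable names are illustrative.

```python
# Sample counts as listed for each split.
SPLIT_COUNTS = {"train": 8530, "validation": 1066, "test": 1066}

total = sum(SPLIT_COUNTS.values())
fractions = {name: count / total for name, count in SPLIT_COUNTS.items()}

for name, frac in fractions.items():
    print(f"{name}: {SPLIT_COUNTS[name]} samples ({frac:.1%})")
```

The counts work out to roughly an 80/10/10 split over 10,662 samples.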

Dataset Size

Download Size: 886,815 bytes
Total Size: 1,345,449 bytes
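
The Total Size figure can be cross-checked against the per-split sizes listed under Data Splits. The sketch below does that and also computes the ratio of Download Size to Total Size. That the smaller download reflects compressed storage is an assumption here, since the page does not state the file format.

```python
# Byte sizes as listed on this page.
SPLIT_BYTES = {"train": 1_074_806, "validation": 134_675, "test": 135_968}
DOWNLOAD_BYTES = 886_815
TOTAL_BYTES = 1_345_449

# The per-split sizes should add up to the reported total.
split_sum = sum(SPLIT_BYTES.values())
matches_total = split_sum == TOTAL_BYTES

# Fraction of the total size that is actually downloaded.
download_ratio = DOWNLOAD_BYTES / TOTAL_BYTES

print(f"sum of splits: {split_sum} bytes (matches total: {matches_total})")
print(f"download / total: {download_ratio:.2f}")
```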

Configuration

Config Name: default
Data File Paths:
- train: data/train-*
- validation: data/validation-*
- test: data/test-*
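
The path patterns above are globs: each split is made up of every file whose name starts with the given prefix. The sketch below shows how a file path maps to a split and how the same patterns could be passed to the Hugging Face `datasets` library. The repository id is not given on this page, so `repo_id` is left as a parameter, and the shard file name in the example is hypothetical.

```python
from fnmatch import fnmatch

# Glob patterns copied from the "default" configuration.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def split_for(path: str):
    """Return the split whose glob pattern matches `path`, or None."""
    for split, pattern in DATA_FILES.items():
        if fnmatch(path, pattern):
            return split
    return None


def load_splits(repo_id: str):
    """Load all three splits from a Hub repository.

    Requires the third-party `datasets` package; `repo_id` must be
    supplied by the caller because this page does not state it.
    """
    # Imported lazily so split_for() works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id, data_files=DATA_FILES)


# Hypothetical shard name, for illustration only.
print(split_for("data/train-00000-of-00001.parquet"))
```

When a repository already declares this configuration, passing `data_files` explicitly is usually unnecessary; it is shown here only to make the mapping from patterns to splits visible.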

Topics

Text Classification

Sentiment Analysis

Source

Organization: huggingface

Created: 11/28/2024
