afmck/text8
The dataset has three splits: train, validation, and test. Each split contains a single sample: the training sample is 90,000,004 bytes, and the validation and test samples are 5,000,004 bytes each. The only feature is text (string). The download size is 54,357,043 bytes and the total dataset size is 100,000,012 bytes. The configuration is named default, and its data file paths are data/train-*, data/validation-*, and data/test-*.
Updated 1/15/2024
hugging_face
Description
Dataset Overview
Data Features
- Name: text
- Data Type: string
Data Splits
- Training Set
  - Bytes: 90,000,004
  - Samples: 1
- Validation Set
  - Bytes: 5,000,004
  - Samples: 1
- Test Set
  - Bytes: 5,000,004
  - Samples: 1
Data Size
- Download Size: 54,357,043 bytes
- Dataset Size: 100,000,012 bytes
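
The figures above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python that only uses the numbers listed on this card; the variable names are illustrative and not part of the dataset.

```python
# Byte counts copied from the Data Splits and Data Size sections of this card.
split_bytes = {
    "train": 90_000_004,
    "validation": 5_000_004,
    "test": 5_000_004,
}
dataset_size = 100_000_012   # "Dataset Size" on the card
download_size = 54_357_043   # "Download Size" on the card

# The three split sizes should add up to the reported dataset size.
total = sum(split_bytes.values())
assert total == dataset_size

# Share of the dataset held by each split, and download-to-dataset size ratio.
fractions = {name: n / total for name, n in split_bytes.items()}
download_ratio = download_size / dataset_size

for name, frac in fractions.items():
    print(f"{name}: {frac:.1%}")
print(f"download/dataset ratio: {download_ratio:.3f}")
```

The split sizes sum exactly to the dataset size, the splits are roughly 90% / 5% / 5% of the total, and the download is about 0.54 times the dataset size.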
Configuration Information
- Configuration Name: default
- Data File Paths
  - Training: data/train-*
  - Validation: data/validation-*
  - Test: data/test-*
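
The data file paths are glob patterns, one per split. As a rough illustration of how such patterns map files to splits, here is a small Python sketch. The helper `split_for` and the example file name are hypothetical; the card lists only the patterns, not the actual file names.

```python
from fnmatch import fnmatchcase

# Glob patterns copied from the Configuration Information section.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}

def split_for(path):
    """Return the split whose pattern matches `path`, or None if none does."""
    for split, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return split
    return None

# Hypothetical file name, used only to exercise the patterns.
print(split_for("data/train-00000-of-00001.parquet"))
```

Any path that begins with `data/train-` resolves to the train split, and likewise for the other two patterns; paths that match none of them return `None`.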
Topics
Natural Language Processing
Text Analysis
Source
Organization: hugging_face
Created: Unknown