DATASET
Open Source Community
curated_20k_spanish
This dataset includes a feature named 'messages', which is a list containing two sub‑features: 'content' (string) and 'role' (string). The dataset is divided into a training split (train) with 20,207 samples, totaling 48,020,454 bytes. The download size is 24,914,380 bytes, and it is licensed under Apache 2.0. The language is Spanish.
Updated 12/16/2024
huggingface
Description
Dataset Overview
Dataset Information
- Features:
- messages:
- content: data type is string
- role: data type is string
- messages:
- Splits:
- train:
- Bytes: 48020454
- Samples: 20207
- train:
- Download Size: 24914380
- Dataset Size: 48020454
Configuration
- Configuration Name: default
- Data Files:
- Split: train
- Path: data/train-*
- Data Files:
License
- License: apache-200
Language
- Language: Spanish (es)
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Natural Language Processing
Spanish
Source
Organization: huggingface
Created: 12/15/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.