JUHE API Marketplace
DATASET
Open Source Community

curated_20k_spanish

This dataset includes a feature named 'messages', which is a list containing two sub‑features: 'content' (string) and 'role' (string). The dataset is divided into a training split (train) with 20,207 samples, totaling 48,020,454 bytes. The download size is 24,914,380 bytes, and it is licensed under Apache 2.0. The language is Spanish.

Updated 12/16/2024
huggingface

Description

Dataset Overview

Dataset Information

  • Features:
    • messages:
      • content: data type is string
      • role: data type is string
  • Splits:
    • train:
      • Bytes: 48020454
      • Samples: 20207
  • Download Size: 24914380
  • Dataset Size: 48020454

Configuration

  • Configuration Name: default
    • Data Files:
      • Split: train
      • Path: data/train-*

License

  • License: apache-200

Language

  • Language: Spanish (es)

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Natural Language Processing
Spanish

Source

Organization: huggingface

Created: 12/15/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.