JUHE API Marketplace
DATASET
Open Source Community

pranjali97/labelled_vi_ko_raw_text

The dataset named labelled_vi_ko_raw_text includes three primary features: src (source text), tgt (target text), and classifier_labels (classification labels). The dataset is primarily used for training, containing 40,000 samples, with a total data size of 9,844,626 bytes and a download size of 5,466,676 bytes.

Updated 3/12/2023
hugging_face

Description

Dataset Overview

Dataset Information

  • Feature Fields:
    • src: type is string
    • tgt: type is string
    • classifier_labels: type is int64

Data Splits

  • Training Set:
    • Bytes: 9844626
    • Number of Samples: 40000

Dataset Size

  • Download Size: 5466676 bytes
  • Dataset Size: 9844626 bytes

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Machine Translation
Text Classification

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.