pranjali97/labelled_vi_ko_raw_text

The dataset named labelled_vi_ko_raw_text includes three primary features: src (source text), tgt (target text), and classifier_labels (classification labels). The dataset is primarily used for training, containing 40,000 samples, with a total data size of 9,844,626 bytes and a download size of 5,466,676 bytes.

Updated 3/12/2023

hugging_face

Description

Dataset Overview

Dataset Information

Feature Fields:
- src: type is string
- tgt: type is string
- classifier_labels: type is int64

Data Splits

Training Set:
- Bytes: 9844626
- Number of Samples: 40000

Dataset Size

Download Size: 5466676 bytes
Dataset Size: 9844626 bytes

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Please login to view download links and access full dataset details.

Topics

Machine Translation

Text Classification

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.

Check Prices →