DATASET
Open Source Community
pranjali97/labelled_vi_ko_raw_text
The dataset named labelled_vi_ko_raw_text includes three primary features: src (source text), tgt (target text), and classifier_labels (classification labels). The dataset is primarily used for training, containing 40,000 samples, with a total data size of 9,844,626 bytes and a download size of 5,466,676 bytes.
Updated 3/12/2023
hugging_face
Description
Dataset Overview
Dataset Information
- Feature Fields:
src: type isstringtgt: type isstringclassifier_labels: type isint64
Data Splits
- Training Set:
- Bytes: 9844626
- Number of Samples: 40000
Dataset Size
- Download Size: 5466676 bytes
- Dataset Size: 9844626 bytes
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Machine Translation
Text Classification
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.