audichandra/bitext_customer_support_llm_dataset_indonesian
This dataset is the Bitext dataset translated into Indonesian using the Helsinki‑NLP/opus‑mt‑en‑id model. The original Bitext dataset is primarily used for training customer‑support LLM chatbots.
Description
Dataset Overview
Base Dataset
- Name: Bitext Customer Support LLM Chatbot Training Dataset
- Source: Bitext
Translation Information
- Target Language: Indonesian
- Translation Model: Helsinki‑NLP/opus‑mt‑en‑id
Citation Information
-
OPUS‑MT Model: bash @InProceedings{TiedemannThottingal:EAMT2020, author = {J{"o}rg Tiedemann and Santhosh Thottingal}, title = {{OPUS-MT} — {B}uilding open translation services for the {W}orld}, booktitle = {Proceedings of the 22nd Annual Conferenec of the European Association for Machine Translation (EAMT)}, year = {2020}, address = {Lisbon, Portugal} }
-
Bitext Dataset: bash @misc{bitext_chatbot_dataset, title={Bitext Customer Support LLM Chatbot Training Dataset}, author={{Bitext}}, year={2023}, howpublished={url{https://huggingface.co/datasets/bitext/Bitext-customer-support-llm-chatbot-training-dataset}} }
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.