DATASET

Open Source Community

audichandra/bitext_customer_support_llm_dataset_indonesian

This dataset is the Bitext dataset translated into Indonesian using the Helsinki‑NLP/opus‑mt‑en‑id model. The original Bitext dataset is primarily used for training customer‑support LLM chatbots.

Updated 3/3/2024

hugging_face

Description

Dataset Overview

Base Dataset

Name: Bitext Customer Support LLM Chatbot Training Dataset
Source: Bitext

Translation Information

Target Language: Indonesian
Translation Model: Helsinki‑NLP/opus‑mt‑en‑id

Citation Information

OPUS‑MT Model: bash @InProceedings{TiedemannThottingal:EAMT2020, author = {J{"o}rg Tiedemann and Santhosh Thottingal}, title = {{OPUS-MT} — {B}uilding open translation services for the {W}orld}, booktitle = {Proceedings of the 22nd Annual Conferenec of the European Association for Machine Translation (EAMT)}, year = {2020}, address = {Lisbon, Portugal} }
Bitext Dataset: bash @misc{bitext_chatbot_dataset, title={Bitext Customer Support LLM Chatbot Training Dataset}, author={{Bitext}}, year={2023}, howpublished={url{https://huggingface.co/datasets/bitext/Bitext-customer-support-llm-chatbot-training-dataset}} }

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Customer Support

Chatbot

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.

Check Prices →