JUHE API Marketplace
DATASET
Open Source Community

audichandra/bitext_customer_support_llm_dataset_indonesian

This dataset is the Bitext dataset translated into Indonesian using the Helsinki‑NLP/opus‑mt‑en‑id model. The original Bitext dataset is primarily used for training customer‑support LLM chatbots.

Updated 3/3/2024
hugging_face

Description

Dataset Overview

Base Dataset

  • Name: Bitext Customer Support LLM Chatbot Training Dataset
  • Source: Bitext

Translation Information

  • Target Language: Indonesian
  • Translation Model: Helsinki‑NLP/opus‑mt‑en‑id

Citation Information

  • OPUS‑MT Model: bash @InProceedings{TiedemannThottingal:EAMT2020, author = {J{"o}rg Tiedemann and Santhosh Thottingal}, title = {{OPUS-MT} — {B}uilding open translation services for the {W}orld}, booktitle = {Proceedings of the 22nd Annual Conferenec of the European Association for Machine Translation (EAMT)}, year = {2020}, address = {Lisbon, Portugal} }

  • Bitext Dataset: bash @misc{bitext_chatbot_dataset, title={Bitext Customer Support LLM Chatbot Training Dataset}, author={{Bitext}}, year={2023}, howpublished={url{https://huggingface.co/datasets/bitext/Bitext-customer-support-llm-chatbot-training-dataset}} }

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Customer Support
Chatbot

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.