Datasets | JuheAPI

erhwenkuo/alpaca-data-gpt4-chinese-zhtw

Text Generation

Model Fine-tuning

The dataset named alpaca-data-gpt4-chinese-zhtw contains traditional Chinese instruction‑following data generated by GPT‑4 for fine‑tuning large language models. The dataset originates from a GitHub repository and is a Chinese translation of the original English version. It comprises 52 K instruction‑following entries, formatted like the Alpaca dataset, but with outputs generated by GPT‑4. The three primary fields are: instruction (task description), input (optional task context or input), and output (GPT‑4‑generated answer). Compared with the original Alpaca dataset, this version leverages GPT‑4 for response generation, resulting in higher quality and longer responses. The dataset is suitable for text generation, dialogue, and question‑answering tasks.

hugging_face

View Details

Dataset Hub

Browse by Category

erhwenkuo/alpaca-data-gpt4-chinese-zhtw