zhoukz/TinyStories-Qwen

A Chinese story dataset generated using Qwen series models, modeled after the TinyStories dataset. All data are AI‑generated; the dataset is unfiltered and does not guarantee uniform distribution, safety, harmlessness, or any other properties. The seed information used for generation was randomly selected without any specific meaning.

Updated 1/1/2024

hugging_face

Description

Dataset Overview

License

MIT License

Task Category

Text Generation

Language

Chinese

Configuration

Configuration Name: default
- Data Files:
  - Training Set: data_???.jsonl
  - Validation Set: data_val_???.jsonl

Dataset Description

Chinese story collection generated using Qwen series models, modeled after the TinyStories dataset.
Dataset Characteristics:
- Not a translation of the original dataset.
- Does not follow the original dataset format.
- All data are AI‑generated.
- The dataset is unfiltered and does not guarantee uniform distribution, safety, harmlessness, or any other properties.
- Seed information for generation is randomly selected, with no specific meaning.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Please login to view download links and access full dataset details.

Topics

Text Generation

Chinese Stories

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.

Check Prices →