DATASET
Open Source Community
zhoukz/TinyStories-Qwen
A Chinese story dataset generated using Qwen series models, modeled after the TinyStories dataset. All data are AI‑generated; the dataset is unfiltered and does not guarantee uniform distribution, safety, harmlessness, or any other properties. The seed information used for generation was randomly selected without any specific meaning.
Updated 1/1/2024
hugging_face
Description
Dataset Overview
License
- MIT License
Task Category
- Text Generation
Language
- Chinese
Configuration
- Configuration Name: default
- Data Files:
- Training Set: data_???.jsonl
- Validation Set: data_val_???.jsonl
- Data Files:
Dataset Description
- Chinese story collection generated using Qwen series models, modeled after the TinyStories dataset.
- Dataset Characteristics:
- Not a translation of the original dataset.
- Does not follow the original dataset format.
- All data are AI‑generated.
- The dataset is unfiltered and does not guarantee uniform distribution, safety, harmlessness, or any other properties.
- Seed information for generation is randomly selected, with no specific meaning.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Text Generation
Chinese Stories
Source
Organization: hugging_face
Created: Unknown
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.