JUHE API Marketplace
DATASET
Open Source Community

zhoukz/TinyStories-Qwen

A Chinese story dataset generated using Qwen series models, modeled after the TinyStories dataset. All data are AI‑generated; the dataset is unfiltered and does not guarantee uniform distribution, safety, harmlessness, or any other properties. The seed information used for generation was randomly selected without any specific meaning.

Updated 1/1/2024
hugging_face

Description

Dataset Overview

License

  • MIT License

Task Category

  • Text Generation

Language

  • Chinese

Configuration

  • Configuration Name: default
    • Data Files:
      • Training Set: data_???.jsonl
      • Validation Set: data_val_???.jsonl

Dataset Description

  • Chinese story collection generated using Qwen series models, modeled after the TinyStories dataset.
  • Dataset Characteristics:
    • Not a translation of the original dataset.
    • Does not follow the original dataset format.
    • All data are AI‑generated.
    • The dataset is unfiltered and does not guarantee uniform distribution, safety, harmlessness, or any other properties.
    • Seed information for generation is randomly selected, with no specific meaning.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Text Generation
Chinese Stories

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.