Back to datasets
Dataset assetOpen Source CommunityLanguage ModelsMind Maps

shareAI/doc2markmap

This dataset is designed to improve the ability of small‑parameter language models to convert articles into markmaps (markdown‑based mind maps). The source documents were collected from WeChat public accounts and CSDN, then processed through multiple rounds of transformation and cleaning using large language models and complex prompting. The dataset is intended for research and educational purposes only.

Source
hugging_face
Created
Nov 28, 2025
Updated
Jul 4, 2024
Signals
96 views
Availability
Linked source ready
Overview

Dataset description and usage context

doc2markmap Dataset Overview

Basic Information

  • License: Apache 2.0
  • Language: Chinese
  • Tags: markdown, markmap, mindmap
  • Scale: n < 1K

Description

  • Goal: Enhance small‑parameter language models' capability to transform articles into markmaps (markdown‑style mind maps).
  • Source: Collected from WeChat public accounts and CSDN.
  • Processing: Multiple rounds of conversion and cleaning using large language models with sophisticated prompts.
  • Usage Restriction: Research and learning use only.

Citation

@misc{shareAI-doc2markmap-2024, author = {Xinlu Lai, shareAI}, title = {The dataset for convert document to markmap}, year = {2024}, publisher = {huggingface}, journal = {huggingface repository}, howpublished = {url{https://huggingface.co/datasets/shareAI/doc2markmap}} }

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio