shareAI/doc2markmap
This dataset is designed to improve the ability of small‑parameter language models to convert articles into markmaps (markdown‑based mind maps). The source documents were collected from WeChat public accounts and CSDN, then processed through multiple rounds of transformation and cleaning using large language models and complex prompting. The dataset is intended for research and educational purposes only.
Dataset description and usage context
doc2markmap Dataset Overview
Basic Information
- License: Apache 2.0
- Language: Chinese
- Tags: markdown, markmap, mindmap
- Scale: n < 1K
Description
- Goal: Enhance small‑parameter language models' capability to transform articles into markmaps (markdown‑style mind maps).
- Source: Collected from WeChat public accounts and CSDN.
- Processing: Multiple rounds of conversion and cleaning using large language models with sophisticated prompts.
- Usage Restriction: Research and learning use only.
Citation
@misc{shareAI-doc2markmap-2024, author = {Xinlu Lai, shareAI}, title = {The dataset for convert document to markmap}, year = {2024}, publisher = {huggingface}, journal = {huggingface repository}, howpublished = {url{https://huggingface.co/datasets/shareAI/doc2markmap}} }
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.