Explore high-quality datasets for your AI and machine learning projects.
GSM8K_zh is a Chinese dataset tailored for mathematical reasoning, consisting of problems and answers translated from the English GSM8K dataset. It includes 7,473 training samples and 1,319 test samples. Training samples contain full questions and answers; test samples provide only the translated questions. The dataset is suitable for Chinese–English question‑answering tasks, especially for mathematical problem solving.