DATASET

Open Source Community

wentingzhao/proofwriter

The proofwriter dataset includes training, validation, and test splits, each with corresponding file paths and statistics. Features include facts, rules, question, answer, depth, length, used_facts, and used_rules.

Updated 2/27/2024

hugging_face

Description

Dataset Overview

Dataset Configuration

Default configuration:
- Training set: path is data/train-*
- Validation set: path is data/validation-*
- Test set: path is data/test-*

Dataset Information

Features

facts: sequence of strings
rules: sequence of strings
question: string
answer: string
depth: 64‑bit integer
len: 64‑bit integer
used_facts: sequence of sequences of strings
used_rules: sequence of sequences of strings

Data Split

Training set:
- Bytes: 229 844 873
- Samples: 348 796
Validation set:
- Bytes: 32 946 645
- Samples: 50 844
Test set:
- Bytes: 69 869 618
- Samples: 100 450

Dataset Size

Download size: 19 864 349 bytes
Dataset size: 332 661 136 bytes

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Logical Reasoning

Automated Theorem Generation

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.

Check Prices →