wentingzhao/proofwriter

The proofwriter dataset provides training, validation, and test splits; the file path pattern, byte size, and sample count of each split are listed below. Every record has eight features: facts, rules, question, answer, depth, len, used_facts, and used_rules.

Updated 2/27/2024
hugging_face

Description

Dataset Overview

Dataset Configuration

  • Default configuration:
    • Training set: path is data/train-*
    • Validation set: path is data/validation-*
    • Test set: path is data/test-*
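
The split-to-path mapping above can be sketched in Python as follows. This is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` library is installed and that the repository ID on the Hub matches the dataset title, `wentingzhao/proofwriter`. The names `SPLIT_PATHS` and `load_proofwriter_split` are hypothetical and introduced only for illustration.

```python
# Split name -> shard glob, copied from the default configuration.
SPLIT_PATHS = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}


def load_proofwriter_split(split: str = "train"):
    """Load one split from the Hub (hypothetical helper; needs network access)."""
    if split not in SPLIT_PATHS:
        raise ValueError(
            f"unknown split {split!r}; expected one of {sorted(SPLIT_PATHS)}"
        )
    # Imported lazily so this module still imports without `datasets` installed.
    from datasets import load_dataset

    # Assumption: the Hub repository ID is the same as the dataset title.
    return load_dataset("wentingzhao/proofwriter", split=split)
```

Calling `load_proofwriter_split("validation")` would then download and return only the validation split, assuming the repository is publicly reachable.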

Dataset Information

Features

  • facts: sequence of strings
  • rules: sequence of strings
  • question: string
  • answer: string
  • depth: 64‑bit integer
  • len: 64‑bit integer
  • used_facts: sequence of sequences of strings
  • used_rules: sequence of sequences of strings
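
The feature list above can be expressed as a typed record with a small structural check. This is a sketch under stated assumptions: the type name `ProofWriterRecord` and the helper `matches_schema` are hypothetical, and the example record contains invented placeholder values, not data taken from the dataset.

```python
from typing import List, TypedDict


class ProofWriterRecord(TypedDict):
    """One record, mirroring the eight listed features."""
    facts: List[str]
    rules: List[str]
    question: str
    answer: str
    depth: int
    len: int
    used_facts: List[List[str]]
    used_rules: List[List[str]]


def _is_str_list(value) -> bool:
    return isinstance(value, list) and all(isinstance(v, str) for v in value)


def _is_nested_str_list(value) -> bool:
    return isinstance(value, list) and all(_is_str_list(v) for v in value)


def _is_int(value) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def matches_schema(record: dict) -> bool:
    """Return True if the record has exactly the listed fields and types."""
    checks = {
        "facts": _is_str_list,
        "rules": _is_str_list,
        "question": lambda v: isinstance(v, str),
        "answer": lambda v: isinstance(v, str),
        "depth": _is_int,
        "len": _is_int,
        "used_facts": _is_nested_str_list,
        "used_rules": _is_nested_str_list,
    }
    return set(record) == set(checks) and all(
        check(record[name]) for name, check in checks.items()
    )


# Invented placeholder values for illustration only (not from the dataset).
example: ProofWriterRecord = {
    "facts": ["placeholder fact"],
    "rules": ["placeholder rule"],
    "question": "placeholder question",
    "answer": "placeholder answer",
    "depth": 1,
    "len": 1,
    "used_facts": [["placeholder fact"]],
    "used_rules": [["placeholder rule"]],
}
```

The check is purely structural: it confirms field names and Python types, and says nothing about what values `answer`, `depth`, or `len` may take, since the listing does not specify them.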

Data Split

  • Training set:
    • Bytes: 229 844 873
    • Samples: 348 796
  • Validation set:
    • Bytes: 32 946 645
    • Samples: 50 844
  • Test set:
    • Bytes: 69 869 618
    • Samples: 100 450

Dataset Size

  • Download size: 19 864 349 bytes
  • Dataset size: 332 661 136 bytes
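
The figures above can be cross-checked with a few lines of arithmetic. The sketch below copies the byte and sample counts from the listing and verifies that the per-split bytes add up to the reported dataset size; the variable names are illustrative only. Reading the download size as a compressed on-disk size is an assumption, since the listing does not say how the two sizes differ.

```python
# Figures copied from the Data Split and Dataset Size listings.
SPLIT_BYTES = {
    "train": 229_844_873,
    "validation": 32_946_645,
    "test": 69_869_618,
}
SPLIT_SAMPLES = {
    "train": 348_796,
    "validation": 50_844,
    "test": 100_450,
}
DATASET_SIZE_BYTES = 332_661_136
DOWNLOAD_SIZE_BYTES = 19_864_349

total_bytes = sum(SPLIT_BYTES.values())
total_samples = sum(SPLIT_SAMPLES.values())

# Fraction of all samples that falls into each split.
split_share = {name: n / total_samples for name, n in SPLIT_SAMPLES.items()}

# Ratio of dataset size to download size
# (assumption: the download is a compressed form of the same data).
size_ratio = DATASET_SIZE_BYTES / DOWNLOAD_SIZE_BYTES

if __name__ == "__main__":
    print(f"split bytes sum matches dataset size: {total_bytes == DATASET_SIZE_BYTES}")
    print(f"total samples: {total_samples}")
    for name, share in split_share.items():
        print(f"{name}: {share:.1%} of samples")
    print(f"dataset size / download size: {size_ratio:.1f}")
```

The three splits sum exactly to the reported dataset size, and the sample counts work out to roughly a 70/10/20 train/validation/test division.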

Topics

Logical Reasoning
Automated Theorem Generation

Source

Organization: hugging_face

Created: Unknown
