DATASET
Open Source Community
mbpp
The dataset comprises four features: instance_id (integer), prompt (string), canonical_solution (string), and test (string). It is divided into four parts: training set (train), test set (test), validation set (validation), and prompt set (prompt). Each part has corresponding file paths and sample counts. The total download size is 228,122 bytes, and the total dataset size is 500,198 bytes.
Updated 12/8/2024
huggingface
Description
MBPP Dataset Overview
Dataset Information
Features
- instance_id: data type
int32 - prompt: data type
string - canonical_solution: data type
string - test: data type
string
Data Splits
- train: contains 374 samples, occupying 189,426 bytes
- test: contains 500 samples, occupying 260,317 bytes
- validation: contains 90 samples, occupying 45,555 bytes
- prompt: contains 10 samples, occupying 4,900 bytes
Dataset Size
- Download Size: 228,122 bytes
- Total Size: 500,198 bytes
Configuration
- config_name: default
- data_files:
- train:
data/train-* - test:
data/test-* - validation:
data/validation-* - prompt:
data/prompt-*
- train:
- data_files:
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Programming Education
Code Generation
Source
Organization: huggingface
Created: 12/4/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.