DATASET
Open Source Community
cpp_unit_tests_processed_data_chat_format
The dataset primarily contains message content and role information, divided into training and test sets. The training set has 6,603 samples, and the test set has 826 samples. The total download size is 8,009,476 bytes, and the total size is 39,236,589 bytes. The default configuration stores training files under `data/train-*` and test files under `data/test-*`.
Updated 7/26/2024
huggingface
Description
Dataset Overview
Feature Information
- messages: contains the following sub‑features
- content: string type
- role: string type
Data Split
- train:
- Bytes: 35,054,702
- Samples: 6,603
- test:
- Bytes: 4,181,887
- Samples: 826
Dataset Size
- Download size: 8,009,476 bytes
- Dataset size: 39,236,589 bytes
Configuration Information
- default:
- Data file paths:
- train: data/train-*
- test: data/test-*
- Data file paths:
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
C++ Unit Testing
Natural Language Processing
Source
Organization: huggingface
Created: 7/26/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.