JUHE API Marketplace
DATASET
Open Source Community

cpp_unit_tests_processed_data_chat_format

The dataset primarily contains message content and role information, divided into training and test sets. The training set has 6,603 samples, and the test set has 826 samples. The total download size is 8,009,476 bytes, and the total size is 39,236,589 bytes. The default configuration stores training files under `data/train-*` and test files under `data/test-*`.

Updated 7/26/2024
huggingface

Description

Dataset Overview

Feature Information

  • messages: contains the following sub‑features
    • content: string type
    • role: string type

Data Split

  • train:
    • Bytes: 35,054,702
    • Samples: 6,603
  • test:
    • Bytes: 4,181,887
    • Samples: 826

Dataset Size

  • Download size: 8,009,476 bytes
  • Dataset size: 39,236,589 bytes

Configuration Information

  • default:
    • Data file paths:
      • train: data/train-*
      • test: data/test-*

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

C++ Unit Testing
Natural Language Processing

Source

Organization: huggingface

Created: 7/26/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.