JUHE API Marketplace
DATASET
Open Source Community

NemoSheng/codefuse_fc_v1_sharegpt

The dataset contains dialogues and tool information, primarily for training and testing models. Dialogue information is stored as a list, each dialogue having a source and content field. Tool information is stored as a string. The dataset is split into training and test sets, with 72,032 training examples and 1,250 test examples. Download size 193,720,278 bytes, total size 1,002,393,963 bytes.

Updated 7/18/2024
hugging_face

Description

Dataset Overview

Data Features

  • conversations:
    • from: string type
    • value: string type
  • tools: string type

Data Splits

  • train:
    • Bytes: 999,501,804.0
    • Samples: 72,032
  • test:
    • Bytes: 2,892,159.0
    • Samples: 1,250

Dataset Size

  • Download size: 193,720,278
  • Total size: 1,002,393,963.0

Configuration

  • default:
    • train: data file path data/train-*
    • test: data file path data/test-*

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Dialogue Data
Chatbot

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.