
weibo-comments-v1

The dataset has six fields: id, text, id_labeled, user_nick_name, Comments, and label. It is split into a training set (2,325 samples) and a test set (582 samples). Download size is 810,622 bytes; total size is 1,266,259 bytes.

Source: huggingface
Created: Dec 8, 2024
Updated: Dec 22, 2024
Signals: 953 views
Availability: Linked source ready
Overview


Dataset Information

  • Features:
    • id: data type int64
    • text: data type string
    • id_labeled: data type int64
    • user_nick_name: data type string
    • Comments: data type null
    • label: data type string
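
The feature list above can be mirrored as a typed record. This is a minimal sketch, not part of the dataset's own tooling: the names `WeiboCommentRecord`, `EXPECTED_TYPES`, and `matches_schema` are hypothetical, and it assumes a column of data type null surfaces as `None` in Python.

```python
from typing import Any, Mapping, TypedDict


class WeiboCommentRecord(TypedDict):
    """One row, with field names taken from the feature list."""
    id: int              # int64
    text: str            # string
    id_labeled: int      # int64
    user_nick_name: str  # string
    Comments: None       # null (assumed to appear as None)
    label: str           # string


# Runtime counterpart of the annotations above, used for a simple check.
EXPECTED_TYPES = {
    "id": int,
    "text": str,
    "id_labeled": int,
    "user_nick_name": str,
    "Comments": type(None),
    "label": str,
}


def matches_schema(record: Mapping[str, Any]) -> bool:
    """Return True if the record has exactly the listed fields with matching types."""
    if set(record) != set(EXPECTED_TYPES):
        return False
    return all(isinstance(record[name], kind) for name, kind in EXPECTED_TYPES.items())


# Placeholder values for illustration only; they are not taken from the dataset.
example: WeiboCommentRecord = {
    "id": 1,
    "text": "placeholder text",
    "id_labeled": 1,
    "user_nick_name": "placeholder_user",
    "Comments": None,
    "label": "placeholder_label",
}
```

Calling `matches_schema(example)` on a row before further processing is a cheap way to catch a renamed or mistyped column.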

Dataset Splits

  • Training Set:
    • Sample count: 2,325
    • Bytes: approximately 1,012,746 (reported as 1,012,745.8462332301)
  • Test Set:
    • Sample count: 582
    • Bytes: approximately 253,513 (reported as 253,513.15376676986)

Dataset Size

  • Download Size: 810,622 bytes
  • Total Size: 1,266,259 bytes
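
The split and size figures can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers listed above; the variable names are illustrative. It suggests that the fractional per-split byte counts are the total size divided in proportion to sample counts, which is an inference from the numbers rather than something the card states.

```python
import math

# Figures copied from the listing above.
TRAIN_SAMPLES = 2_325
TEST_SAMPLES = 582
TRAIN_BYTES = 1_012_745.8462332301
TEST_BYTES = 253_513.15376676986
TOTAL_BYTES = 1_266_259
DOWNLOAD_BYTES = 810_622

total_samples = TRAIN_SAMPLES + TEST_SAMPLES

# Share of samples in each split (roughly an 80/20 split).
train_fraction = TRAIN_SAMPLES / total_samples
test_fraction = TEST_SAMPLES / total_samples

# The per-split byte counts add up to the reported total size.
bytes_sum_matches = math.isclose(TRAIN_BYTES + TEST_BYTES, TOTAL_BYTES, abs_tol=1e-3)

# The training byte count equals the total scaled by the training sample share.
proportional_train_bytes = TOTAL_BYTES * train_fraction
bytes_are_proportional = math.isclose(proportional_train_bytes, TRAIN_BYTES, abs_tol=1e-3)

# Download size relative to total size.
download_ratio = DOWNLOAD_BYTES / TOTAL_BYTES

print(f"train share: {train_fraction:.4f}, test share: {test_fraction:.4f}")
print(f"split bytes sum to total: {bytes_sum_matches}")
print(f"split bytes proportional to samples: {bytes_are_proportional}")
print(f"download / total size: {download_ratio:.3f}")
```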

Configuration

  • Config Name: default
    • Data Files:
      • Training: data/train-*
      • Test: data/test-*
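
One way to load the two splits is with the Hugging Face `datasets` library, passing the file patterns from the configuration above. This is a sketch under assumptions: the listing does not give the repository namespace, so `repo_id` is left as a caller-supplied argument, and `DATA_FILES` and `load_weibo_comments` are hypothetical names.

```python
# File patterns copied from the "default" config above.
DATA_FILES = {
    "train": "data/train-*",
    "test": "data/test-*",
}


def load_weibo_comments(repo_id: str):
    """Load the train and test splits from a Hugging Face Hub repository.

    `repo_id` is an assumption: the listing does not state the namespace,
    so the caller must supply the full "<namespace>/<name>" identifier.
    """
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(repo_id, data_files=DATA_FILES)
```

With the real repository identifier, `load_weibo_comments(repo_id)` should return a `DatasetDict` keyed by `"train"` and `"test"`; the sample counts listed earlier (2,325 and 582) give a quick sanity check on the result.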