Back to datasets
Dataset assetOpen Source CommunitySocial Network AnalysisPersonal Profiles

profiles_dataset_10000

The dataset contains personal information such as name, date of birth, birth city, educational background, work information, and social relationships (parents, children, best friend, worst enemy). It is divided into a training set with 10,000 samples, totaling 2,137,152 bytes.

Source
huggingface
Created
Nov 5, 2024
Updated
Nov 5, 2024
Signals
182 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Information

  • Dataset Name: profiles_dataset_10000
  • Dataset Size: 2,137,152 bytes
  • Download Size: 1,257,809 bytes

Data Structure

  • Features:
    • name: string
    • index: 32‑bit integer
    • birth_date: timestamp (seconds)
    • birth_city: string
    • university: string
    • employer: string
    • parent: struct
      • name: string
      • index: 32‑bit integer
    • child: struct
      • name: string
      • index: 32‑bit integer
    • best_friend: struct
      • name: string
      • index: 32‑bit integer
    • worst_enemy: struct
      • name: string
      • index: 32‑bit integer
    • bio: string

Data Split

  • train:
    • Number of Samples: 10,000
    • Data Size: 2,137,152 bytes

Configuration

  • Configuration Name: default
    • Data File Path: data/train-*
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio