Back to datasets
Dataset assetOpen Source CommunityMachine LearningMBTI Personality Analysis

PandaVT/Machine_Mindset_MBTI_dataset

This dataset is for supervised fine‑tuning (SFT) and direct preference optimization (DPO), available in English and Chinese versions. It is based on the four MBTI dimensions, each with two opposing attributes: energy (Extraversion E – Introversion I), information (Sensing S – Intuition N), decision (Thinking T – Feeling F), and execution (Judging J – Perceiving P). The dataset follows the Alpaca format, containing instruction, input, and output. Users can select the appropriate file for SFT or DPO based on the MBTI type.

Source
hugging_face
Created
Nov 28, 2025
Updated
Jun 4, 2024
Signals
123 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Introduction

This dataset is for supervised fine‑tuning (SFT) and direct preference optimization (DPO). It is available in English (prefix en) and Chinese (prefix zh).

MBTI Dimensions

The dataset is based on the four MBTI dimensions, each containing two opposing attributes:

  • Energy: Extraversion (E) – Introversion (I)
  • Information: Sensing (S) – Intuition (N)
  • Decision: Thinking (T) – Feeling (F)
  • Execution: Judging (J) – Perceiving (P)

Data Format

The dataset follows the Alpaca format, containing instruction, input, and output.

Usage

Supervised Fine‑Tuning (SFT)

For example, to endow an LLM with ISFJ traits, select the following four corresponding files for SFT:

  • en_energe_introversion.json
  • en_information_sensing.json
  • en_decision_feeling.json
  • en_execution_judging.json

Direct Preference Optimization (DPO)

For instance, to bias an LLM toward Feeling (F) rather than Thinking (T) via DPO, select the following two corresponding files:

  • en_decision_feeling.json
  • en_decision_thinking.json

Then compile them into the correct DPO format.

Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio