JUHE API Marketplace
DATASET
Open Source Community

dSprites

dSprites is a 2D shape dataset generated from six basic independent latent factors (color, shape, scale, rotation, x‑position, and y‑position) for evaluating the disentanglement properties of unsupervised learning methods. The dataset contains 737,280 images, each representing a unique combination of these latent factors.

Updated 5/13/2024
github

Description

dSprites - Disentanglement testing Sprites dataset

Dataset Description

dSprites is a dataset for evaluating the disentanglement properties of unsupervised learning methods, containing 2D shape images generated from six basic independent latent factors. These factors include color, shape, scale, rotation, x position, and y position.

Latent Factor Values

  • Color: white
  • Shape: square, ellipse, heart
  • Scale: 6 values linearly spaced between [0.5, 1]
  • Rotation: 40 values ranging from [0, 2π]
  • x position: 32 values ranging from [0, 1]
  • y position: 32 values ranging from [0, 1]

Dataset Structure

  • imgs: 737,280 images, each 64×64 pixels, uint8 format
  • latents_values: 737,280 × 6 floating‑point matrix representing latent factor values
  • latents_classes: 737,280 × 6 integer matrix representing indices of latent factor values
  • metadata: additional information such as possible latent factor values

Dataset Usage

The dataset is used to test the ability of unsupervised models to recover the aforementioned basic latent factors and to evaluate the quality of disentanglement.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Unsupervised Learning
Image Dataset

Source

Organization: github

Created: 5/9/2017

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.