dSprites
dSprites is a 2D shape dataset generated from six basic independent latent factors (color, shape, scale, rotation, x‑position, and y‑position) for evaluating the disentanglement properties of unsupervised learning methods. The dataset contains 737,280 images, each representing a unique combination of these latent factors.
Description
dSprites - Disentanglement testing Sprites dataset
Dataset Description
dSprites is a dataset for evaluating the disentanglement properties of unsupervised learning methods, containing 2D shape images generated from six basic independent latent factors. These factors include color, shape, scale, rotation, x position, and y position.
Latent Factor Values
- Color: white
- Shape: square, ellipse, heart
- Scale: 6 values linearly spaced between [0.5, 1]
- Rotation: 40 values ranging from [0, 2π]
- x position: 32 values ranging from [0, 1]
- y position: 32 values ranging from [0, 1]
Dataset Structure
imgs: 737,280 images, each 64×64 pixels, uint8 formatlatents_values: 737,280 × 6 floating‑point matrix representing latent factor valueslatents_classes: 737,280 × 6 integer matrix representing indices of latent factor valuesmetadata: additional information such as possible latent factor values
Dataset Usage
The dataset is used to test the ability of unsupervised models to recover the aforementioned basic latent factors and to evaluate the quality of disentanglement.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: github
Created: 5/9/2017
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.