Dataset Hub

Office-31, Office-Home, VisDA-2017, DomainNet

Domain Adaptation

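Office-31 consists of 31 office-object categories across three domains (Amazon, DSLR, Webcam). Office-Home contains 65 everyday-object categories across four domains (Art, Clipart, Product, Real-World). VisDA-2017 is a synthetic-to-real visual domain adaptation benchmark with 12 object categories. DomainNet is a large-scale multi-domain image dataset with 345 categories across six domains.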

MinDat-Mineral-Image-Dataset

Mineral Recognition

A dataset of over 500,000 labeled mineral images sourced from mindat.org. It is distributed as two CSV files that store the image URLs and the cleaned label information.
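Because the images are referenced by URL in CSV form, a typical first step is to pair each URL with its label. The sketch below is illustrative only: it assumes one file holds the URLs and the other the labels, and the column names `image_id`, `url`, and `label` are hypothetical, so adjust them to the actual headers in the dataset.

```python
import csv
import io


def join_urls_and_labels(urls_file, labels_file, key="image_id"):
    """Pair each image URL with its label using a shared key column.

    All column names here (key, "url", "label") are hypothetical placeholders.
    Rows whose key has no matching label are skipped.
    """
    labels = {row[key]: row["label"] for row in csv.DictReader(labels_file)}
    pairs = []
    for row in csv.DictReader(urls_file):
        if row[key] in labels:
            pairs.append((row["url"], labels[row[key]]))
    return pairs


# Tiny in-memory stand-ins for the two CSV files (made-up rows, not real data).
urls_csv = io.StringIO(
    "image_id,url\n"
    "1,https://example.org/a.jpg\n"
    "2,https://example.org/b.jpg\n"
    "3,https://example.org/c.jpg\n"
)
labels_csv = io.StringIO("image_id,label\n1,quartz\n2,calcite\n")

pairs = join_urls_and_labels(urls_csv, labels_csv)
```

With real files, pass handles opened with `open(path, newline="", encoding="utf-8")` in place of the `io.StringIO` objects.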

IMDb-Face, MegaFace

Face Recognition

IMDb-Face is a face recognition dataset of facial images gathered from IMDb. MegaFace is a large-scale face recognition benchmark comprising multiple subsets for different recognition tasks.

dSprites

Unsupervised Learning

dSprites is a dataset of 2D shapes generated from six independent latent factors (color, shape, scale, rotation, x-position, and y-position), used to evaluate how well unsupervised learning methods disentangle those factors. Color takes a single value (white), so only the other five factors vary. The dataset contains 737,280 images, one for each unique combination of the latent factors.
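The image count follows directly from the number of values each factor takes. The sketch below reproduces that arithmetic and maps a factor combination to a flat image index. The per-factor sizes are taken from the public dSprites release rather than from the description above, and the row-major ordering (color varying slowest, y-position fastest) is an assumption that should be checked against the file's metadata before use.

```python
from math import prod

# Values per latent factor (assumed from the public dSprites release).
FACTOR_SIZES = {
    "color": 1,        # always white
    "shape": 3,        # square, ellipse, heart
    "scale": 6,
    "rotation": 40,
    "x_position": 32,
    "y_position": 32,
}


def total_images(sizes=FACTOR_SIZES):
    """Number of unique factor combinations, i.e. the dataset size."""
    return prod(sizes.values())


def latents_to_index(latents, sizes=FACTOR_SIZES):
    """Map one value index per factor to a flat image index.

    Assumes row-major ordering over the factors in the order listed above.
    """
    index = 0
    for name, size in sizes.items():
        value = latents[name]
        if not 0 <= value < size:
            raise ValueError(f"{name} index {value} out of range [0, {size})")
        index = index * size + value
    return index


first = {name: 0 for name in FACTOR_SIZES}
last = {name: size - 1 for name, size in FACTOR_SIZES.items()}
```

Here `total_images()` evaluates to 1 × 3 × 6 × 40 × 32 × 32 = 737,280, and `latents_to_index` sends `first` to 0 and `last` to 737,279, so every combination gets exactly one index.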