Back to datasets
Dataset assetOpen Source CommunityMachine LearningDeep Learning

alkzar90/CC6204-Hackaton-Cub-Dataset

CC6204‑Hackaton‑CUB200 is a multimodal dataset for image‑classification and text‑classification tasks, especially suitable for multimodal classification problems. It contains bird images and descriptive texts; each image has ten textual descriptions, and each instance is labeled with the bird species. The dataset provides training (5,994 observations) and test (5,794 observations) splits. It originates from the Caltech Vision Lab; the associated paper is "The Caltech‑UCSD Birds‑200‑2011 Dataset". Creators and contributors include Catherine Wah and Cristóbal Alcázar.

Source
hugging_face
Created
Nov 28, 2025
Updated
Jan 12, 2023
Signals
175 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Basic Information

  • Dataset Name: CC6204‑Hackaton‑CUB200
  • License: Apache‑2.0
  • Language: English
  • Size Category: 10K<n<15K
  • Source Dataset: Extension | Other
  • Task Categories:
    • Image Classification
    • Text Classification
  • Task ID: Multiclass Image Classification
  • Paper/Code ID: cub‑200‑2011

Dataset Description

Data Instances

  • Image: RGB image representing a bird
  • Description: Ten textual captions per image, separated by newline characters
  • Label: Integer representing the bird species ID
  • Filename: Image file name

Data Splits

  • Training Set: 5,994 observations
  • Test Set: 5,794 observations

Problem Statement

The goal is to train models to achieve optimal classification of CUB instances. Experiments may explore image‑only, text‑only, or combined multimodal approaches.

Experimental Strategy

Given limited compute resources, a few‑shot strategy is recommended, e.g., reducing per‑class samples or limiting the number of classes.

Evaluation Metric

Accuracy on the test set.

Citation Information

@techreport{WahCUB_200_2011,
	Title = {The Caltech‑UCSD Birds‑200‑2011 Dataset},
	Author = {Wah, C. and Branson, S. and Welinder, P. and Perona, P. and Belongie, S.},
	Year = {2011},
	Institution = {California Institute of Technology},
	Number = {CNS‑TR‑2011‑001}
}
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio