JUHE API Marketplace
DATASET
Open Source Community

alkzar90/CC6204-Hackaton-Cub-Dataset

CC6204‑Hackaton‑CUB200 is a multimodal dataset for image‑classification and text‑classification tasks, especially suitable for multimodal classification problems. It contains bird images and descriptive texts; each image has ten textual descriptions, and each instance is labeled with the bird species. The dataset provides training (5,994 observations) and test (5,794 observations) splits. It originates from the Caltech Vision Lab; the associated paper is "The Caltech‑UCSD Birds‑200‑2011 Dataset". Creators and contributors include Catherine Wah and Cristóbal Alcázar.

Updated 1/12/2023
hugging_face

Description

Dataset Overview

Basic Information

  • Dataset Name: CC6204‑Hackaton‑CUB200
  • License: Apache‑2.0
  • Language: English
  • Size Category: 10K<n<15K
  • Source Dataset: Extension | Other
  • Task Categories:
    • Image Classification
    • Text Classification
  • Task ID: Multiclass Image Classification
  • Paper/Code ID: cub‑200‑2011

Dataset Description

Data Instances

  • Image: RGB image representing a bird
  • Description: Ten textual captions per image, separated by newline characters
  • Label: Integer representing the bird species ID
  • Filename: Image file name

Data Splits

  • Training Set: 5,994 observations
  • Test Set: 5,794 observations

Problem Statement

The goal is to train models to achieve optimal classification of CUB instances. Experiments may explore image‑only, text‑only, or combined multimodal approaches.

Experimental Strategy

Given limited compute resources, a few‑shot strategy is recommended, e.g., reducing per‑class samples or limiting the number of classes.

Evaluation Metric

Accuracy on the test set.

Citation Information

@techreport{WahCUB_200_2011,
	Title = {The Caltech‑UCSD Birds‑200‑2011 Dataset},
	Author = {Wah, C. and Branson, S. and Welinder, P. and Perona, P. and Belongie, S.},
	Year = {2011},
	Institution = {California Institute of Technology},
	Number = {CNS‑TR‑2011‑001}
}

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Deep Learning
Machine Learning

Source

Organization: hugging_face

Created: Unknown

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.