alkzar90/CC6204-Hackaton-Cub-Dataset
CC6204‑Hackaton‑CUB200 is a multimodal dataset for image classification, text classification, and combined multimodal classification. It pairs bird images with descriptive text: each image has ten textual descriptions, and each instance is labeled with its bird species. The dataset provides a training split (5,994 observations) and a test split (5,794 observations), 11,788 observations in total. It derives from the Caltech Vision Lab dataset described in the paper "The Caltech‑UCSD Birds‑200‑2011 Dataset". Creators and contributors include Catherine Wah and Cristóbal Alcázar.
Description
Dataset Overview
Basic Information
- Dataset Name: CC6204‑Hackaton‑CUB200
- License: Apache‑2.0
- Language: English
- Size Category: 10K<n<15K
- Source Dataset: Extension | Other
- Task Categories:
  - Image Classification
  - Text Classification
- Task ID: Multiclass Image Classification
- Paper/Code ID: cub‑200‑2011
Dataset Description
- Homepage: CUB 200 2011
- Repository: Caltech Vision Lab
- Paper: The Caltech‑UCSD Birds‑200‑2011 Dataset
- Contact: Catherine Wah
Data Instances
- Image: RGB image representing a bird
- Description: Ten textual captions per image, separated by newline characters
- Label: Integer representing the bird species ID
- Filename: Image file name
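Because the ten captions arrive as a single newline-separated string, most text pipelines will want to split them first. The following is a minimal sketch of that step. The record below is hypothetical: its key names and caption text are illustrative assumptions, not values taken from the dataset.

```python
from typing import Any, Dict, List


def split_captions(description: str) -> List[str]:
    """Split the newline-separated description field into individual captions."""
    # Skip empty pieces so a trailing newline does not produce an empty caption.
    return [line.strip() for line in description.split("\n") if line.strip()]


# Hypothetical record: key names and values are assumptions for illustration.
record: Dict[str, Any] = {
    "description": "this bird has a red crown\nthe bird has a short black bill\n",
    "label": 0,
    "file_name": "example.jpg",
}

captions = split_captions(record["description"])
print(len(captions), captions[0])
```

On a real instance the same function would be expected to return ten captions, one per line of the description field.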
Data Splits
- Training Set: 5,994 observations
- Test Set: 5,794 observations
Problem Statement
The goal is to train models that classify CUB instances by bird species as accurately as possible. Experiments may explore image‑only, text‑only, or combined multimodal approaches.
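The source does not prescribe how to combine the two modalities. One simple possibility, shown here only as a sketch, is late fusion: train an image model and a text model separately, then take a weighted average of their per-class probabilities. The function names, the `image_weight` parameter, and the toy probabilities are all assumptions made for illustration.

```python
from typing import List, Sequence


def late_fusion(image_probs: Sequence[float], text_probs: Sequence[float],
                image_weight: float = 0.5) -> List[float]:
    """Weighted average of two per-class probability vectors."""
    if len(image_probs) != len(text_probs):
        raise ValueError("both models must score the same set of classes")
    if not 0.0 <= image_weight <= 1.0:
        raise ValueError("image_weight must lie in [0, 1]")
    text_weight = 1.0 - image_weight
    return [image_weight * p + text_weight * q
            for p, q in zip(image_probs, text_probs)]


def predict(probs: Sequence[float]) -> int:
    """Return the index of the highest-scoring class."""
    return max(range(len(probs)), key=lambda i: probs[i])


# Toy three-class example (illustrative numbers, not model outputs).
image_probs = [0.6, 0.3, 0.1]
text_probs = [0.1, 0.7, 0.2]
fused = late_fusion(image_probs, text_probs)
print(predict(fused))
```

Setting `image_weight` to 1.0 or 0.0 recovers the image‑only or text‑only prediction, which makes the three settings easy to compare under one evaluation loop.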
Experimental Strategy
Given limited compute resources, a few‑shot strategy is recommended, e.g., reducing per‑class samples or limiting the number of classes.
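Both reductions mentioned above (fewer samples per class, fewer classes) can be expressed as index selection over the label column. The sketch below assumes the labels are available as a plain sequence of integers; the helper name `few_shot_indices` and its parameters are hypothetical.

```python
import random
from collections import defaultdict
from typing import Dict, List, Optional, Sequence


def few_shot_indices(labels: Sequence[int], shots_per_class: int,
                     num_classes: Optional[int] = None,
                     seed: int = 0) -> List[int]:
    """Pick at most `shots_per_class` example indices for each kept class.

    If `num_classes` is given, only that many randomly chosen classes are kept.
    """
    rng = random.Random(seed)  # local generator so the selection is reproducible

    # Group example indices by their class label.
    by_class: Dict[int, List[int]] = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    classes = sorted(by_class)
    if num_classes is not None:
        classes = rng.sample(classes, min(num_classes, len(classes)))

    selected: List[int] = []
    for cls in classes:
        pool = by_class[cls]
        selected.extend(rng.sample(pool, min(shots_per_class, len(pool))))
    return sorted(selected)


# Toy labels for illustration only.
toy_labels = [0, 0, 0, 1, 1, 2, 2, 2, 2]
subset = few_shot_indices(toy_labels, shots_per_class=2, num_classes=2)
print(subset)
```

The returned indices can then be used to subset the training split; the test split should stay untouched if results are to remain comparable, unless the number of classes is also restricted there.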
Evaluation Metric
Accuracy on the test set.
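Accuracy here is the fraction of test instances whose predicted species ID matches the true label. A minimal sketch, assuming predictions and labels are integer sequences of equal length (the function name is illustrative):

```python
from typing import Sequence


def accuracy(predictions: Sequence[int], targets: Sequence[int]) -> float:
    """Fraction of positions where the prediction equals the target label."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if len(targets) == 0:
        raise ValueError("cannot compute accuracy on an empty set")
    correct = sum(1 for p, t in zip(predictions, targets) if p == t)
    return correct / len(targets)


# Three of the four toy predictions match their targets.
print(accuracy([3, 1, 2, 2], [3, 1, 0, 2]))  # 0.75
```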
Citation Information
@techreport{WahCUB_200_2011,
  Title = {The Caltech-UCSD Birds-200-2011 Dataset},
  Author = {Wah, C. and Branson, S. and Welinder, P. and Perona, P. and Belongie, S.},
  Year = {2011},
  Institution = {California Institute of Technology},
  Number = {CNS-TR-2011-001}
}
Source
Organization: hugging_face