DATASET
Open Source Community
Flickr30K, COCO
Flickr30K is a multimodal dataset containing images and text, used for training and validating algorithms that mine similarity between images and text. COCO is a large, rich image dataset primarily used for object detection, segmentation, and image captioning tasks.
Updated 6/30/2024
github
Description
LoRS: Low‑Rank Similarity Mining
Datasets
- Flickr30K
- COCO
Dataset Storage Structure
./distill_utils/data/
├── Flickr30k/
│   ├── flickr30k-images/
│   │   ├── 1234.jpg
│   │   └── ......
│   ├── results_20130124.token
│   └── readme.txt
└── COCO/
    ├── train2014/
    ├── val2014/
    └── test2014/
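Before training, it can help to confirm that the data directory matches the storage structure listed above. The following is a minimal sketch, not part of the LoRS code: the function name `missing_entries` and the two path lists are hypothetical, and it assumes `readme.txt` is optional, so only the image folders and the caption file are checked.

```python
from pathlib import Path

# Relative paths copied from the storage structure listed above.
# Treating readme.txt as optional is an assumption of this sketch.
EXPECTED_DIRS = [
    "Flickr30k/flickr30k-images",
    "COCO/train2014",
    "COCO/val2014",
    "COCO/test2014",
]
EXPECTED_FILES = [
    "Flickr30k/results_20130124.token",
]


def missing_entries(root):
    """Return the expected relative paths that are absent under root."""
    root = Path(root)
    missing = [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
    missing += [f for f in EXPECTED_FILES if not (root / f).is_file()]
    return missing


if __name__ == "__main__":
    absent = missing_entries("./distill_utils/data")
    if absent:
        print("Missing:", ", ".join(absent))
    else:
        print("Dataset layout looks complete.")
```

Run the script from the repository root so that `./distill_utils/data` resolves correctly. An empty result means every listed folder and the caption file were found.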
Topics
Multimodal Learning
Computer Vision
Source
Organization: github
Created: 6/7/2024