Explore high-quality datasets for your AI and machine learning projects.
SnapGarden v0.6 is a dataset containing 1,000 images of 25 plant species; each image has been augmented five times and is accompanied by a question‑answer style description, intended to help AI learn how to describe plants. The dataset is split into training, validation, and test sets, suitable for image captioning, plant recognition, and educational content development. It is released under the MIT license, emphasizing respect for original image owners and copyright.
COCO is a large-scale dataset for object detection, segmentation, and captioning, primarily used for image-to-text tasks. The dataset provides English captions, each image being associated with multiple textual descriptions. Detailed information about dataset creation, annotation processes, or social impact is not supplied.