SnapGarden_v0.6
SnapGarden v0.6 is a dataset containing 1,000 images of 25 plant species; each image has been augmented five times and is accompanied by a question‑answer style description, intended to help AI learn how to describe plants. The dataset is split into training, validation, and test sets, suitable for image captioning, plant recognition, and educational content development. It is released under the MIT license, emphasizing respect for original image owners and copyright.
Description
Dataset Overview: SnapGarden v0.6
License
- License: MIT
Dataset Information
- Features:
- Name: image
- Type: image
- Name: text
- Type: string
- Name: image
Splits
- Training Set:
- Byte Size: 761,694,844.68
- Number of Samples: 4,367
- Test Set:
- Byte Size: 190,467,316.32
- Number of Samples: 1,092
Download Size
- 951,870,561 bytes
Total Dataset Size
- 952,162,161 bytes
Configuration
- Configuration Name: default
- Training Data Path:
data/train-* - Test Data Path:
data/test-*
- Training Data Path:
Content
- Images: 1,000 images of 25 plant species, each augmented five times.
- Descriptions: Q&A‑style texts to help AI learn how a horticulturist would describe plants.
Usage
- Load with Hugging Face datasets library:
from datasets import load_dataset
dataset = load_dataset("Baran657/SnapGarden_v0.6")
Legal Statement
- Image copyrights belong to the original owners; the dataset is intended for AI research.
Potential Uses
- Image Captioning: Enable machines to describe plants like a natural guide.
- Plant Recognition: Test AI’s ability to differentiate between plant species.
- Educational Content: Build resources for gardening enthusiasts and nature lovers.
License
- The dataset follows the MIT license; redistribution, remixing, and use require appropriate attribution.
Acknowledgements
- Thanks to all photographers and creators of the plant images whose contributions made this dataset possible.
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Please login to view download links and access full dataset details.
Topics
Source
Organization: huggingface
Created: 1/29/2025
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.