JUHE API Marketplace
DATASET
Open Source Community

SnapGarden_v0.6

SnapGarden v0.6 is a dataset containing 1,000 images of 25 plant species; each image has been augmented five times and is accompanied by a question‑answer style description, intended to help AI learn how to describe plants. The dataset is split into training, validation, and test sets, suitable for image captioning, plant recognition, and educational content development. It is released under the MIT license, emphasizing respect for original image owners and copyright.

Updated 1/30/2025
huggingface

Description

Dataset Overview: SnapGarden v0.6

License

  • License: MIT

Dataset Information

  • Features:
    • Name: image
      • Type: image
    • Name: text
      • Type: string

Splits

  • Training Set:
    • Byte Size: 761,694,844.68
    • Number of Samples: 4,367
  • Test Set:
    • Byte Size: 190,467,316.32
    • Number of Samples: 1,092

Download Size

  • 951,870,561 bytes

Total Dataset Size

  • 952,162,161 bytes

Configuration

  • Configuration Name: default
    • Training Data Path: data/train-*
    • Test Data Path: data/test-*

Content

  • Images: 1,000 images of 25 plant species, each augmented five times.
  • Descriptions: Q&A‑style texts to help AI learn how a horticulturist would describe plants.

Usage

  • Load with Hugging Face datasets library:
from datasets import load_dataset
dataset = load_dataset("Baran657/SnapGarden_v0.6")

Legal Statement

  • Image copyrights belong to the original owners; the dataset is intended for AI research.

Potential Uses

  • Image Captioning: Enable machines to describe plants like a natural guide.
  • Plant Recognition: Test AI’s ability to differentiate between plant species.
  • Educational Content: Build resources for gardening enthusiasts and nature lovers.

License

  • The dataset follows the MIT license; redistribution, remixing, and use require appropriate attribution.

Acknowledgements

  • Thanks to all photographers and creators of the plant images whose contributions made this dataset possible.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Plant Identification
Image Captioning

Source

Organization: huggingface

Created: 1/29/2025

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.