Back to datasets
Dataset assetOpen Source CommunityPlant IdentificationImage Captioning

SnapGarden_v0.6

SnapGarden v0.6 is a dataset containing 1,000 images of 25 plant species; each image has been augmented five times and is accompanied by a question‑answer style description, intended to help AI learn how to describe plants. The dataset is split into training, validation, and test sets, suitable for image captioning, plant recognition, and educational content development. It is released under the MIT license, emphasizing respect for original image owners and copyright.

Source
huggingface
Created
Jan 29, 2025
Updated
Jan 30, 2025
Signals
254 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview: SnapGarden v0.6

License

  • License: MIT

Dataset Information

  • Features:
    • Name: image
      • Type: image
    • Name: text
      • Type: string

Splits

  • Training Set:
    • Byte Size: 761,694,844.68
    • Number of Samples: 4,367
  • Test Set:
    • Byte Size: 190,467,316.32
    • Number of Samples: 1,092

Download Size

  • 951,870,561 bytes

Total Dataset Size

  • 952,162,161 bytes

Configuration

  • Configuration Name: default
    • Training Data Path: data/train-*
    • Test Data Path: data/test-*

Content

  • Images: 1,000 images of 25 plant species, each augmented five times.
  • Descriptions: Q&A‑style texts to help AI learn how a horticulturist would describe plants.

Usage

  • Load with Hugging Face datasets library:
from datasets import load_dataset
dataset = load_dataset("Baran657/SnapGarden_v0.6")

Legal Statement

  • Image copyrights belong to the original owners; the dataset is intended for AI research.

Potential Uses

  • Image Captioning: Enable machines to describe plants like a natural guide.
  • Plant Recognition: Test AI’s ability to differentiate between plant species.
  • Educational Content: Build resources for gardening enthusiasts and nature lovers.

License

  • The dataset follows the MIT license; redistribution, remixing, and use require appropriate attribution.

Acknowledgements

  • Thanks to all photographers and creators of the plant images whose contributions made this dataset possible.
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio