JUHE API Marketplace
DATASET
Open Source Community

SH17

The SH17 dataset was created by the Department of Mechanical, Automotive and Materials Engineering at the University of Windsor, containing 8,099 annotated images covering 17 categories of personal protective equipment (PPE) such as helmets and safety glasses. Collected from diverse industrial settings, the dataset aims to enhance worker safety in manufacturing through object detection and convolutional neural network techniques. Images were sourced via the Pexels website and annotated by professionals to ensure quality and diversity. The dataset is primarily used to train and validate object detection models for PPE compliance in industrial environments.

Updated 7/5/2024
arXiv

Description

SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry

Dataset Overview

Abstract

Workplace accidents pose significant risks to human safety, especially in construction and manufacturing sectors where PPE compliance is increasingly essential. This study leverages object detection (OD) and convolutional neural network (CNN) techniques to develop non‑intrusive technologies for detecting and verifying the correct use of various PPE (e.g., helmets, safety glasses, masks, and protective clothing). We introduce the SH17 dataset, comprising 8,099 annotated images with 75,994 instances across 17 PPE categories and body parts, collected from diverse industrial environments. We trained state‑of‑the‑art OD models for benchmarking; preliminary results show the YOLOv9‑e variant exceeding 70.9% accuracy in PPE detection. Cross‑dataset validation indicates that integrating these technologies can markedly improve safety management systems, offering scalable and efficient solutions for industries striving to meet safety regulations and protect their workforce.

Dataset Details

  • Image Count: 8,099 annotated images
  • Instance Count: 75,994 object instances
  • Category Count: 17 PPE items and body parts

Key Features

  • Collected from globally diverse industrial environments
  • High‑quality images (max resolution 8192×5462, min 1920×1002)
  • Average of 9.38 instances per image
  • Includes small objects such as ears and earmuffs (39,764 annotations < 1% image area, 59,025 annotations < 5% area)

Categories

  1. Person
  2. Head
  3. Face
  4. Glasses
  5. Face‑mask‑medical
  6. Face‑guard
  7. Ear
  8. Earmuffs
  9. Hands
  10. Gloves
  11. Foot
  12. Shoes
  13. Safety‑vest
  14. Tools
  15. Helmet
  16. Medical‑suit
  17. Safety‑suit

Models

We used the ultralytics==8.0.38 repository to train all YOLO models. Training weights for YOLO v8, v9, and v10 object detection models are provided for evaluation and benchmarking.

YOLO v8, v9, v10 Results

ModelParams (M)ImagesInstancesPrecision (P)Recall (R)mAP@50mAP@50‑95
Yolo‑8‑n3.216201535867.553.658.036.6
Yolo‑8‑s11.216201535881.555.763.741.7
(remaining rows omitted for brevity)

Usage

from ultralytics import YOLO

model = YOLO(r"path\to\weight.pt")  # Provide path to trained model
results = model("path/to/image.jpg")  # Run inference
results.show()  # Visualize results

Refer to the ultralytics documentation for further details.

Evaluation

Evaluation code is provided to reproduce our benchmark results on the SH17 validation set.

from ultralytics import YOLO

model = YOLO(r"path\to\weight.pt")
model.val(data="sh17.yaml", batch=1, imgsz=640, device="cuda:0")

Citation

If you use this dataset or code in your research, please cite our paper:

@article{ahmad_2024_sh17,
  title={SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry},
  author={Ahmad, Hafiz Mughees and Rahimi, Afshin},
  journal={arXiv},
  year={2024}
}

License

The SH17 dataset is released under the CC BY‑NC‑SA 4.0 license.

AI studio

Generate PPTs instantly with Nano Banana Pro.

Generate PPT Now

Access Dataset

Login to Access

Please login to view download links and access full dataset details.

Topics

Personal Protective Equipment
Object Detection

Source

Organization: arXiv

Created: 7/5/2024

Power Your Data Analysis with Premium AI Models

Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.

Enjoy a free trial and save 20%+ compared to official pricing.