Back to datasets
Dataset assetOpen Source CommunityNetwork Traffic AnalysisIoT Security

Aposemat IoT-23

A labeled dataset of malicious and benign IoT network traffic created by Avast AIC Lab and funded by Avast Software.

Source
github
Created
Apr 29, 2024
Updated
May 7, 2024
Signals
480 views
Availability
Linked source ready
Overview

Dataset description and usage context

Dataset Overview

Dataset Name and Source

  • Dataset Name: Aposemat IoT‑23
  • Dataset Source: iot_23_datasets_small
  • Dataset Description: Contains labeled malicious and benign IoT network traffic created by Avast AIC Lab and funded by Avast Software.

Dataset Content

  • Data Type: Labeled network traffic data
  • Data Size: 8.8 GB
  • Contents: Only includes labeled traffic data; PCAP files are not provided.

Data Processing and Analysis

Data Processing Stage
  1. Data Cleaning and Pre‑processing

  2. Data Training

    • Jupyter Notebook: iot-23-data-training.ipynb
    • Trains and analyses multiple classification models, including Naive Bayes, K‑Nearest Neighbors, Decision Tree, Random Forest, LinearSVC, Artificial Neural Network (ANN), AdaBoost, and XGBoost.
  3. Data Tuning

    • Hyper‑parameter tuning for the same set of models using GridSearchCV.
Data Storage

Dataset Evaluation

  • Evaluation Method: Stratified K‑Fold Cross‑Validator (StratifiedKFold)
  • Parameters: 5 folds with shuffling enabled.
Need downstream help?

Pair the dataset with AI analysis and content workflows.

Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.

Explore AI studio