Back to datasets
Dataset assetOpen Source CommunityNetwork Traffic AnalysisIoT Security
Aposemat IoT-23
A labeled dataset of malicious and benign IoT network traffic created by Avast AIC Lab and funded by Avast Software.
Source
github
Created
Apr 29, 2024
Updated
May 7, 2024
Signals
480 views
Availability
Linked source ready
Overview
Dataset description and usage context
Dataset Overview
Dataset Name and Source
- Dataset Name: Aposemat IoT‑23
- Dataset Source: iot_23_datasets_small
- Dataset Description: Contains labeled malicious and benign IoT network traffic created by Avast AIC Lab and funded by Avast Software.
Dataset Content
- Data Type: Labeled network traffic data
- Data Size: 8.8 GB
- Contents: Only includes labeled traffic data; PCAP files are not provided.
Data Processing and Analysis
Data Processing Stage
-
Data Cleaning and Pre‑processing
- Jupyter Notebook: iot-23-data-cleaning-and-preprocessing.ipynb
- Tasks include data loading, exploration, cleaning, processing, encoding, and storage.
-
Data Training
- Jupyter Notebook: iot-23-data-training.ipynb
- Trains and analyses multiple classification models, including Naive Bayes, K‑Nearest Neighbors, Decision Tree, Random Forest, LinearSVC, Artificial Neural Network (ANN), AdaBoost, and XGBoost.
-
Data Tuning
- Hyper‑parameter tuning for the same set of models using GridSearchCV.
Data Storage
- Data Files: CSV-data
- Trained Models: applied-ML-DL-methods
Dataset Evaluation
- Evaluation Method: Stratified K‑Fold Cross‑Validator (StratifiedKFold)
- Parameters: 5 folds with shuffling enabled.
Need downstream help?
Pair the dataset with AI analysis and content workflows.
Once the source passes your review, move straight into summarization, transformation, report drafting, or presentation generation with the JuheAI toolchain.