DATASET
Open Source Community
CICIDS2018
The dataset comprises labeled network traffic data, encompassing various attacks (e.g., DoS, brute‑force, SQL injection, botnet) and normal traffic.
Updated 10/3/2024
github
Description
Dataset Overview
Dataset Information
Dataset Name
- CICIDS2018 Dataset
Dataset Description
- Description: This dataset contains labeled network traffic data covering multiple attack types (e.g., DoS, brute‑force, SQL injection, botnet) and normal traffic.
- Link: The dataset can be downloaded here.
- Size: Large dataset split into multiple CSV files, total size exceeds several hundred MB.
Dataset Usage
- Training Data:
dataset/train_data.csv - Test Data:
dataset/test.csv - Training Data Version:
artifacts/train_data.csv
Dataset Processing
Data Ingestion
- Script:
src/components/data_ingestion.py
Data Transformation
- Script:
src/components/data_transformation.py
Model Training
- Script:
src/components/model_trainer.py
Model Performance
Test Accuracy
- Test Accuracy: 89.75%
- Training Accuracy: 89.87%
F1 Score
- Test F1 Score: 88.27%
- Training F1 Score: 88.40%
Recall
- Test Recall: 89.75%
- Training Recall: 89.87%
Precision
- Test Precision: 89.08%
- Training Precision: 89.31%
Balanced Accuracy
- Balanced Accuracy: 86.55%
ROC AUC
- Test ROC AUC: 99.17%
- Training ROC AUC: 99.21%
AI studio
Generate PPTs instantly with Nano Banana Pro.
Generate PPT NowAccess Dataset
Login to Access
Please login to view download links and access full dataset details.
Topics
Cybersecurity
Network Intrusion Detection
Source
Organization: github
Created: 10/2/2024
Power Your Data Analysis with Premium AI Models
Supporting GPT-5, Claude-4, DeepSeek v3, Gemini and more.
Enjoy a free trial and save 20%+ compared to official pricing.